Commit Graph

336 Commits (4ee1c9e60d278a5172c18549bfebbbe533fdfade)

Author SHA1 Message Date
dzhwinter 4ee1c9e60d "add sequence expand kernel"
7 years ago
emailweixu b3f076a6e4
Merge pull request #9168 from emailweixu/fix_compile
7 years ago
chengduo 597ba3f3f2 add more times close test (#9215)
7 years ago
yangyaming 869a6f9cea Add python wrapper.
7 years ago
yangyaming ea788fc5df Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-9171
7 years ago
Liu Yiqun 961151f17a Disable the link flags on Mac.
7 years ago
Xin Pan 898e0ffa21
Merge pull request #9190 from panyx0718/p2p
7 years ago
Xin Pan ce55975bb5 fix
7 years ago
Xin Pan 18ac6947d0 Enable P2P memory copy
7 years ago
yangyaming 332b665fc7 Enhanced cpp implementation and unit test.
7 years ago
Tao Luo c068d9c19e
Merge pull request #9065 from Xreki/core_inference_shared_library
7 years ago
Tao Luo c0421379b7
Merge pull request #9043 from Xreki/core_inference_remove_clone
7 years ago
Kexin Zhao df99b16a16
Merge pull request #9167 from kexinzhao/pool2d_fp16
7 years ago
xuwei06 ab3543e35e Fix compilation for gcc5.4
7 years ago
Kexin Zhao 8ebfc153dd update
7 years ago
Kexin Zhao 3f5705c346
Merge pull request #9148 from kexinzhao/cast_op_fp16
7 years ago
Kexin Zhao bfbc25bdb8 add fp16 pool2d support
7 years ago
Liu Yiqun 371c53f88c Add profiling event in feed, fetch and load op.
7 years ago
Liu Yiqun 253ba6672f Merge branch 'develop' into core_inference_remove_clone
7 years ago
qiaolongfei a39c861530 rm unused private field in profiler
7 years ago
Kexin Zhao 8e7310146f
Merge pull request #9143 from kexinzhao/numpy_conv2d_pool2d_fp16
7 years ago
Kexin Zhao f3c5e81556 add fp16 for cast op
7 years ago
Xin Pan 21e2c42a46
Merge pull request #9141 from panyx0718/develop
7 years ago
Tao Luo a448fbe9e1
Merge pull request #9134 from putcn/fix-selected-row-dep
7 years ago
Tao Luo 20be8e7e33
Merge pull request #9104 from ranqiu92/doc_dir
7 years ago
Xin Pan 1ca1e1c384 Fix a program copy regression.
7 years ago
qingqing01 7c1a0b77a0
Delete the detection_output_op, which had been split into several operators. (#9121)
7 years ago
Kexin Zhao e967d19b0a add more tests
7 years ago
Kexin Zhao a13ec3432a fix test error
7 years ago
Kexin Zhao e4de5dc347 add conv2d fp16 support
7 years ago
Xi Chen d20c6eb6de add math_function to selected_rows_functor dependency list
7 years ago
qingqing01 1cd700d8e8
Fix bug in LRN operator. (#9124)
7 years ago
ranqiu 64775126f3 change the dir of docs
7 years ago
qingqing01 b5a16dca20
Fix a critical bug in softmax_with_cross_entropy_op backward. (#9120)
7 years ago
Thuan Nguyen 1e4c504e60 Implement Select OP (#9088)
7 years ago
qingqing01 45073b7c39
Always synchronize when copy data on GPU from C++ to Numpy array. (#9110)
7 years ago
Xin Pan d284cf88e5
Merge pull request #9037 from panyx0718/develop
7 years ago
dzhwinter 128adf53cb
[Speed]implement cudnn sequence softmax cudnn (#8978)
7 years ago
Kexin Zhao e26f1123da
Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017)
7 years ago
dzhwinter 7140071152
"exported scatter to python" (#9038)
7 years ago
chengduo 11c43e5da3
Merge pull request #9072 from chengduoZH/feature/refine_parallel_do
7 years ago
Abhinav Arora 41894da145
Add changes to channel that are needed for select op (#9084)
7 years ago
Yibing Liu 90afbd2856 Move back operator's event to RunImpl()
7 years ago
Xin Pan 4840c49b27 Better timeline
7 years ago
chengduoZH ef28e7deba refine parallel_do_grad
7 years ago
Liu Yiqun 6c614814da Limit the symbol table of fluid shared library.
7 years ago
Yu Yang 48f213e5a1
Merge pull request #8991 from reyoung/feature/shuffle_reader
7 years ago
Cao Ying 881c5227ab
Merge pull request #8843 from zhouhanqing/Paddle-ReduceProd
7 years ago
Liu Yiqun 9ed8e2a082 Merge branch 'develop' into core_inference_remove_clone
7 years ago
Liu Yiqun 8ecad98578 Add the bool variable to decide whether to have a copy of the program in ExecutorPrepareContext.
7 years ago