373 Commits (7c041e48f48e111b3ab0d5876c2b419e90c7b591)

Author SHA1 Message Date
yangyaming 58730ba131 Enhance unit test.
7 years ago
yangyaming bf3f56e899 Finish adaption for backward.
7 years ago
Liu Yiqun 371c53f88c Add profiling event in feed, fetch and load op.
7 years ago
Liu Yiqun 253ba6672f Merge branch 'develop' into core_inference_remove_clone
7 years ago
qiaolongfei a39c861530 rm unused private field in profiler
7 years ago
typhoonzero b8f4c8599e pserver runs in parallel
7 years ago
Kexin Zhao 8e7310146f Merge pull request #9143 from kexinzhao/numpy_conv2d_pool2d_fp16
7 years ago
Kexin Zhao f3c5e81556 add fp16 for cast op
7 years ago
Xin Pan 21e2c42a46 Merge pull request #9141 from panyx0718/develop
7 years ago
Tao Luo a448fbe9e1 Merge pull request #9134 from putcn/fix-selected-row-dep
7 years ago
Tao Luo 20be8e7e33 Merge pull request #9104 from ranqiu92/doc_dir
7 years ago
Xin Pan 1ca1e1c384 Fix a program copy regression.
7 years ago
qingqing01 7c1a0b77a0 Delete the detection_output_op, which had been split into several operators. (#9121)
7 years ago
Kexin Zhao e967d19b0a add more tests
7 years ago
Kexin Zhao a13ec3432a fix test error
7 years ago
Kexin Zhao e4de5dc347 add conv2d fp16 support
7 years ago
Xi Chen d20c6eb6de add math_function to selected_rows_functor dependency list
7 years ago
qingqing01 1cd700d8e8 Fix bug in LRN operator. (#9124)
7 years ago
ranqiu 64775126f3 change the dir of docs
7 years ago
qingqing01 b5a16dca20 Fix a critical bug in softmax_with_cross_entropy_op backward. (#9120)
7 years ago
Thuan Nguyen 1e4c504e60 Implement Select OP (#9088)
7 years ago
qingqing01 45073b7c39 Always synchronize when copy data on GPU from C++ to Numpy array. (#9110)
7 years ago
Xin Pan d284cf88e5 Merge pull request #9037 from panyx0718/develop
7 years ago
dzhwinter 128adf53cb [Speed]implement cudnn sequence softmax cudnn (#8978)
7 years ago
yangyaming 352fa41a16 Finish adapting forward.
7 years ago
Kexin Zhao e26f1123da Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017)
7 years ago
dzhwinter 7140071152 "exported scatter to python" (#9038)
7 years ago
chengduo 11c43e5da3 Merge pull request #9072 from chengduoZH/feature/refine_parallel_do
7 years ago
Abhinav Arora 41894da145 Add changes to channel that are needed for select op (#9084)
7 years ago
Yibing Liu 90afbd2856 Move back operator's event to RunImpl()
7 years ago
Xin Pan 4840c49b27 Better timeline
7 years ago
chengduoZH ef28e7deba refine parallel_do_grad
7 years ago
Liu Yiqun 6c614814da Limit the symbol table of fluid shared library.
7 years ago
Yu Yang 48f213e5a1 Merge pull request #8991 from reyoung/feature/shuffle_reader
7 years ago
Cao Ying 881c5227ab Merge pull request #8843 from zhouhanqing/Paddle-ReduceProd
7 years ago
Liu Yiqun 9ed8e2a082 Merge branch 'develop' into core_inference_remove_clone
7 years ago
Liu Yiqun 8ecad98578 Add the bool variable to decide whether to have a copy of the program in ExecutorPrepareContext.
7 years ago
武毅 d13ce35875 Feature/send recv can now retry (#9027)
7 years ago
dzhwinter 14fe40aaa6 Refine/nccl (#9009)
7 years ago
chengduo 788c600e9d Merge pull request #8932 from chengduoZH/feature/add_concat_rows
7 years ago
Liu Yiqun c0a9aebe1c Remove the clone of program in C++ Executor.Run().
7 years ago
chengduoZH 92e2207e18 refine doc
7 years ago
Yu Yang 164f2382af Polish code
7 years ago
chengduoZH ff09b21cd0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_concat_rows
7 years ago
Yu Yang e13aec601a Merge pull request #8830 from reyoung/feature/recordio_file_reader
7 years ago
Yu Yang f9974a4a12 Make double_buffer reader async
7 years ago
Yu Yang a8c076e577 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/shuffle_reader
7 years ago
chengduoZH b9397b2668 remove concat_rows
7 years ago
QI JUN 7287630e83 Repair nccl op test (#8575)
7 years ago
Yu Yang b52ad9de92 Merge pull request #9000 from reyoung/feature/extract_prepare_from_executor_run
7 years ago
Tao Luo b62874429d Merge pull request #8910 from Xreki/core_inference_profile
7 years ago
Yu Yang 43d09a1c5f Extract Prepare from Executor
7 years ago
Yu Yang 225efa671f Remove dims in base class
7 years ago
QI JUN f7e9fe57d3 [Memory]More memory optimization policy (#8690)
7 years ago
Yu Yang 2ea4a5d96c Polish double buffer reader
7 years ago
kexinzhao 607eec30a8 Merge pull request #8946 from kexinzhao/fix_cuda_arch_fp16
7 years ago
Yancey b5ef315cf1 Fix dist compile error (#8987)
7 years ago
qingqing01 b3d26cd3ad Fix bug in detection_output and mAP calculation in SSD. (#8985)
7 years ago
Yu Yang 46ae4075ee Polish ShuffleReader and test
7 years ago
Kexin Zhao c88f58dbd8 add comment
7 years ago
chengduoZH f1c3ecb2b2 add concat rows
7 years ago
chengduo 685f03762e Merge pull request #8890 from chengduoZH/feature/fix_bug_of_elementwise
7 years ago
Kexin Zhao 3b44b849d3 address comments
7 years ago
fengjiayi dd1244f3c9 Merge pull request #8943 from JiayiFeng/fix_bugs_in_readers
7 years ago
Yu Yang 7eedced82a Polish RecordIO
7 years ago
Yu Yang cfca8a3a26 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/recordio_file_reader
7 years ago
Yu Yang fea43077f6 Refine
7 years ago
pzelazko-intel 4730a4be24 MKLDNN pool2d OP kernel added (#8879)
7 years ago
Kexin Zhao 95de7617eb fix bug
7 years ago
Kexin Zhao 1998d5afa2 add gpu info func to get compute cap
7 years ago
Kexin Zhao d400b4192d fix math function arch mismatch for older GPU
7 years ago
fengjiayi 614c33fb3a fix a potential bug in the c++ reader
7 years ago
chengduoZH 1509ce6638 enhancement look_up_table
7 years ago
fengjiayi aa3f5058d3 Merge pull request #8841 from JiayiFeng/dev_double_buffer_for_cpp_reader
7 years ago
QI JUN b341bac7e1 Refine cast op (#8923)
7 years ago
Yancey 8468037918 Fix sparse update memory error for distributed training (#8837)
7 years ago
fengjiayi 35e1e0d521 uses channel to replace the traditional buffer
7 years ago
fengjiayi b3a11fdf3a Merge branch 'rm_reader_HasNext' into dev_double_buffer_for_cpp_reader
7 years ago
fengjiayi 6e5736e270 fix a compile error
7 years ago
fengjiayi 4e517881f7 remove HasNext
7 years ago
Liu Yiqun a8e8507767 Refine the profile codes for inference.
7 years ago
武毅 9dd34e4169 update unpushed commits for zerocopy grpc (#8900)
7 years ago
zhouhanqing 9d78971d8b Some comments have been modified.
7 years ago
Xin Pan b825c79261 Merge pull request #8897 from panyx0718/message
7 years ago
zhouhanqing 3ca968441d Merge branch 'develop' into Paddle-ReduceProd
7 years ago
kexinzhao 90215b7844 Add float16 GEMM math function on GPU (#8695)
7 years ago
武毅 45af8c1e99 Performance/zero copy variable seriralization (#8839)
7 years ago
Xin Pan 9a27d3af23 Print exception message from threads
7 years ago
chengduoZH 53d19f5b1e Add ElementwiseOpInferVarType
7 years ago
qingqing01 ffda2c414d Clipping bbox in the mAP evaluator calculation. (#8872)
7 years ago
Yiqun Liu fecc9a38c6 Add test for nested RecordEvent. (#8773)
7 years ago
Xin Pan a9b9ec45ab Merge pull request #8775 from panyx0718/test2
7 years ago
Yu Yang 9d4c93a0a7 Fix CI
7 years ago
chengduo abb10556e8 Merge pull request #8859 from chengduoZH/feature/refine_exe_log
7 years ago
Yu Yang b536799af0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/recordio_file_reader
7 years ago
Xin Pan 30e556d675 Use vlog instead.
7 years ago
Yu Yang db46778bdd Polish codes and comments
7 years ago
Yu Yang 5cb79524d2 Fix CI
7 years ago
QI JUN 47ca1814f3 fix mac build error (#8856)
7 years ago
chengduoZH f7c7135673 Add log before op Run
7 years ago