Commit Graph

7101 Commits (9cb8f503026c6d3d25fa80e34b8fa2ca0bea6d2f)

Author SHA1 Message Date
Yu Yang 9cb8f50302 Complete fetch op 7 years ago
Yu Yang 254d7ff4f5 Refactor local_scopes 7 years ago
Yu Yang b2c7a9b828 Wait by stream 7 years ago
Yu Yang e8a7e5d1e6 Update 7 years ago
Yu Yang 8f0590e7c5 Add ncclAllReduce 7 years ago
Yu Yang c15d2c9edc Update 7 years ago
Yu Yang d470763f6c Stash 7 years ago
Yu Yang 9fc0b596a9 Test more 7 years ago
Yu Yang 0ef9edf566 Stash 7 years ago
Yu Yang 5e87cd7574 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor 7 years ago
Yu Yang 8b397d1602 Make recordio file reader thread-safe by default 7 years ago
Yu Yang 8c9cd369dc Polish code style 7 years ago
Yu Yang 6f0dfd89a4 Single GPU ParallelExecutor complete 7 years ago
Kexin Zhao 8e7310146f Merge pull request from kexinzhao/numpy_conv2d_pool2d_fp16 7 years ago
Xin Pan 21e2c42a46 Merge pull request from panyx0718/develop 7 years ago
Tao Luo a448fbe9e1 Merge pull request from putcn/fix-selected-row-dep 7 years ago
Tao Luo 20be8e7e33 Merge pull request from ranqiu92/doc_dir 7 years ago
Xin Pan 1ca1e1c384 Fix a program copy regression. 7 years ago
qingqing01 7c1a0b77a0 Delete the detection_output_op, which had been split into several operators. () 7 years ago
Kexin Zhao e967d19b0a add more tests 7 years ago
Kexin Zhao a13ec3432a fix test error 7 years ago
Kexin Zhao e4de5dc347 add conv2d fp16 support 7 years ago
Xi Chen d20c6eb6de add math_function to selected_rows_functor dependency list 7 years ago
qingqing01 1cd700d8e8 Fix bug in LRN operator. () 7 years ago
ranqiu 64775126f3 change the dir of docs 7 years ago
qingqing01 b5a16dca20 Fix a critical bug in softmax_with_cross_entropy_op backward. () 7 years ago
Yu Yang d84ddcf123 Stash 7 years ago
Yu Yang 193c0a7e43 Handle var hazard 7 years ago
Thuan Nguyen 1e4c504e60 Implement Select OP () 7 years ago
qingqing01 45073b7c39 Always synchronize when copy data on GPU from C++ to Numpy array. () 7 years ago
Yu Yang 35744e7b36 Polish code 7 years ago
Xin Pan d284cf88e5 Merge pull request from panyx0718/develop 7 years ago
Yu Yang ae88fdefb7 Use thread pool 7 years ago
dzhwinter 128adf53cb [Speed]implement cudnn sequence softmax cudnn () 7 years ago
Kexin Zhao e26f1123da Add fp16 mul op support and bind paddle fp16 to numpy fp16 () 7 years ago
dzhwinter 7140071152 "exported scatter to python" () 7 years ago
Tao Luo cf2addd21f Merge pull request from luotao1/with_fluid 7 years ago
chengduo 11c43e5da3 Merge pull request from chengduoZH/feature/refine_parallel_do 7 years ago
Abhinav Arora 41894da145 Add changes to channel that are needed for select op () 7 years ago
Yu Yang 692a0f7425 Better name 7 years ago
Yu Yang baef1124fb ParallelExecutor And dependency engine 7 years ago
Yibing Liu 90afbd2856 Move back operator's event to RunImpl() 7 years ago
Xin Pan 4840c49b27 Better timeline 7 years ago
chengduoZH ef28e7deba refine parallel_do_grad 7 years ago
Luo Tao 76e1c6af9f enable WITH_FLUID option 7 years ago
Yu Yang 48f213e5a1 Merge pull request from reyoung/feature/shuffle_reader 7 years ago
Cao Ying 881c5227ab Merge pull request from zhouhanqing/Paddle-ReduceProd 7 years ago
武毅 d13ce35875 Feature/send recv can now retry () 7 years ago
dzhwinter 14fe40aaa6 Refine/nccl () 7 years ago
chengduo 788c600e9d Merge pull request from chengduoZH/feature/add_concat_rows 7 years ago