Commit Graph

170 Commits (44c662b4e1e1344d606ee486a3a7c3173fdf1b58)

Author SHA1 Message Date
dzhwinter 44c662b4e1 Merge remote-tracking branch 'origin/develop' into fix/cudnn
7 years ago
Yu Yang c36dd3b338
Merge pull request #11114 from reyoung/feature/yep
8 years ago
dzhwinter 2b9ef7e249 "fix"
8 years ago
dzhwinter 75d8e8ca33 "fix compiled in manylinux"
8 years ago
dzhwinter 4777aec9be "done"
8 years ago
dzhwinter 7971d4a310
Feature/deterministic (#11205)
8 years ago
yuyang18 53dab95b75 Static DSO handle
8 years ago
yuyang18 c5115950a8 Use static for dlsym
8 years ago
yuyang18 7cf8b656a2 Remove lock in device context
8 years ago
Xin Pan 7eca286159
Merge pull request #11078 from panyx0718/improve_profiler
8 years ago
gongweibao 4fb7cc7f5e
Move sync_mode device ctx from grpc server (#10881)
8 years ago
Xin Pan 75ea577fd3 allow profiler and timeline to work when dev_ctx is nullptr.
8 years ago
Xin Pan f14e579cc3 clean up
8 years ago
Xin Pan 3cb6395688 better profiler and benchmark
8 years ago
Xin Pan 0d598cf9f6
Merge pull request #10822 from panyx0718/dist_opt
8 years ago
Xin Pan 08e4970e45 follow comments
8 years ago
Xin Pan b4dd4c048d multi-thread handlerequest
8 years ago
Krzysztof Binias 0aa01929c1 Add backward
8 years ago
Tao Luo 85b6bb5886
Merge pull request #10747 from jczaja/prv-mkldnn-pooling-reuse
8 years ago
dzhwinter 0e4467eee4
"fix compile" (#10657)
8 years ago
Xin Pan 40a2ee9ae8
Merge pull request #10621 from panyx0718/fix_profile
8 years ago
Jacek Czaja 5f1333058c - Draft of reuse of pooling mkldnn operator
8 years ago
yuyang18 dfbe06ccab Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fix_ninja_build
8 years ago
Xin Pan 94c0a64d62 Fix a profiler race condition
8 years ago
yuyang18 dc6ce071d4 Polish cmake
8 years ago
yuyang18 7c777dd549 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exec_strategy
8 years ago
yuyang18 08295f9877 Add build strategy
8 years ago
typhoonzero 7b0c0273f4 update by comments
8 years ago
typhoonzero f5840d8925 follow comments
8 years ago
typhoonzero 04bde96e4c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
8 years ago
fengjiayi 2bff03bc1e fix a compile error (#10488)
8 years ago
chengduoZH 345737d0fe add sync
8 years ago
typhoonzero a135fec1fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
8 years ago
typhoonzero 17009d0627 workable version
8 years ago
Xin Pan dce0732d5e
Merge pull request #10380 from panyx0718/dist_timeline
8 years ago
typhoonzero a529d790b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
8 years ago
typhoonzero 3667578ec2 testing
8 years ago
chengduoZH d36af62c1e wrap_shfl_x_sync
8 years ago
typhoonzero d9320dcd94 complete code
8 years ago
Xin Pan 5a9f17f02b clean up
8 years ago
Xin Pan 76d8b14bce Add timeline support for distributed training
8 years ago
chengduo 54797abd53
Merge pull request #10347 from chengduoZH/replace___shfl_with__shfl_sync
8 years ago
chengduoZH e97c1a8ca0 fix __shfl
8 years ago
chengduoZH 0cc635497c merge develop
8 years ago
Yiqun Liu 6084af47ef
Fix the bug when a input variable of op is dispensable. (#10268)
8 years ago
chengduo 4fbde42cdf Fix __shfl_down_sync_ of cross_entropy (#10345)
8 years ago
chengduoZH b8f7fa97b6 replace __shfl with __shfl_sync
8 years ago
chengduoZH 90d73c79c3 fix shfl_sync for CUDA8.0
8 years ago
dzhwinter eb6f9dd5de
Feature/cuda9 cudnn7 (#10140)
8 years ago
Yu Yang c02ba51de0
Merge pull request #10191 from reyoung/feature/strict_dynload
8 years ago