Commit Graph

7237 Commits (ab5a3560dcda21c3886a1aebc83e3967de35ab4e)

Author SHA1 Message Date
Qiao Longfei f3dc3112cc
add split ids op (#9370)
7 years ago
chengduo 2e4a398638
Merge pull request #9216 from chengduoZH/feature/add_pinned_memory
7 years ago
Tao Luo c858f48979
Merge pull request #8887 from luotao1/infer_mkl
7 years ago
chengduoZH 9e99446e25 Add note for cudaMallocHost
7 years ago
chengduoZH a0e2cf03e4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_pinned_memory
7 years ago
chengduoZH 39004080f4 replace use_pinned with is_pinned
7 years ago
Xin Pan 3941c2ddec
Merge pull request #9355 from panyx0718/layer_norm
7 years ago
Luo Tao 6332bd1ed8 Merge branch 'develop' into infer_mkl
7 years ago
Qiao Longfei 4f522fa8d5
fix compile send_op on mac (#9360)
7 years ago
Yancey 1b0a17f415
Merge pull request #9303 from Yancey1989/split_send_op
7 years ago
Yancey1989 ebbb428db9 fix ci
7 years ago
Tao Luo cb3bbbd5c6
Merge pull request #9081 from kbinias/kbinias/mkldnn-activations
7 years ago
Qiao Longfei 8ccc61f334
support empty tensor (#9338)
7 years ago
chengduo 4a92e89623
Merge pull request #9337 from chengduoZH/feature/fix_concat
7 years ago
武毅 12856c5f69
Merge pull request #9325 from dzhwinter/fix/dropout1
7 years ago
chengduoZH aca9180a76 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fix_concat
7 years ago
Xin Pan 1a4be55a47 Pass cpu build
7 years ago
Xin Pan 904fa05f46 Improve layer_norm speed
7 years ago
Yancey1989 79af7cc9d3 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into split_send_op
7 years ago
Yancey1989 081b782434 update by comment
7 years ago
gongweibao cffe1a9112
Profiler can get elapsed time of `sendop` (#9345)
7 years ago
Darcy 8090eb6272 added proto_desc to device_tracer's dep list (#9342)
7 years ago
Krzysztof Binias d8bd436fc1 Fixed tests
7 years ago
Krzysztof Binias a64b312e3a Correcting for PR comments
7 years ago
Krzysztof Binias 4466f0bec8 MKLDNN Relu Tanh Sqrt Abs activations added
7 years ago
chengduoZH 750aff10ce code refine
7 years ago
chengduoZH 043f47b27f fix concat op
7 years ago
yi.wu bb815d4364 update
7 years ago
yi.wu a9a228ad8d fix dist compile
7 years ago
Luo Tao ae820a34bc Merge branch 'develop' into infer_mkl
7 years ago
Varun Arora 76ae540f8e
Move Select to concurrency.py; incorporate outputs (#9136)
7 years ago
武毅 9c35b0dc1b
Merge pull request #9287 from typhoonzero/pserver_prepare_before_run
7 years ago
Tao Luo e027eb40d7
Merge pull request #9123 from tpatejko/tpatejko/mkldnn-lrn
7 years ago
dzhwinter e33af2414b "fast hack"
7 years ago
typhoonzero 9367f11eb7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into pserver_prepare_before_run
7 years ago
Yancey ee7f1ecd7c
Fix dist compile error (#9320)
7 years ago
Tao Luo 9126e626fc
Merge pull request #9165 from ROCmSoftwarePlatform/amd_cmake_01
7 years ago
qingqing01 8f8728635a
Fix bug for backward tanspiler when using parallel_do operator. (#9282)
7 years ago
typhoonzero a88cc46221 update
7 years ago
typhoonzero 972a102b92 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into pserver_prepare_before_run
7 years ago
typhoonzero 5e6276edc1 fix transpiler bug
7 years ago
dzhwinter 13f1050ab0
"fix mixed_vector bug" (#9319)
7 years ago
Yu Yang 9e3e424ecb
Merge pull request #9299 from reyoung/feature/refactor_batch_norm
7 years ago
Tao Luo be0efbffce
Merge pull request #9233 from weixing02/build
7 years ago
sabreshao e0ac6bc436 CMake refine for HIP support.
7 years ago
gongweibao 990d6396fe
Reuduce memory copy when communication between trainer and pserver. (#9271)
7 years ago
weixing02 d70a70bcda Modified build.sh and remove build_doc.sh
7 years ago
whs b594251f89
Merge pull request #9082 from wanghaoshuang/average_model
7 years ago
Yu Yang 1d8fe2a220 Enhance device context pool (#9293)
7 years ago
Kexin Zhao 64c5c8f8b0
Merge pull request #9269 from kexinzhao/softmax_cudnn_fp16
7 years ago