Commit Graph

44 Commits (d12252e6a6479fe8031ef993a74a53bba8411eb8)

Author SHA1 Message Date
dzhwinter d376cf71b7 polish code for reading. test=develop
6 years ago
乔龙飞 Qiao Longfei 6e0e706198
Revert "cpu reduce mode did not need to broadcast params test=develop"
6 years ago
Qiao Longfei 97b143fb49 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-cpu-broadcast
6 years ago
Qiao Longfei fbadd4b60c follow comment test=develop
6 years ago
dzhwinter 04e9776aef add details. test=develop
6 years ago
Qiao Longfei 76072261f8 fix compiler
6 years ago
dzhwinter e537634d16 delete graph print pass. test=develop
6 years ago
dzhwinter 0a63234c85 follow comments. test=develop
6 years ago
dzhwinter 32a2014939 refine build strategy. test=develop
6 years ago
dzhwinter ee3aae56cd merge develop branch. test=develop
6 years ago
dzhwinter 2739096eec compatibable with python side mem_opt
6 years ago
WangZhen 3ce6172052 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
dzhwinter 8f3b252392 squash commits. test=develop
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
WangZhen e2ff300b02 add UT for quantization.
6 years ago
WangZhen 451896fce4 init quantization.
6 years ago
minqiyang 68a07328fa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_pyramid_dnn_support
6 years ago
minqiyang 4bfa110fd8 Add no lock optimize pass
6 years ago
chengduo eabb2105fa
Refactor MultiDevSSAGraphBuilder (#15090)
6 years ago
Yancey1989 e65436103f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Wu Yi 227e0c4518
fix nccl2 mode startup test=develop (#15132)
6 years ago
Yancey1989 ca8c77d966 selecte execution according to strategy test=develop
6 years ago
Yancey1989 1a4f79a7de fix unittest test=develop
6 years ago
Yancey1989 845bfd5807 cleanup code
6 years ago
Yancey1989 41a64f6a2a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
chengduo 550e7e410b
Code Clean parallel_executor.py (#14849)
6 years ago
dzhwinter 7cd24b1318
add ir memory optimize. (#14530)
6 years ago
Yancey1989 fd144954ed redefine api test=develop
6 years ago
gongweibao f1fb64b17f
Add reduce sparse tensor feature. (#14757)
6 years ago
Wu Yi 29d9fb53fc
[Feature] multi process multi gpu dist training, boost v100 performance by 20% (#14661)
7 years ago
gongweibao 867c312bc4
Fix allreduce dependency order. (#14586)
7 years ago
peizhilin 7c8c9dc9bf fix unit test cases
7 years ago
Xin Pan 759ffca423 some improvements
7 years ago
Xin Pan 99dffb91d6 allow to repeatedly share and update BuildStrategy
7 years ago
sneaxiy faac8a76ce remove unnecessary codes
7 years ago
sneaxiy cef8cc81db merge develop
7 years ago
Wu Yi 26200f2e42
[1.1] [project] train imagenet using large batch size (#13766)
7 years ago
sneaxiy 45559d042c move to pass
7 years ago
sneaxiy a314a80cdb merge develop
7 years ago
sneaxiy d87569134c test=develop
7 years ago
Xin Pan c2b3838cf5 add some comments
7 years ago
Xin Pan 0d9ee0dced fix resolve conflicts
7 years ago
Xin Pan b43e49fa31 fix
7 years ago
Xin Pan 36c2a9af27 pass builder allow cutomize pass in python.
7 years ago