51 Commits (7f17e561d7a1dfa721532eddc4fa6d17f7c0761d)

Author SHA1 Message Date
sneaxiy d3ed070e10 test=develop
6 years ago
sneaxiy fb6201e93e test=develop
6 years ago
Xin Pan 36c2a9af27 pass builder allow cutomize pass in python.
6 years ago
Xin Pan a83a4fab5c Merge pull request #13441 from panyx0718/ir2
6 years ago
Xin Pan ec6ee0a293 simplify and hide bcast_params
7 years ago
sneaxiy 612e1a3155 modification
7 years ago
sneaxiy d0b2453ecd merge develop
7 years ago
sneaxiy 24ea39c4c6 feature/eager_delete_tensor
7 years ago
Xin Pan 626abfc33a code clean up and renaming
7 years ago
Xin Pan e4d7d7ae8f pass refactoring
7 years ago
Yancey1989 d14afcedeb polish function name
7 years ago
Yancey1989 1effba3312 fix pe with cpu place
7 years ago
Yancey1989 23433def4b Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Yancey1989 93401c98e1 overlap rpc op memcpy in distributed training
7 years ago
yuyang18 7c777dd549 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exec_strategy
7 years ago
yuyang18 08295f9877 Add build strategy
7 years ago
typhoonzero 7b0c0273f4 update by comments
7 years ago
yuyang18 e5281b3c2d Clean code & add execution strategy
7 years ago
typhoonzero 928418a9ac Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
chengduoZH 97cb5479ae change PE strategy
7 years ago
typhoonzero d9320dcd94 complete code
7 years ago
yangyaming 82571deb89 Change `customize_loss_grad` to `use_default_grad_scale`.
7 years ago
Yu Yang 7a395881d4 Add customize_loss_grad option to PE
7 years ago
Yu Yang 5305c5f845 Correctly implement destructor of ParallelExecutor
7 years ago
Yu Yang b4aaa00a8a Polish logic of ParallelExecutor
7 years ago
typhoonzero 0bf799a523 wip testing
7 years ago
qingqing01 2b7e5bd366 Support testing during training by ParallelExecutor. (#9738)
7 years ago
Xin Pan 4bbfa9eccb Add feed to ParallelExecutor
7 years ago
Xin Pan b123ce88a1 Add enable/disable for delayed ops
7 years ago
Yu Yang 02aaecca35 Fix CPU compile
7 years ago
Yu Yang edfd741e3a Add simple python wrapper for ParallelExecutor
7 years ago
Yu Yang a7b0d5bd26 Clean code
7 years ago
Yu Yang 79989c9025 Add SSA builder
7 years ago
Yu Yang 64d7a30271 Extract SSAGraph
7 years ago
Yu Yang 6ebc6bf533 ReorganizeCode
7 years ago
Yu Yang ba227df941 Expose num_threads
7 years ago
Yu Yang 7643c2cbab Add flag for use event
7 years ago
Yu Yang c18c2f6ab0 Sync all computation streams at the end of run
7 years ago
Yu Yang 1f53193a63 Use atomic code
7 years ago
Yu Yang c7beac1426 Add dummy var
7 years ago
Yu Yang 5fa535b717 Wait all thread done
7 years ago
Yu Yang a87ce91c4b Use mtx
7 years ago
Yu Yang ea11a0a853 Use volitie
7 years ago
Yu Yang 0023c3bcf5 Use atomic bool
7 years ago
Yu Yang 9cb8f50302 Complete fetch op
7 years ago
Yu Yang 8c9cd369dc Polish code style
7 years ago
Yu Yang 6f0dfd89a4 Single GPU ParallelExecutor complete
7 years ago
Yu Yang 35744e7b36 Polish code
7 years ago
Yu Yang baef1124fb ParallelExecutor And dependency engine
7 years ago
Yang Yang 8f061e43b7 delete param name
7 years ago