Commit Graph

70 Commits (c8b90d8f9a78328516864b05bf84fc88dfb3d383)

Author SHA1 Message Date
Huihuang Zheng dd4361568e
Add ParallelExecutor Test for Cond API and Fix PE Checks Shape Bug (#22029)
5 years ago
Leo Chen 8f53735437 Fix memory overwriting of tensors returned by executor (#19030)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
gongweibao f5caf3443c
Fix reinitialized ncclid error! (#18025)
6 years ago
gongweibao dd4cd352c7
Fix sync_batch_norm_op ncclallreduce error! (#17918)
6 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
6 years ago
chengduo bc833945a4
Add DropLocalExeScopes in ParallelExecutor (#17297)
6 years ago
Qiao Longfei 30618409db Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Yan Xu 30568473ec
fix broadcast on mp mode (#15951)
6 years ago
Qiao Longfei 4e218dabc5 code format test=develop
6 years ago
Qiao Longfei 8744f9a083 fix parallel executor async mode
6 years ago
Qiao Longfei f768fbf715 support multi graph
6 years ago
Xin Pan 32d5a16036 resolve conflicts
6 years ago
Xin Pan 26e32e095a allow compiler to use graph
6 years ago
Xin Pan 5e928e579a try unify Executor and ParallelExecutor
6 years ago
Yancey1989 35cda13e9f fix unittest test=develop
6 years ago
Yancey1989 ca8c77d966 selecte execution according to strategy test=develop
6 years ago
chengduo 550e7e410b
Code Clean parallel_executor.py (#14849)
6 years ago
sneaxiy 096673f675 refactor eager deletion
6 years ago
sneaxiy d3ed070e10 test=develop
6 years ago
sneaxiy fb6201e93e test=develop
6 years ago
Xin Pan 36c2a9af27 pass builder allow cutomize pass in python.
6 years ago
Xin Pan a83a4fab5c
Merge pull request #13441 from panyx0718/ir2
6 years ago
Xin Pan ec6ee0a293 simplify and hide bcast_params
6 years ago
sneaxiy 612e1a3155 modification
7 years ago
sneaxiy d0b2453ecd merge develop
7 years ago
sneaxiy 24ea39c4c6 feature/eager_delete_tensor
7 years ago
Xin Pan 626abfc33a code clean up and renaming
7 years ago
Xin Pan e4d7d7ae8f pass refactoring
7 years ago
Yancey1989 d14afcedeb polish function name
7 years ago
Yancey1989 1effba3312 fix pe with cpu place
7 years ago
Yancey1989 23433def4b Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Yancey1989 93401c98e1 overlap rpc op memcpy in distributed training
7 years ago
yuyang18 7c777dd549 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exec_strategy
7 years ago
yuyang18 08295f9877 Add build strategy
7 years ago
typhoonzero 7b0c0273f4 update by comments
7 years ago
yuyang18 e5281b3c2d Clean code & add execution strategy
7 years ago
typhoonzero 928418a9ac Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
chengduoZH 97cb5479ae change PE strategy
7 years ago
typhoonzero d9320dcd94 complete code
7 years ago
yangyaming 82571deb89 Change `customize_loss_grad` to `use_default_grad_scale`.
7 years ago
Yu Yang 7a395881d4 Add customize_loss_grad option to PE
7 years ago
Yu Yang 5305c5f845 Correctly implement destructor of ParallelExecutor
7 years ago
Yu Yang b4aaa00a8a Polish logic of ParallelExecutor
7 years ago
typhoonzero 0bf799a523 wip testing
7 years ago
qingqing01 2b7e5bd366
Support testing during training by ParallelExecutor. (#9738)
7 years ago
Xin Pan 4bbfa9eccb Add feed to ParallelExecutor
7 years ago
Xin Pan b123ce88a1 Add enable/disable for delayed ops
7 years ago
Yu Yang 02aaecca35 Fix CPU compile
7 years ago
Yu Yang edfd741e3a Add simple python wrapper for ParallelExecutor
7 years ago