Commit Graph

24 Commits (0073f9bdb0b43a8d298346e28a2b403fe351bac3)

Author SHA1 Message Date
MRXLT 55098b975e
fleet support paddle.optimzier (#28026)
5 years ago
gongweibao a7c5210051
Fix test_hdfs bug. (#26068)
5 years ago
gongweibao 0067a2e4ec
Save checkpoint automatically (#25917)
5 years ago
gongweibao 80f1c50738
Fix typo in interface. (#24779)
5 years ago
mapingshuo f0e743f136
fix AMP and recompute (#23551)
5 years ago
gongweibao 24a063f6ac
Add fleet checkpoint on local fs and remote fs(such as hdfs) for EDL (#22586)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
WangXi 3ec289a6a3 fix sync_batch_norm hang in fleet (#21838)
6 years ago
lilong12 da75ac8b6c bugfix: construct a DistributedStrategy instance if the passed one is None (#21545)
6 years ago
lilong12 53148e0696
modify the implementation of save_persistables and save_inference_model for fleet collective mode (#20802)
6 years ago
WangXi cadc6a9704 fix dgc test and bug when not set trainers_endpoints_, test=develop (#20617)
6 years ago
mapingshuo f55d1c6867
Fleet: deal with special case: strategy is None (#20359)
6 years ago
mapingshuo 9901f69677
Forward recompute3 (#19913)
6 years ago
gongweibao e8d3745c0f
change _origin_program test=develop (#19863)
6 years ago
gongweibao 6c2bc29cc0
Fix float16 optimizer. (#19682)
6 years ago
Yi Liu 4ef6b8457a
adapte fleet api for localsgd and support nccl comm configuration in executor (#19443)
6 years ago
gongweibao 86f0591175
Remove node_num function. (#19167)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
guru4elephant 9c17a899d7
upgrade collective fleet api (#18533)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
tangwei12 101f74cb19
fix save/load in fleet (#17675)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
tangwei12 565d309501
Reformat fleet API (#17135)
6 years ago
tangwei12 1a4a51db2b
Fleet unify distributed training (#16791)
6 years ago