11 Commits (44a0a4adccec9348a5ed325062e2ca3106480c56)

Author | SHA1 | Message | Date
Chengmo | 16596f6498 | Fix Paddle Cloud role maker () | 6 years ago
Chengmo | 940c6ff1c8 | Fix communicator slow bug & fix communicator stop bug () | 6 years ago
Chengmo | 728ec1b43d | Add GEO-SGD distribute training algorithm () | 6 years ago
123malin | a25a716e87 | Optimize fleet API: add input check for some interfaces () | 6 years ago
tangwei12 | 65c7368400 | Fix the correctness of async mode at distributed training () | 6 years ago
gongweibao | 86f0591175 | Remove node_num function. () | 6 years ago
gongweibao | 29d8781240 | Polish fleet API to support cuda collective mode and nccl2 mode. () | 6 years ago
guru4elephant | 357311fdb7 | make fleet support mpi job submit directly () | 6 years ago
tangwei12 | 999d9a59a5 | fix communicator with pyreader () | 6 years ago
tangwei12 | 4c735f24ea | fix bug in fleet, test=develop () | 6 years ago
tangwei12 | 101f74cb19 | fix save/load in fleet () | 6 years ago