Commit Graph

11 Commits (86f059117579585db5e0ab59c1543177860260a3)

Author SHA1 Message Date
gongweibao 86f0591175
Remove node_num function. (#19167)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
jiaqi 02c370c3dc
support filelist size < trainer num && fix pull dense (#18956)
6 years ago
tangwei12 d845848341
do some odd jobs (#18641)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
Qiao Longfei 23f8a4b1c3 assign role_maker before use (#18137)
6 years ago
guru4elephant 58f3e1bad7
add paddle cloud role maker for customized usage, note this is only for industrial users that have cloud environment pre-configuration (#18121)
6 years ago
tangwei12 101f74cb19
fix save/load in fleet (#17675)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
tangwei12 565d309501
Reformat fleet API (#17135)
6 years ago
tangwei12 1a4a51db2b
Fleet unify distributed training (#16791)
6 years ago