14 Commits (53c16eabe42c0e499070bc0511c1d5501617d420)

Author SHA1 Message Date
danleifeng 3fe63d6780 add store_true to use_paddlecloud argument in launch.py (#21168)
6 years ago
WangXi 9d8ec42353 launch.py remove setting for nccl sync, test=develop (#20909)
6 years ago
WangXi e78d7f57bb Print the rank which trainer is error in launch.py, test=develop (#20838)
6 years ago
WangXi 8c2c8dc626 distribute.launch use poll to query subprocess (#19853)
6 years ago
danleifeng 0865b5a9a0 distribute launch : add use_paddlecloud argument (#19273)
6 years ago
gongweibao 86f0591175 Remove node_num function. (#19167)
6 years ago
gongweibao c0a82748cf Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
gongweibao da9143c1cc Polish codes of old prs. (#17938)
6 years ago
gongweibao f3e5a5cf67 Unset https_proxy and http_proxy in our launch.py (#17915)
6 years ago
gongweibao 6a1df46991 Fine tuning launch.py (#17223)
6 years ago
chengduo ca03f4989a fix distributed launch.py (#17571)
6 years ago
Yan Xu 266444b8af fix dist launch script test=develop (#17404)
6 years ago
Yan Xu b4c3a6aa0b [Imperative] implement imperative NCCLParallelContext (#16477)
6 years ago
Yan Xu d424e5b4c9 add launch mp distributed job py module test=develop (#15620)
7 years ago