Commit Graph

16 Commits (6e7bfe30a6e9584a8ce21a6f6ef66a2bfb3b1097)

Author SHA1 Message Date
gongweibao ad2bc0c364 Fix a distribution bug and cleanup some not need logs. (#22381)
5 years ago
danleifeng f5262865c0 change select_gpus into absolute values in launch.py (#22031)
5 years ago
danleifeng 3fe63d6780 add store_true to use_paddlecloud argument in launch.py (#21168)
5 years ago
WangXi 9d8ec42353 launch.py remove setting for nccl sync, test=develop (#20909)
5 years ago
WangXi e78d7f57bb Print the rank which trainer is error in launch.py, test=develop (#20838)
5 years ago
WangXi 8c2c8dc626 distribute.launch use poll to query subprocess (#19853)
5 years ago
danleifeng 0865b5a9a0 distribute launch : add use_paddlecloud argument (#19273)
6 years ago
gongweibao 86f0591175 Remove node_num function. (#19167)
6 years ago
gongweibao c0a82748cf Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
gongweibao da9143c1cc Polish codes of old prs. (#17938)
6 years ago
gongweibao f3e5a5cf67 Unset https_proxy and http_proxy in our launch.py (#17915)
6 years ago
gongweibao 6a1df46991 Fine tuning launch.py (#17223)
6 years ago
chengduo ca03f4989a fix distributed launch.py (#17571)
6 years ago
Yan Xu 266444b8af fix dist launch script test=develop (#17404)
6 years ago
Yan Xu b4c3a6aa0b [Imperative] implement imperative NCCLParallelContext (#16477)
6 years ago
Yan Xu d424e5b4c9 add launch mp distributed job py module test=develop (#15620)
6 years ago