323 Commits (36c0410223dd6608f44707738decd1288f19ec4d)

Author SHA1 Message Date
lilong12 36c0410223
Revert "Initialize gloo for low level collective apis (#27356)", test=document_fix (#27665)
5 years ago
Li Fuchen 516d84b22a
fix tests warpctc (#27639)
5 years ago
lilong12 fa73e4a284
Initialize gloo for low level collective apis (#27356)
5 years ago
YUNSHEN XIE d1c2a3bc6f
disable ut test_warpctc_op,test=document_fix (#27632)
5 years ago
gongweibao 86fa043205
init test=develop (#27554)
5 years ago
Chen Weihang c143326df5
try to fix test_paddle_save_load unknown timeout (#27536)
5 years ago
WangXi e550fc02ae
fleet2.0 add fp16 grad compression (#27480)
5 years ago
YUNSHEN XIE b6ecf35627
disable test_paddle_save_load,test=document_fix (#27534)
5 years ago
YUNSHEN XIE f0a5eef58f
cancel three disable ut (#27359)
5 years ago
YUNSHEN XIE 25902b2c12
del exclusive ut which name with test_dist_ (#27316)
5 years ago
wanghuancoder bf8e030eed
modify test_imperative_using_non_zero_gpu from use two gpus to one gpu (#27348)
5 years ago
Leo Chen 189e10f1dd
Remove unnecessary requirements (#27341)
5 years ago
YUNSHEN XIE 8fe1c2d1c2
move three ut to execute only at night (#27314)
5 years ago
chalsliu c8e54c5e42
Disable unit-test test_fleet_rolemaker_new
5 years ago
YUNSHEN XIE 6947a58a1f
disable three unittests,test=document_fix (#27299)
5 years ago
Zhen Wang b671538623
* Reduce the training iterations in test_fetch_unmerged and test_fuse_bn_act_pass. (#27234)
5 years ago
123malin 60c3ef3ab8
【paddle.fleet】parameter_server_optimizer support auto_strategy (#27181)
5 years ago
liym27 d3874ab44a
Move unittest test_optimizer_in_control_flow from CI multi_cards. (#27185)
5 years ago
Dong Daxiang f7d08b7db8
【paddle.fleet】refine launch and distributed repr string for print (#27093)
5 years ago
tangwei12 ca6100de5f
disable ut, fix it @malin (#27200)
5 years ago
Dong Daxiang 0443b480b8
【paddle.fleet】add auto parallel L1 implementations (#27090)
5 years ago
YUNSHEN XIE d8984a6b90
limit timeout value setting on linux (#26923)
5 years ago
Chengmo d0962abd20
supplement bug fix of parameter server (#26217)
5 years ago
Feiyu Chan f4083010a7
Add unified RNN APIs (#26588)
5 years ago
gongweibao fc5acdd041
Fix fleet ut timeout issue. (#26694)
5 years ago
Kaipeng Deng 84e7be3178
add TensorDataset for multiprocess DataLoader (#26332)
5 years ago
lilong12 1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552)
5 years ago
gongweibao ed4a4ea37e
fix test=develop (#26632)
5 years ago
YUNSHEN XIE 39fe0d35aa
find timeout unittests (#26371)
5 years ago
gongweibao 7c42f056e2
Finetune the performance of the unittests. (#26402)
5 years ago
Dong Daxiang 83cd185947
【paddle.fleet】Meta from optimizer (#26392)
5 years ago
lilong12 e92f770c42
Add collective ops (reduce) (#26340)
5 years ago
Chengmo 623e14fbc5
disable test_dist_fleet_ctr (#26507)
5 years ago
tangwei12 d12ac984bf
disable UT temp (#26404)
5 years ago
123malin 57d434df5d
add save/load for parameter server (#26235)
5 years ago
ceci3 56890dc729
Add SyncBatchNorm (#26032)
5 years ago
Zhou Wei 5017aa76e6
set default python3,fix incompatible,cache dir for third party,unify error code,for windows (#26178)
5 years ago
YUNSHEN XIE 30e1083e82
disable test_sampling_id_op (#26313)
5 years ago
Dong Daxiang 50a5bcfc9d
【paddle.fleet】paddle.fleet -> paddle.distributed.fleet. (#26186)
5 years ago
Leo Chen 2d95280e1f
Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903)
5 years ago
Kaipeng Deng dbc88bb900
Add iterable dataset support for multiprocess DataLoader (#25558)
5 years ago
JZ-LIANG 54003b873e
【paddle.fleet】add lamb to fleet meta optimizer (#26025)
5 years ago
lilong12 8caee2ad51
【paddle.fleet】add the support for multi-node training for pipeline (#25907)
5 years ago
gongweibao 0067a2e4ec
Save checkpoint automatically (#25917)
5 years ago
123malin 2191a08317
【paddle.fleet】fleet_util move to paddle.fleet (#25805)
5 years ago
WangXi 2c9d0f3cb9
【paddle.fleet】Add dgc to fleet meta optimizer (#25738)
5 years ago
danleifeng 3dd2e3801a
【paddle.fleet】add fleetrun command for distributed running (#25806)
5 years ago
WangXi a6c87fd091
Add amp to fleet meta optimizer, test=develop (#25770)
5 years ago
JZ-LIANG 8ebffc78c9
add lars to fleet meta optimizer (#25884)
5 years ago
Dong Daxiang 8d2896f1fe
【paddle.fleet】Fleet run graph in Executor and add two more strategies (#25844)
5 years ago