Commit Graph

240 Commits (b4a3b75002d4a61f5855884ecdd8bb3c29eb84a8)

Author SHA1 Message Date
123malin b4a3b75002
bug fix: invalid learning rate decay in pserver async mode (#20325)
6 years ago
tangwei12 a010d883b4
doc fix, test=develop, test=document_fix (#20239)
6 years ago
Chengmo 494d6cf252
Fix transpiler en doc (#20149)
6 years ago
tangwei12 b5a410466c
Trainer heartbeat for async mode (#19600)
6 years ago
Chengmo 728ec1b43d
Add GEO-SGD distribute training algorithm (#20018)
6 years ago
123malin 6c74e7387f
fix APIs, test=document_preview (#19954)
6 years ago
tangwei12 6a1db2044c
fix sync_with_distributed_lookup_table, test=develop (#19737)
6 years ago
123malin a25a716e87
Optimize fleet API: add input check for some interfaces (#18971)
6 years ago
Yi Liu 4ef6b8457a
adapte fleet api for localsgd and support nccl comm configuration in executor (#19443)
6 years ago
tangwei12 65c7368400
Fix the correctness of async mode at distributed training (#18863)
6 years ago
tangwei12 19dac67e9f
fix distribute transpiler GRPC error code 4, RPC Deadline (#18984)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
gongweibao c0a82748cf
Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
tangwei12 659b72a97c
fix document of python api get_startup_program() (#17764)
6 years ago
yaoxuefeng ac92e4c066
fix distributed_transpiler.py api test=develop (#17668)
6 years ago
gongweibao 0d561ef442
fix 2dconn test=develop (#17681)
6 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
6 years ago
Qiao Longfei 92e7d5d7cc
fix distribute doc test=develop (#17318)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
tangwei12 7330cd639c
truncated_gaussian_random supported in distributed training, test=develop (#17091)
6 years ago
tangwei12 1a4a51db2b
Fleet unify distributed training (#16791)
6 years ago
Qiao Longfei 0608f8ca56 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
Qiao Longfei d640c6cfa9 fix pylint
6 years ago
Qiao Longfei 542b52fac3 fix trainer_id
6 years ago
Qiao Longfei de65398cb8 update transpiler and listen and serv op
6 years ago
Qiao Longfei b8491bfd4e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
tangwei12 971f3bc9b0
fix params with only 1 dim (#15828)
6 years ago
Qiao Longfei 8bda4ab213 parameter recv can run
6 years ago
Qiao Longfei fbd186bd5d complete recv op
6 years ago
Qiao Longfei 4356f186b4 complete parameter_send
6 years ago
tangwei12 90df7ff378
transpiler.py code clean (#15555)
6 years ago
Wu Yi 22db82c053
fix tangwei merge issue test=develop (#15506)
6 years ago
tangwei12 8b50ad80ff
checkpoint at distributed training (#14854)
6 years ago
Qiao Longfei 810439a993 fix style test=develop
7 years ago
Qiao Longfei b16e832d4d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
7 years ago
乔龙飞 Qiao Longfei e1679b8847
Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
7 years ago
Qiao Longfei 49cce3fd0e fix dist sparse l2 decay
7 years ago
haowang101779990 66ea718452 en api improve format Dec 27
7 years ago
JiabinYang b2f789c66d add test transpiler dist test, test=develop
7 years ago
JiabinYang c35fdf1581 Merge branch 'add_prefetch_in_nce' of https://github.com/seiriosPlus/Paddle into feature/add_prefech_hs
7 years ago
gongweibao f1fb64b17f
Add reduce sparse tensor feature. (#14757)
7 years ago
tangwei12 b653ed0516 add prefetch and remvoe selectedrows of bias
7 years ago
Wu Yi aebc175cd4
add nccl2 dist tests (#14755)
7 years ago
tangwei12 4cb0100c8e add prefetch in nce
7 years ago
Tink_Y 6d04a9cf47 fix api format and example (#14686)
7 years ago
Qiao Longfei c8b0241da2 fix code style test=develop
7 years ago
Qiao Longfei 9589babe12 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
7 years ago
Qiao Longfei 87e4edd2ea fix grad_varname in remote prefetch
7 years ago
Qiao Longfei d98c59fd2c support none sliced variable
7 years ago