Commit Graph

367 Commits (0e101c4f6fd4f2d038d725ecae5729d904ef694e)

Author SHA1 Message Date
tangwei12 caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
5 years ago
Chengmo e85fcaa712
Fix fluid.embedding in Distributed Training (#25174)
5 years ago
ShenLiang a6ab43aab4
add type for lsgd, test=develop (#24702)
5 years ago
swtkiwi f5c6dd6def
test=develop (#24522)
5 years ago
Chen Weihang 25a233e46d
Simplify Program printing code to improve debugging efficiency (#23918)
5 years ago
tangwei12 853f2e5272
fix bug at sync with communicator (#23077)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
hutuxian 175954d894
PaddleBox Framework Part2 (#22466)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
tangwei12 66a3150135
SYNC with communicaotor (#22344)
5 years ago
tangwei12 948299aec8
deprecated for distribute transpiler api (#22513)
5 years ago
tangwei12 82bc814a57
integrated HALF_ASYNC to communicator (#21869)
5 years ago
123malin 985bceac53
Bug fix for sparse recorder (#21969)
5 years ago
Chengmo 418abc92f4
Update pyramid related OP (#21372)
5 years ago
Chengmo 5c33919328
Fix grad clip (#21784)
5 years ago
123malin 7fb817d447
add distributed_strategy (#21710)
5 years ago
tangwei12 9ad940fdfe
memory leak for cpu (#21174)
5 years ago
hutuxian c5aec2fe68
Paddlebox Related to Framework (#21586)
5 years ago
Kaipeng Deng ebfb720a63
add Adam beta1/beta2 support Variable (#21234)
5 years ago
123malin 20cdff0e02
Optimize decay (#20816)
5 years ago
tangwei12 1d925440ca
fix fetch handler error with pslib (#20679)
5 years ago
Chengmo 940c6ff1c8
Fix communicator slow bug & fix communicator stop bug (#20366)
5 years ago
123malin b4a3b75002
bug fix: invalid learning rate decay in pserver async mode (#20325)
5 years ago
tangwei12 a010d883b4
doc fix, test=develop, test=document_fix (#20239)
5 years ago
Chengmo 494d6cf252
Fix transpiler en doc (#20149)
5 years ago
Chengmo eb05db7104
Speed GEO-SGD (#20158)
5 years ago
tangwei12 b5a410466c
Trainer heartbeat for async mode (#19600)
5 years ago
Chengmo 728ec1b43d
Add GEO-SGD distribute training algorithm (#20018)
5 years ago
Zeng Jinle 5f2290ab84
Add deprecated memory optimize doc (#20111)
5 years ago
123malin 6c74e7387f
fix APIs, test=document_preview (#19954)
5 years ago
tangwei12 6a1db2044c
fix sync_with_distributed_lookup_table, test=develop (#19737)
5 years ago
123malin a25a716e87
Optimize fleet API: add input check for some interfaces (#18971)
6 years ago
Yi Liu 4ef6b8457a
adapte fleet api for localsgd and support nccl comm configuration in executor (#19443)
6 years ago
tangwei12 65c7368400
Fix the correctness of async mode at distributed training (#18863)
6 years ago
tangwei12 19dac67e9f
fix distribute transpiler GRPC error code 4, RPC Deadline (#18984)
6 years ago
Tao Luo 2f8c7e021f
remove unused inference_transpiler unit-tests (#19130)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
Zeng Jinle c194b0c835
Try to deprecate unstable python memory optimize (#18983)
6 years ago
Zeng Jinle 8008ab4e6b
Remove legacy C++ memory optimization codes (#18834)
6 years ago
Yi Liu 157211c4e1
supports distributed classification (#18690)
6 years ago
tangwei12 d845848341
do some odd jobs (#18641)
6 years ago
gongweibao c0a82748cf
Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
chengduo e06c69c788
Fix default value of fluid.memory_optimize (#18295)
6 years ago
tangwei12 659b72a97c
fix document of python api get_startup_program() (#17764)
6 years ago
yaoxuefeng ac92e4c066
fix distributed_transpiler.py api test=develop (#17668)
6 years ago
gongweibao 0d561ef442
fix 2dconn test=develop (#17681)
6 years ago
tangwei12 0d3c48e0a8
fix doc in transpiler, test=develop (#17313)
6 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
6 years ago