Commit Graph

218 Commits (b34933d9ee3b61dbbd642fd02f244c36d0d14550)

Author SHA1 Message Date
Tao Luo bcddbc78d4
remove -Wmaybe-uninitialized warning (#19653)
6 years ago
123malin 2f037c3189
fix the diff between async mode and async_half mode (#19535)
6 years ago
tangwei12 f45cb1c2ca
fix bug of communicator flag, test=develop (#19635)
6 years ago
tangwei12 65c7368400
Fix the correctness of async mode at distributed training (#18863)
6 years ago
gongweibao fd4b15a2f6
Unset unittests http_proxy env to avoid timeout. (#19269)
6 years ago
Zeng Jinle 708bd9798d
move_flags_to_unified_files_for_management, test=develop (#19224)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
tangwei12 999d9a59a5
fix communicator with pyreader (#18350)
6 years ago
Qiao Longfei 0e08e91c18
optimize communicator merge sparse gradient test=develop (#18159)
6 years ago
tangwei12 101f74cb19
fix save/load in fleet (#17675)
6 years ago
Zeng Jinle 3ece61f71e
Remove attribute in Allocator::Allocate (#17878)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
Tao Luo 3d19f44a89
remove unused SERIAL compiler option (#17500)
6 years ago
Qiao Longfei 287de41c04
Optimize communicator flags (#17494)
6 years ago
Qiao Longfei d831f1b0ba fix brpc code
6 years ago
Qiao Longfei 8b8a0487c7 fix compile test=develop
6 years ago
Qiao Longfei a541c25ab6 fix cpplint test=develop
6 years ago
Qiao Longfei 0608f8ca56 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
gongweibao bf606bce8a
Fix grpc log message. (#16735)
6 years ago
Qiao Longfei 766666a957 add log for FLAGS_communicator_send_wait_times
6 years ago
Qiao Longfei 4031c1a7b1 fix ci build test=develop
6 years ago
Qiao Longfei 9861a92f6f change the return type of NewTempScope to unique ptr test=develop
6 years ago
Qiao Longfei fb6cc3a1bd follow commnet, optimize code and add comment test=develop
6 years ago
Qiao Longfei d8974e6da0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Qiao Longfei 392e97aae5 fix cpplint test=develop
6 years ago
Qiao Longfei b65adf7f65 add communicator_send_wait_times
6 years ago
Qiao Longfei 63acbe7a65 fix bug
6 years ago
Qiao Longfei 0ff1e64fab fix a bug
6 years ago
Qiao Longfei 0997cf8f65 add more check
6 years ago
sneaxiy f8ed2c229e try to fix ci error
6 years ago
Qiao Longfei 93464b25ac update async_sparse_param_update_recorder
6 years ago
Qiao Longfei 542b52fac3 fix trainer_id
6 years ago
Qiao Longfei be0c482304 update trainer_id
6 years ago
Qiao Longfei c60f312d1b add trick
6 years ago
Qiao Longfei 103c9bb376 update rpc_client
6 years ago
Qiao Longfei b7661d7e56 add some log
6 years ago
Qiao Longfei e8fe5186a1 complete parameter_recv
6 years ago
Qiao Longfei d5c7898201 complete pserver side update
6 years ago
Qiao Longfei de65398cb8 update transpiler and listen and serv op
6 years ago
Qiao Longfei 25e2b41729 add AsyncSparseParamUpdateRecorder test
6 years ago
Qiao Longfei c6e82785aa init async_sparse_param_update_recorder
6 years ago
Qiao Longfei 039d783db5 change communicator_recv_wait_ms to communicator_max_send_grad_num_before_recv
6 years ago
Qiao Longfei ea0df4e8a2 add some check
6 years ago
Qiao Longfei 065b68b6ca clean code
6 years ago
Qiao Longfei 347178bd97 fix pserver memory leak
6 years ago
Qiao Longfei c567debcd9 optimize log
6 years ago
Qiao Longfei 0fcdae8418 add communicator_test
6 years ago
Qiao Longfei 9b74707cbf fix compile problem
6 years ago
Qiao Longfei 23d3929a4b optimize merge vars
6 years ago
Qiao Longfei d3a14377d5 add fake rpc to send
6 years ago