Commit Graph

282 Commits (17030ff28b9a54bb57779e9b8448a6d222110ec5)

Author SHA1 Message Date
Qi Li cced930b61
[ROCM] update fluid operators for rocm (part1), test=develop (#31077)
4 years ago
石晓伟 efa54629fb
fix header file paths of gflags, commit 3, test=develop (#30273)
4 years ago
tangwei12 032414ca2a
[Feature] one ps (3/4) (#29604)
4 years ago
LoveAn b5d4a1f33d
Add the strategy of skipping cc/cu test compilation and execution in CI (#29499)
4 years ago
LoveAn 671555ed32
Compiling operator libraries with Unity build (#29130)
4 years ago
123malin b5c6342336
Update ps gpu (#29209)
4 years ago
123malin 03d4665f44
prefetch optimize (#29095)
4 years ago
123malin fbf9564f6b
【paddle.distributed.fleet】Optimize ParameterServer's Async Mode (#28442)
4 years ago
YUNSHEN XIE ba0756325a
exec ut no more than 15s 1 (#28439)
4 years ago
tangwei12 202bfab1be
Feature/large scale kv save base/delta (#27470)
4 years ago
123malin aa3b4ed717
【paddle.fleet】geo send sparse optimize (#27719)
4 years ago
Chen Weihang 4ba977c720
Polish some error message in opeators (#27876)
4 years ago
123malin a4f850748a
【paddle.fleet】bug fix for parameter_recv (#27838)
4 years ago
tangwei12 fd616fadc2
repen heartbeat ut (#27684)
4 years ago
MRXLT 20fb01fb00
fix distributed error info (#27206)
4 years ago
123malin cc780b1977
test=develop, optimize geo communicator (#26857)
4 years ago
Chengmo 0e101c4f6f
Fix test dist fleet heter ctr (#27513)
4 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
4 years ago
tangwei12 bc5f0246a8
large scale kv speedup (#26510)
4 years ago
Chengmo a72752263b
support heter-xpu-ps (#27018)
5 years ago
Chengmo 7f2aa2db3c
【paddle.fleet】Support Heter Parameter Server (#25998)
5 years ago
Leo Chen 844583c8fd
Refine paddle.manual_seed (#26496)
5 years ago
yaoxuefeng efee426742
support generator seed in related kernals test=develop (#26495)
5 years ago
Chengmo eeeef957c7
Fix ps gpu (#26218)
5 years ago
tangwei12 253fd407e8
Fix/distibuted heart beat (#25902)
5 years ago
tangwei12 2d9dbd31ad
Fix/mkl dnn (#25835)
5 years ago
tangwei12 caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
5 years ago
tangwei12 4b3778a3ee
Revert/barrier for sync (#25417)
5 years ago
Chengmo e85fcaa712
Fix fluid.embedding in Distributed Training (#25174)
5 years ago
tianshuo78520a 770c11a117
fix make device_context error (#25045)
5 years ago
tangwei12 be6a315fbd
Fix/sync barrier (#25016)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
qingqing01 6162cf2f2e
Make optimizer consistent in dygraph and static-graph and remove some LOG-INFO. (#23426)
5 years ago
tangwei12 ad9c8f6d2d
fix communicator when break under pyreder mode (#22911)
5 years ago
tangwei12 07e13b84cd
remove vlog, test=develop (#22898)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
tangwei12 66a3150135
SYNC with communicaotor (#22344)
5 years ago
Wilber de009152a7
Compile without nccl deps. [2/2] (#22484)
5 years ago
Chengmo 8f36c39537
Fix GEO-SGD init & send Bug (#22375)
5 years ago
tangwei12 82bc814a57
integrated HALF_ASYNC to communicator (#21869)
5 years ago
123malin 985bceac53
Bug fix for sparse recorder (#21969)
5 years ago
123malin 7fb817d447
add distributed_strategy (#21710)
5 years ago
zhouwei25 a01663ca1f
remove patch command and file of cares to Improved quality of Paddle Repo (#21776)
5 years ago
Chengmo a86f11b5f5
Speed GEO dense calc & communication (#21579)
5 years ago
tangwei12 9ad940fdfe
memory leak for cpu (#21174)
5 years ago
Tao Luo 70eb397677
remove unused snappy/snappystream depends in distributed codes (#21484)
5 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
5 years ago
Tao Luo c0656dcb1a
remove -Wno-error=sign-compare, make warning as error (#21358)
5 years ago
Chengmo bc8e600ce5
Fix rpc not wait in GEO communicator (#20967)
5 years ago