Commit Graph

15 Commits (f8da5536edaa004fd42988539508f6810a2fe958)

Author SHA1 Message Date
WangXi 572c466d19
[Prepare for MultiProcess xpu] unified gen nccl id, refine imperative reducer (#30455)
4 years ago
ShenLiang 01e2874a0e
Support multi-stream communication for dynamic graph distributed (#29525)
4 years ago
ShenLiang e2d01eb650
Support dynamic graph distributed (#28997)
4 years ago
danleifeng a24d186814
fix nccl init failed in parallel dygraph mode (#28497)
4 years ago
Chen Weihang c42e656179
Add retry for dygraph parallel socket bind (#28404)
4 years ago
danleifeng f29fb396df
dygraph nccl init support host domain name (#28107)
4 years ago
Leo Chen a5b3263782
Refine error msg in paddle/fluid/imperative (#27521)
4 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Yi Liu 2169e6fb58
Initialize global nccl_comm in PE (#23275)
5 years ago
Yi Liu 121b2aed4d
initialize global nccl context in dygraph (#23037)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
chengduo cca26f5c42
polish multi process warning info (#19961)
5 years ago
chengduo 5436d66667
close socket connect (#17862)
6 years ago
Yan Xu 55e3c6949b
disable reuse port test=develop (#16704)
6 years ago
Yan Xu b4c3a6aa0b
[Imperative] implement imperative NCCLParallelContext (#16477)
6 years ago