Commit Graph

14 Commits (83a2fb1f08714d12728292924ea0e07f72451987)

Author SHA1 Message Date
WangXi 83a2fb1f08
Add collective async wait op (#31463)
4 years ago
Qi Li 93c1d9e761
[ROCM] update fluid platform for rocm39 (part3), test=develop (#30913)
4 years ago
wanghuancoder 35c5b23f68
use iwyu clean include second time, test=develop (#30829)
4 years ago
WangXi b1026f64af
【kunlun】dygraph supports multi xpu card training (#30671)
4 years ago
ShenLiang e2d01eb650
Support dynamic graph distributed (#28997)
5 years ago
GaoWei8 c10dcff12d
refine PADDLE_ENFORCE (#25456)
5 years ago
GaoWei8 fb70682f00
fix PADDLE_ENFORCE (#25297)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Yi Liu 14e7041c6d
Fix CUDAHandleHolder destruction problem. (#23772)
5 years ago
Yi Liu 2169e6fb58
Initialize global nccl_comm in PE (#23275)
5 years ago
Wilber de009152a7 Compile without nccl deps. [2/2] (#22484)
5 years ago
Yi Liu efb05ba258
supports multiple NCCL communicators preserved in NCCLCommContext (#19407)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago