Commit Graph

105 Commits (8c19d7aa2f89a38b3a68e53c73d88af16a3de8ce)

Author SHA1 Message Date
Thunderbrook 3789a69923
solve bug in heter mode (#31531)
4 years ago
Qi Li 65bcaeb004
[ROCM] update fluid operators for rocm (part5), test=develop (#31258)
4 years ago
danleifeng d1075df2e8
topo and memory performance for heterps (#30440)
4 years ago
Thunderbrook c4f279fe8d
support multi node in heterps (#31102)
4 years ago
Qi Li a60d93fb77
[ROCM] update fluid framework for rocm (part2), test=develop (#31010)
4 years ago
Thunderbrook 565354f676
support save multi sparse table in one path (#31108)
4 years ago
Qi Li 8fe09faf14
[ROCM] update fluid framework for rocm (part1), test=develop (#31009)
4 years ago
wanghuancoder 35c5b23f68
use iwyu clean include second time, test=develop (#30829)
4 years ago
Thunderbrook cb66c53c2d
dump to cpu (#30750)
4 years ago
Thunderbrook 1bebc09253
solve build gpu task core (#30626)
4 years ago
Leo Chen 81217a94d8
unify calling cudaSetDevice (#30470)
4 years ago
hutuxian 40ede12631
Ascend Framework Part1: OP & Wrapper (#30281)
4 years ago
yaoxuefeng 6e0da01c61
Heter ps new (#30198)
4 years ago
tangwei12 5e839e4da5
add sparse embedding & load vars for 2.0 & gloo bug fix (#30306)
4 years ago
Thunderbrook 0b8e1fadc5
add topo-aware in heter-ps (#30087)
4 years ago
Thunderbrook 0ca6de171f
add include (#29952)
4 years ago
YUNSHEN XIE 2a01756bf3
remove duplicate ut names (#29809)
4 years ago
Thunderbrook 09b6e71928
heter box (#29734)
4 years ago
lilong12 ff6a145011
update, test=develop (#29559)
4 years ago
Thunderbrook 0073f9bdb0
support ps-gpu (#28752)
4 years ago
wanghuancoder 41aad9bfcd
revert 4 files, from clear include by iwyu, test=develop (#27895)
4 years ago
Thunderbrook 6f69a4cb05
add xpu in heter mode (#27000)
4 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
4 years ago
tangwei12 99626502f7
【paddle.fleet】gloo and util (#27213)
4 years ago
Chen Weihang 79149c8ee6
polish framework error message part 8 (#27269)
4 years ago
Thunderbrook 5205748481
fix eigen in push sparse; fix hadoop command (#26872)
5 years ago
yaoxuefeng a47d92d868
fleet add save with whitelist test=develop (#23376)
5 years ago
lilong12 1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552)
5 years ago
Thunderbrook a83e0f264c
fix heter proto (#26093)
5 years ago
Thunderbrook fd2947babf
fix compile error with mkl (#26030)
5 years ago
123malin 2191a08317
【paddle.fleet】fleet_util move to paddle.fleet (#25805)
5 years ago
Thunderbrook 0cb60c700d
add heter ps mode (#25682)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
hutuxian 1c224e26af
support CMatchAuc (#24990)
5 years ago
hutuxian b8f17a049d
fix problem in dump and add log (#24891)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
ShenLiang 950892044f
fix conflict, test=develop (#24238)
5 years ago
hutuxian e6b87b3193
Support AucRunner in PaddleBox (#22884)
5 years ago
hutuxian 123255cf9f
change InitializeGPU to InitializeGPUAndLoadModel (#24377)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
xujiaqi01 1034ca316f
add timeout and http store in communication (#23436)
5 years ago
hutuxian df64a96686
support set_test_mode and set comlog level(#23905)
5 years ago
xujiaqi01 d98084e7ec
add save with prefix (#23449)
5 years ago
hutuxian 94a3789fd0
Add AfsAPI in PaddleBox (#23419)
5 years ago
xujiaqi01 3a45767d49
add fleet pslib pull and push sparse op and push dense op (#23139)
5 years ago
xujiaqi01 68ea1ad55b
add clear one table (#23089)
5 years ago
danleifeng ae3bb16d06
add MaskAucCalculator in paddlebox (#23157)
5 years ago
hutuxian 0c30098f8b
Add need_save_delta parameter to solve OOM (#23097)
5 years ago
hutuxian 175954d894
PaddleBox Framework Part2 (#22466)
5 years ago
hutuxian 1a7962be97
Paddlebox about box_wrapper (#22497)
5 years ago