Commit Graph

91 Commits (88e6dc4ac5a5f0a4ed0c54365e4210528da6f3ab)

Author SHA1 Message Date
Thunderbrook 0b8e1fadc5
add topo-aware in heter-ps (#30087)
4 years ago
Thunderbrook 0ca6de171f
add include (#29952)
4 years ago
YUNSHEN XIE 2a01756bf3
remove duplicate ut names (#29809)
4 years ago
Thunderbrook 09b6e71928
heter box (#29734)
4 years ago
lilong12 ff6a145011
update, test=develop (#29559)
4 years ago
Thunderbrook 0073f9bdb0
support ps-gpu (#28752)
4 years ago
wanghuancoder 41aad9bfcd
revert 4 files, from clear include by iwyu, test=develop (#27895)
4 years ago
Thunderbrook 6f69a4cb05
add xpu in heter mode (#27000)
4 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
4 years ago
tangwei12 99626502f7
【paddle.fleet】gloo and util (#27213)
4 years ago
Chen Weihang 79149c8ee6
polish framework error message part 8 (#27269)
4 years ago
Thunderbrook 5205748481
fix eigen in push sparse; fix hadoop command (#26872)
5 years ago
yaoxuefeng a47d92d868
fleet add save with whitelist test=develop (#23376)
5 years ago
lilong12 1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552)
5 years ago
Thunderbrook a83e0f264c
fix heter proto (#26093)
5 years ago
Thunderbrook fd2947babf
fix compile error with mkl (#26030)
5 years ago
123malin 2191a08317
【paddle.fleet】fleet_util move to paddle.fleet (#25805)
5 years ago
Thunderbrook 0cb60c700d
add heter ps mode (#25682)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
hutuxian 1c224e26af
support CMatchAuc (#24990)
5 years ago
hutuxian b8f17a049d
fix problem in dump and add log (#24891)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
ShenLiang 950892044f
fix conflict, test=develop (#24238)
5 years ago
hutuxian e6b87b3193
Support AucRunner in PaddleBox (#22884)
5 years ago
hutuxian 123255cf9f
change InitializeGPU to InitializeGPUAndLoadModel (#24377)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
xujiaqi01 1034ca316f
add timeout and http store in communication (#23436)
5 years ago
hutuxian df64a96686
support set_test_mode and set comlog level(#23905)
5 years ago
xujiaqi01 d98084e7ec
add save with prefix (#23449)
5 years ago
hutuxian 94a3789fd0
Add AfsAPI in PaddleBox (#23419)
5 years ago
xujiaqi01 3a45767d49
add fleet pslib pull and push sparse op and push dense op (#23139)
5 years ago
xujiaqi01 68ea1ad55b
add clear one table (#23089)
5 years ago
danleifeng ae3bb16d06
add MaskAucCalculator in paddlebox (#23157)
5 years ago
hutuxian 0c30098f8b
Add need_save_delta parameter to solve OOM (#23097)
5 years ago
hutuxian 175954d894
PaddleBox Framework Part2 (#22466)
5 years ago
hutuxian 1a7962be97
Paddlebox about box_wrapper (#22497)
5 years ago
yaoxuefeng 2235ee1a5e
multi-loss optimization by adding a DownpourOpt worker (#22025)
5 years ago
Wilber a90fa54092
Compile without nccl deps. [1/2] (#22509)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
xujiaqi01 371f377bea
add GeneralRoleMaker (#22295)
5 years ago
xujiaqi01 e3a457d34b
add collective communication library in fleet (#22211)
5 years ago
Thunderbrook c3cf42d0f7
add table id in cache shuffle (#21585)
5 years ago
xujiaqi01 c05706fe73
fix code style of fleet_wrapper (#21639)
5 years ago
Thunderbrook 9a7832f8be
print table stat info for pslib (#21296)
5 years ago
Thunderbrook 0d17c1b816
solve pslib core in stop worker (#21263)
5 years ago
Thunderbrook 349e82d669
support general embedding params (#21217)
5 years ago
xujiaqi01 23876de55b
fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052)
5 years ago
xujiaqi01 9e045170c0
add copy table (#21086)
5 years ago
xujiaqi01 48669aa8f0
fix several sparse table issuses (#20686)
5 years ago
xujiaqi01 cedc04775c
support change shuffle and train thread num (#19841)
5 years ago