17748 Commits (c0550b54a50580d2e5985816d2ef085230f04e54)

Author | SHA1 | Message | Date
tangwei12 | c0550b54a5 | Feature/large scale kv save base/delta (#27470) (#27990) | 5 years ago
mapingshuo | 50d24899cf | fix kunlun kernel of reshape op (#27989) | 5 years ago
Zhou Wei | b57254ed61 | [cherry-pick2.0]Add tensor clone 2.0 (#27982) | 5 years ago
123malin | c0061ff56f | 【paddle.fleet】geo send sparse optimize (#27719) (#27979) | 5 years ago
Guanghua Yu | 51dd268cfe | error message optimization in mean_xpu,softmax_with_cross_entropy_op_xpu,test=kunlun (#27968) | 5 years ago
Feiyu Chan | 429c0b62b1 | support channel last in BatchNorm*d (#27961) | 5 years ago
mapingshuo | 39c31a20e5 | reshape support bool, test=develop (#27944) (#27971) | 5 years ago
Qinghe JING | 1f45c06e92 | add reduce xpu op test=develop;test=kunlun (#27960) | 5 years ago
guofei | 6bbb6e7f45 | Implement the function of OutScaleForTraining/OutScaleForInference in dygraph (#26601) | 5 years ago
YUNSHEN XIE | fea09fe534 | disable ut quickly (#27793) | 5 years ago
chentianyu03 | d05058d268 | Remove and reorganize the alias of APIs (#27717) | 5 years ago
Leo Chen | 9a2a4b5f65 | Support setting xpu place in dygraph mode (#27909) | 5 years ago
Thunderbrook | 3ee6ad6ec5 | solve bug in pull_dense_worker (#27918) | 5 years ago
MRXLT | 263a9e97fd | Fix adam (#27778) | 5 years ago
Double_V | b0edda4d99 | kunlun add op (#27890) | 5 years ago
Jack Zhou | c791df09cf | Add elementwise XPU OP kernel for KUNLUN core, including (but still cannot process common broadcast | 5 years ago
wangchaochaohu | c5fcc96d5b | xpu support for fill_constant Op (#27675) | 5 years ago
tianshuo78520a | a820871669 | Change PR-CI-Kunlun Test Number (#27923) | 5 years ago
Chengmo | 328cb289ed | 【paddle.fleet】fix sparse load (#27680) | 5 years ago
tangwei12 | cf70d5b350 | fix paddle error informations (#27889) | 5 years ago
wawltor | 95aa53425d | update the code for the topk message optimize | 5 years ago
Chen Weihang | 4ba977c720 | Polish some error message in opeators (#27876) | 5 years ago
123malin | a4f850748a | 【paddle.fleet】bug fix for parameter_recv (#27838) | 5 years ago
QingshuChen | 2712d07644 | support kunlun matmul_v2 (#27910) | 5 years ago
zhang wenhui | 5a83496c8d | Multi task (#26002) | 5 years ago
zhang wenhui | 7a58431c0a | fix norm api doc, test=develop (#27652) | 5 years ago
yinhaofeng | 3eb106da6d | Lookup table v2 xpu (#27888) | 5 years ago
Zhang Ting | d5cc144c60 | tune backward filter algorithm for float16 (#27529) | 5 years ago
wanghuancoder | 41aad9bfcd | revert 4 files, from clear include by iwyu, test=develop (#27895) | 5 years ago
hutuxian | 3f2a6ab65d | fix error msg (#27887) | 5 years ago
xiaoting | ae01801f0a | Add dropout and log_loss for kunlun (#27790) | 5 years ago
Guanghua Yu | 70c8c31371 | support mean,softmax_with_cross_entropy on Baidu Kunlun (#27792) | 5 years ago
Chengmo | 1607e87cb9 | add xpu sgd & momentum (#27728) | 5 years ago
Leo Chen | 049696bf67 | Refine the format of printing tensor (#27673) | 5 years ago
hong19860320 | c90d35564b | Add batch_norm and layer_norm XPU kernels (#27818) | 5 years ago
joanna.wozna.intel | ddcd1b5381 | Add bfloat16 resnet50 test (#27755) | 5 years ago
xiaoting | 6da7a7458b | add conv for xpu, test=kunlun (#27809) | 5 years ago
Thunderbrook | 04be37c57f | add xpu slice op (#27349) | 5 years ago
Thunderbrook | 8c25dfaacc | op error info (#27856) | 5 years ago
Wilber | 345574a6ed | Demo CMakeLists add openmp flag. (#27848) | 5 years ago
ShenLiang | 6d63cd2b93 | add gather_op xpu, test=kunlun (#27822) | 5 years ago
Feiyu Chan | 1d95a0fbc3 | fix error message for nce_op (#27863) | 5 years ago
gongweibao | 4237fefeb4 | Add shellcheck tools and modify copyright hook (#27722) | 5 years ago
Chengmo | c5f2802d56 | 【paddle.fleet】Update fleetrun & ps-heter (#27472) | 5 years ago
Shang Zhizhou | bbc837ee72 | add info log for trt input dynamic shape check (#27796) | 5 years ago
guofei | 2e1bca99ca | Refine the gradient calculation errors caused by renaming in while_grad (#27814) | 5 years ago
wanghuancoder | 8fa4c09889 | add load_op_xpu for Baidu Kunlun (#27817) | 5 years ago
Wilber | 9005c5a260 | Lite subgraph support arm cpu. (#27827) | 5 years ago
Jacek Czaja | 55e63763ec | [oneDNN] adaptive pool support (#27747) | 5 years ago
chen zhiyu | 6335e6a0a6 | add musl option (#27798) | 5 years ago