Commit Graph

12424 Commits (5ded39f226d9db0b391fa4d24fa92e21ed023044)

Author SHA1 Message Date
Jacek Czaja 9e527d9956
[oneDNN] Added basic changes for elementwise_add_grad bf16 (#30925)
4 years ago
liuyuhui 4a8b8b4547
[Kunlun] add gen_bkcl_id_op, support multi XPU cards training using multiprocess (#30858)
4 years ago
wanghuancoder 90d92111cf
let LayerList could add [None], test=develop (#30911)
4 years ago
taixiurong 24873f4f77
dyngraph (#30892)
4 years ago
Zhen Wang 71acde9afc
Use correct master weights in AdamW. (#30895)
4 years ago
Jacek Czaja abfa822650
[oneDNN]Extended adaptive pooling support for oneDNN pool kernel (#30757)
4 years ago
Zhang Ting e97905c5fa
improve performance of momentum (#30881)
4 years ago
cucuzg ac2e2e6b7f
add clip_by_norm on kunlun, *test=kunlun (#30862)
4 years ago
Kaipeng Deng 302427170f
remove numpy array check in single-process dataloader. test=develop (#30861)
4 years ago
wawltor b7560a59ab
fix the broadcast for the large second input (#30818)
4 years ago
JamesLim 6e1e036a75
Implement cuda kernel for index_sample. (#30380)
4 years ago
AshburnLee 666efc2336
Call new cudnn batch norm API regardless of data type and data layout (#30157)
4 years ago
石晓伟 2ac4143b6c
support xpu with analysis predictor, test=develop (#30832)
4 years ago
joejiong 05d2b7a37f
Update paddle.static.Print with paddle2.0 api (#30846)
4 years ago
Aurelius84 e49d0746dd
[CustomOp] Support install as Package and Add load interface (#30798)
4 years ago
Adam Osewski 4f066e316e
Layer normalization fuse pass. (#30721)
4 years ago
WangXi b1026f64af
【kunlun】dygraph supports multi xpu card training (#30671)
4 years ago
LielinJiang 3a3ff75c52
Fix unittest random failed of test_datasets (#30804)
4 years ago
Shang Zhizhou b909450994
fix trt plugin clone and initialize bugs in TRT7.1+ (#30709)
4 years ago
Shang Zhizhou 200ee33df8
fix unittest random error (#30808)
4 years ago
xiemoyuan db87087283
Optimize the encoder of Transformer. (#30439)
4 years ago
WangXi 31ed9c9eed
Fleet distributed strategy support pure fp16 (#30754)
4 years ago
Aurelius84 2c974cc316
【CustomOp】support setup.py to compile custom op (#30753)
4 years ago
Jiaqi Liu 65a9744cfd
fix paddle.static.acc and auc sample code bug, test=document_fix (#30715)
4 years ago
Wojciech Uss fc00240575
A fix for oneDNN matmul kernel. Fixes issue #30309 (#30723)
4 years ago
tianshuo78520a a12b6bb9cb
add readme in whl package (#30726)
4 years ago
WeiXin 3491acfb1e
Split unittest. (#30727)
4 years ago
liu zhengxi a87d78f1a9
update gather_tree doc (#30693)
4 years ago
liu zhengxi fef3654b4e
upgrade gather_tree to core.ops (#30697)
4 years ago
jakpiase f8da5536ed
REUPLOAD Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30719)
4 years ago
liym27 13ef444fa6
[Dy2Stat] Fix error message when the message has more than one lines. (#30714)
4 years ago
Tao Luo 824a79d383
Revert "Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30661)" (#30708)
4 years ago
jakpiase d834f4e6e8
Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30661)
4 years ago
Leo Chen 1a13626f5f
polish printing dtype (#30682)
4 years ago
WangXi a28a202603
fix test_gen_nccl_id_op failed (#30686)
4 years ago
123malin 164275704d
test=develop, fix nonzero astuple=true (#30647)
4 years ago
yingshengBD 0eea5d714f
post quantize support insert fake_quantize_dequantize node before the OPs that will be used in VIS's faceid models (#30659)
4 years ago
123malin 06a3e31148
test=develop, fix test_lookahead (#30677)
4 years ago
yukavio 8c5f158172
remove PrettyTable dependence from paddle.flops (#30675)
4 years ago
chentianyu03 fb7fbc7a5d
fix abs bug and add abs test case (#30637)
4 years ago
ShenLiang 9514b4aa5f
Fix scatter grad bug (#30604)
4 years ago
Qi Li 1f5841c2a0
[ROCM] update cmake and dockerfile, test=develop (#30598)
4 years ago
Zhen Wang 4a9de931a2
Fix the bug in fleet amp_init. (#30606)
4 years ago
cnn 7e9f336b58
update document of paddle.vision.dataset, test=document (#30414)
4 years ago
guofei 430f8449f1
Fix the error of save_quantized_model (#30583)
4 years ago
TTerror 10271ddfc4
support reduce_max op on kunlun (#30581)
4 years ago
WeiXin ca33821475
延长单测'test_static_save_load'超时 (#30599)
4 years ago
chentianyu03 358106fcb0
make abs op support complex types (#30375)
4 years ago
huangxu96 138620084c
Add fleet amp_init() (#30572)
4 years ago
wanghuancoder 27a5c0cff6
fix layers train eval bug (#30580)
4 years ago