Commit Graph

4631 Commits (8ab29f4beae4e216f3043ce8420b063028355c31)

Author SHA1 Message Date
WeiXin 8ab29f4bea
delay timeout of unittest 'test_static_save_load'. (#30975)
4 years ago
Chen Weihang 010f2caa23
try to fix reader and signal test failed (#30960)
4 years ago
liym27 12c15bebe4
[Static setitem] Support index is ellipsis for setitem in static mode (#30836)
4 years ago
liuyuhui 87197f8c2e
[kunlun]fix sync in multi kunlun xpu dygraph training. (#30943)
4 years ago
Jacek Czaja 9e527d9956
[oneDNN] Added basic changes for elementwise_add_grad bf16 (#30925)
4 years ago
liuyuhui 4a8b8b4547
[Kunlun] add gen_bkcl_id_op, support multi XPU cards training using multiprocess (#30858)
4 years ago
Jacek Czaja abfa822650
[oneDNN]Extended adaptive pooling support for oneDNN pool kernel (#30757)
4 years ago
cucuzg ac2e2e6b7f
add clip_by_norm on kunlun, *test=kunlun (#30862)
4 years ago
wawltor b7560a59ab
fix the broadcast for the large second input (#30818)
4 years ago
JamesLim 6e1e036a75
Implement cuda kernel for index_sample. (#30380)
4 years ago
AshburnLee 666efc2336
Call new cudnn batch norm API regardless of data type and data layout (#30157)
4 years ago
joejiong 05d2b7a37f
Update paddle.static.Print with paddle2.0 api (#30846)
4 years ago
Adam Osewski 4f066e316e
Layer normalization fuse pass. (#30721)
4 years ago
WangXi b1026f64af
【kunlun】dygraph supports multi xpu card training (#30671)
4 years ago
Shang Zhizhou b909450994
fix trt plugin clone and initialize bugs in TRT7.1+ (#30709)
4 years ago
Shang Zhizhou 200ee33df8
fix unittest random error (#30808)
4 years ago
xiemoyuan db87087283
Optimize the encoder of Transformer. (#30439)
4 years ago
WangXi 31ed9c9eed
Fleet distributed strategy support pure fp16 (#30754)
4 years ago
Wojciech Uss fc00240575
A fix for oneDNN matmul kernel. Fixes issue #30309 (#30723)
4 years ago
WeiXin 3491acfb1e
Split unittest. (#30727)
4 years ago
liu zhengxi fef3654b4e
upgrade gather_tree to core.ops (#30697)
4 years ago
jakpiase f8da5536ed
REUPLOAD Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30719)
4 years ago
liym27 13ef444fa6
[Dy2Stat] Fix error message when the message has more than one line. (#30714)
4 years ago
Tao Luo 824a79d383
Revert "Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30661)" (#30708)
4 years ago
jakpiase d834f4e6e8
Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30661)
4 years ago
Leo Chen 1a13626f5f
polish printing dtype (#30682)
4 years ago
WangXi a28a202603
fix test_gen_nccl_id_op failed (#30686)
4 years ago
chentianyu03 fb7fbc7a5d
fix abs bug and add abs test case (#30637)
4 years ago
ShenLiang 9514b4aa5f
Fix scatter grad bug (#30604)
4 years ago
Qi Li 1f5841c2a0
[ROCM] update cmake and dockerfile, test=develop (#30598)
4 years ago
Zhen Wang 4a9de931a2
Fix the bug in fleet amp_init. (#30606)
4 years ago
TTerror 10271ddfc4
support reduce_max op on kunlun (#30581)
4 years ago
WeiXin ca33821475
Extend timeout of unit test 'test_static_save_load' (#30599)
4 years ago
chentianyu03 358106fcb0
make abs op support complex types (#30375)
4 years ago
huangxu96 138620084c
Add fleet amp_init() (#30572)
4 years ago
lilong12 8126a41d73
fix the bug of all_reduce pipeline gradient multiple times (#30437)
4 years ago
Aurelius84 621bc4f771
[Dy2static]Fix paddle prefix in is_paddle_api (#30569)
4 years ago
Aurelius84 5067e3a8d2
[Dy2Static]Enhance check of TracedLayers out vars (#30576)
4 years ago
liym27 ff25c5b36f
Fix bug: GetAttrValue should deal with attr with attrType vector<double> (#30536)
4 years ago
WangXi 572c466d19
[Prepare for MultiProcess xpu] unified gen nccl id, refine imperative reducer (#30455)
4 years ago
ykkk2333 549855ac20
add rmsprop_op_xpu test=kunlun (#30493)
4 years ago
Leo Chen 7043b8cfc6
support layer_norm fp16 in dygraph amp (#30430)
4 years ago
Zhang Ting 66c514ce83
[2.0 API] device guard (#30307)
4 years ago
WangXi 7a0a576e51
fix adamw lr_to_coeff is fixed when dygraph (#30526)
4 years ago
WeiXin c0fb03a0dc
Supplement PR29988(https://github.com/PaddlePaddle/Paddle/pull/29988) (#30507)
4 years ago
hutuxian 40ede12631
Ascend Framework Part1: OP & Wrapper (#30281)
4 years ago
gongweibao bdae7ed326
Fix potential port conflicts. (#30508)
4 years ago
QingshuChen 8489d4f76f
optimize batch_norm & pool op for kunlun (#30490)
4 years ago
taixiurong 5e5c2827a3
fix range op crash in dygraph xpu place (#30469)
4 years ago
WeiXin 18ecd433f5
Avoid bug on 'MAC python3.5/6'. (#30485)
4 years ago