Commit Graph

4507 Commits (0b74428db8f4ab09b456faceca5c357105c80b1d)

Author SHA1 Message Date
LielinJiang 0b74428db8
Fix Conv2DTanspose bug when padding='same' (#29915)
4 years ago
lilong12 01950ceb42
fix the bug in pipeline data parallelism (#29731)
4 years ago
Chen Weihang a6072055be
[Complex] Handle complex to real after type promotion (#29855)
4 years ago
Chen Weihang 1a304e6c06
[Complex] Add support for complex grad accumulated (#29889)
4 years ago
guofei 80eb77788f
Skip Windows Multi-GPU test of test_fetch_lod_tensor_array (#29508)
4 years ago
Leo Chen 6b258317cb
fix TransferInplaceBack (#29830)
4 years ago
QingshuChen 59b47f3b32
feat: support check_nan_inf for kunlun/xpu device (#29694)
4 years ago
wawltor 7498df2587
add the cumsum unit test for the develop (#29881)
4 years ago
wanghuancoder 26f9ab70f7
if PR have no .py files, do not use 'python coverage run', to speedup unit test (#29739)
4 years ago
Tao Luo 5d130d5670
Revert "fix conv2d int8 windows UT (#29528)" (#29869)
4 years ago
tangwei12 032414ca2a
[Feature] one ps (3/4) (#29604)
4 years ago
jakpiase edc06c6a1b
Added fc + activation fuse pass (currently only gelu, sigmoid and tanh are supported) (#29772)
4 years ago
Chen Weihang 0e0bb1b97d
replace exit method (#29862)
4 years ago
lidanqing 067d7f1d0d
fix conv2d int8 windows UT (#29528)
4 years ago
liym27 97e75ad0f5
[setitem] Support Tensor setitem in static mode (#29708)
4 years ago
YUNSHEN XIE 24ce051a84
remove duplicate ut reload (#29810)
4 years ago
ceci3 c4eb5d0378
fix unittest timeout (#29820)
4 years ago
chentianyu03 ddfc3d2c2f
change grad elementwise_mul for complex types (#29757)
4 years ago
chentianyu03 2a260d9b0e
change the grad of div when complex types (#29804)
4 years ago
Guo Sheng 356efd36fa
Remove test_rnn_decode_api from disable list. (#29814)
4 years ago
TTerror 82aa01c373
add nearest_interp_v2 on kunlun (#29725)
4 years ago
whs 82630408b4
Support double backward rsqrt (#29589)
4 years ago
xiaoting 55725cd2e1
fix for timeout, test=develop (#29788)
4 years ago
LielinJiang a94c3cbbf3
register cudnn conv double grad for depthwise conv (#29807)
4 years ago
liym27 0cc42e34c6
Migrate 4 APIs about array to paddle.tensor.* (#29565)
4 years ago
liym27 41a7b07159
[Dy2Stat] Fix bug for loop: a variable is used and created in loop, but used before created (#29769)
4 years ago
LielinJiang e5af650b71
Add double grad for conv_transpose (#29706)
4 years ago
Wojciech Uss 6ef8129dcc
upgrade oneDNN with GRU INT8 optimizations (#28420)
4 years ago
Huihuang Zheng dfffee8a5d
[Dy2stat] Enable jit.save to Save Without Running (#29579)
4 years ago
liym27 a0b60716f1
[Dy2Stat] Support grammar: for ele in var[idx] (#29541)
4 years ago
chentianyu03 b59b6d7ae6
Complex op test (#29753)
4 years ago
liym27 096c048b45
Fix unitest test_slice (#29740)
4 years ago
Huihuang Zheng 2e788bd81e
Reduce batch size ot fix CPU memory, test=develop (#29736)
4 years ago
chentianyu03 71063b8137
add conj op for complex types (#29527)
4 years ago
Chen Weihang 6cfa59de1b
[Complex] Add real & imag op and api for complex tensor (#29672)
4 years ago
TTerror af8ded773a
update activation op on kunlun (#29577)
4 years ago
ceci3 cc387159f3
add pad and concat double grad (#29549)
4 years ago
liuyuhui f13c3a9cd7
[Kunlun] PR1:Support one Kunlun card training in parallel executor (#29337)
4 years ago
YUNSHEN XIE d0b789d27f
disable ut test_cumsum_op (#29613)
4 years ago
Jack Zhou 84bae27779
fix wmt14 doc, remove backward, add bidirect direction in rnn api (#29633)
4 years ago
YUNSHEN XIE 2926e74326
New UT should not exceed 15s (#29492)
4 years ago
Chen Weihang f02aece1f0
Add complex dtype op (add) test example (#29603)
4 years ago
AshburnLee efea540ca9
Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732)
4 years ago
lijianshe02 7779768b53
add transpose double grad test=develop (#29600)
4 years ago
ShenLiang 1efef8baed
Fix bug of matmul_v2 for broadcast case (#29599)
4 years ago
qingqing01 8d549fc85d
Add clip double grad (#29590)
4 years ago
Tao Luo 81acc3278c
disable test_parallel_executor_profiler in cuda 10.1 (#29581)
4 years ago
wangchaochaohu ac4bae8ee9
elementwise_add_grad Op optimization (#29575)
4 years ago
WangXi 467c716963
gen nccl id use socket (#29431)
4 years ago
Bai Yifan d72604cd46
fix unittst unstable issue on ci machine (#29588)
4 years ago