Commit Graph

29930 Commits (0020d915063eda3ff641f50d0e0fca8f44fce552)
 

Author SHA1 Message Date
lilong12 01950ceb42
fix the bug in pipeline data parallelism (#29731)
4 years ago
LielinJiang 0f4b218640
Enable bilateral_slice unittest on windows platform (#29896)
4 years ago
Ren Wei (任卫) 95df0e1447
Add the ipipe log param prefix (#29545)
4 years ago
YUNSHEN XIE 2a01756bf3
remove duplicate ut names (#29809)
4 years ago
Chen Weihang a6072055be
[Complex] Handle complex to real after type promotion (#29855)
4 years ago
Chen Weihang 1a304e6c06
[Complex] Add support for complex grad accumulated (#29889)
4 years ago
taixiurong c7acad9f2f
support some shape for matmul and cast in xpu place (#29900)
4 years ago
guofei 80eb77788f
Skip Windows Multi-GPU test of test_fetch_lod_tensor_array (#29508)
4 years ago
Leo Chen 6b258317cb
fix TransferInplaceBack (#29830)
4 years ago
QingshuChen 59b47f3b32
feat: support check_nan_inf for kunlun/xpu device (#29694)
4 years ago
wawltor 7498df2587
add the cumsum unit test for the develop (#29881)
4 years ago
wanghuancoder 26f9ab70f7
if PR have no .py files, do not use 'python coverage run', to speedup unit test (#29739)
4 years ago
Tao Luo 5d130d5670
Revert "fix conv2d int8 windows UT (#29528)" (#29869)
4 years ago
tangwei12 032414ca2a
[Feature] one ps (3/4) (#29604)
4 years ago
jakpiase edc06c6a1b
Added fc + activation fuse pass (currently only gelu, sigmoid and tanh are supported) (#29772)
4 years ago
Chen Weihang 0e0bb1b97d
replace exit method (#29862)
4 years ago
Wilber 2c0a4a3470
call_statck is turned on default when ON_INFER=ON (#29798)
4 years ago
lidanqing 067d7f1d0d
fix conv2d int8 windows UT (#29528)
4 years ago
Wilber ad0b01ffe2
lod operator should not be reused in memory_optimize pass. (#29828)
4 years ago
tianshuo78520a 9370aa6f56
Update openssl (#29424)
4 years ago
liym27 97e75ad0f5
[setitem] Support Tensor setitem in static mode (#29708)
4 years ago
YUNSHEN XIE 24ce051a84
remove duplicate ut reload (#29810)
4 years ago
Jacek Czaja c9e874fc8e
[oneDNN] Unit test for checking oneDNN caching (#29606)
4 years ago
Thunderbrook 09b6e71928
heter box (#29734)
4 years ago
LielinJiang 1092da82b2
Change the conditions of hapi printing logs (#29792)
4 years ago
Jacek Czaja 7b33720c90
[oneDNN] Tensor copy fix to oneDNN tensors (#29771)
4 years ago
123malin a400b76db7
Roll cuda kernel (#29655)
4 years ago
wuhuanzhou e7ac74c85b
optimize compilation time of argmin/argmax op (#29595)
4 years ago
ceci3 c4eb5d0378
fix unittest timeout (#29820)
4 years ago
wuhuanzhou c1797c8827
Optimize op benchmark ci log (#29586)
4 years ago
Zhou Wei 3f83ec61c2
move running unittest on windows to another file (#29815)
4 years ago
chentianyu03 ddfc3d2c2f
change grad elementwise_mul for complex types (#29757)
4 years ago
chentianyu03 2a260d9b0e
change the grad of div when complex types (#29804)
4 years ago
syyxsxx e219b8ccef
fix api link for the any, all, isfinite
4 years ago
YUNSHEN XIE e617707229
add retry for download lcov failed (#29789)
4 years ago
Guo Sheng 356efd36fa
Remove test_rnn_decode_api from disable list. (#29814)
4 years ago
ShenLiang f65f1caad3
opt sparse allreduce using ncclgather (#29819)
4 years ago
TTerror 82aa01c373
add nearest_interp_v2 on kunlun (#29725)
4 years ago
yukavio 0f97ff0368
fix flops (#29818)
4 years ago
wangchaochaohu 01c37c8e02
refine the compiler error for half2 operation (#29816)
4 years ago
whs 82630408b4
Support double backward rsqrt (#29589)
4 years ago
Zhang Ting b76f5a8489
fix the bug of dropout_grad (#29813)
4 years ago
cc 61820fd217
add the time threshold of quantization tests, test=develop (#29786)
4 years ago
Qi Li 6a9b307e8a
[ROCM] add dockerfile for rocm3.5 and rocm3.9, test=develop (#29735)
4 years ago
xiaoting 55725cd2e1
fix for timeout, test=develop (#29788)
4 years ago
LielinJiang a94c3cbbf3
register cudnn conv double grad for depthwise conv (#29807)
4 years ago
ShenLiang 01e2874a0e
Support multi-stream communication for dynamic graph distributed (#29525)
4 years ago
wangchaochaohu f350aa59ff
Fix the compiler error for half type (#29799)
4 years ago
wuhuanzhou 27aa15150c
Add approval for PR-CI-OP-benchmark (#29797)
4 years ago
huangxu96 a29006d128
Optimizer trans momentum (#29597)
4 years ago