Commit Graph

18150 Commits (b33aaea86ccaf6f41eca018377e36ed91b64dd27)

Author SHA1 Message Date
wawltor b33aaea86c
add the op version check for the elementwise ops, test=op_version (#30010)
4 years ago
Chengmo 4cbcc9b6da
fix momentum op register (#29941)
4 years ago
hutuxian 7c1f69bdf0
add op_version for flip op [test=op_version] (#30019)
4 years ago
ceci3 77c1684397
register ModifyAttr for instance_norm, test=op_version (#29938)
4 years ago
Leo Chen 47d10c55d5
Enhance debugging (#30001)
4 years ago
FlyingQianMM d42f93e504
add op_register_version for allclose op; test=op_version (#29968)
4 years ago
wawltor 8f49f9d5c9
change the elementwise ops version check, test=op_version
4 years ago
guofei b23faf37be
Add moving_average_abs_max_scale op_register_version test=develop (#29957)
4 years ago
Thunderbrook 0ca6de171f
add include (#29952)
4 years ago
zhangchunle 631d783748
fix bug in windows ci (#29963)
4 years ago
Pei Yang 6206b9bc71
fix ut:trt_resnext_test, trt_quant_int8_yolov3_r50_test, test_trt_dynamic_shape_ernie, test_trt_dynamic_shape_ernie_fp16_ser_deser, trt_cascade_rcnn_test (#29977)
4 years ago
wangxinxin08 be8b5fd18a
register op version for conv2d_transpose, conv3d_transpose and depthwise_conv2d_transpose, test=op_version (#29937)
4 years ago
石晓伟 958612231f
compile the denormal.cc on aarch64, test=develop (#29956)
4 years ago
Guo Sheng 6ac4f0af6a
Register op version for coalesce_tensor. (#29940)
4 years ago
Chen Weihang a1d9a14e89
support grad accumulated across batch (#29942)
4 years ago
cc 6a0102b038
map matmul/squeeze2+matmul/reshape2+matmul to mul (#29911)
4 years ago
Huihuang Zheng d038746e1c
Fix Unix Sleep for Wrong Time. test=develop (#29953)
4 years ago
YUNSHEN XIE 121658d251
Support xpu ut coverage (#29892)
4 years ago
Jack Zhou 5a4e42ca9a
add gru op_register_version; test=op_version; (#29931)
4 years ago
Wilber 2b1d796cd0
[Inference] Solve 2.0 trt performance reduce compare 1.8. (#29925)
4 years ago
Qi Li 913f77a4b7
Register op version for print, test=op_version (#29945)
4 years ago
石晓伟 181ea1870b
flush denormals to zero, test=develop (#29924)
4 years ago
cc 7667e59bf7
add op version for fake_quant and fake_dequant ops, test=op_version (#29923)
4 years ago
石晓伟 acb5e86363
fix a bug in reset_tensor_array, test=develop (#29620)
4 years ago
liuyuhui 3d1741b794
[Kunlun] bug fix of PR2: Support MultiDevicePass and BKCL in parallel executor (#29926)
4 years ago
Wilber 332da133a1
Support mips arch (#29903)
4 years ago
LielinJiang eab0b60e16
Register op version for grid_sampler, test=op_version (#29916)
4 years ago
liym27 9602a182b2
[Dynamic Inplace] Support ShareInplaceVersionCounterWith for C++ Tensor (#29842)
4 years ago
liuyuhui 4427df37cf
[Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574)
4 years ago
LielinJiang 0f4b218640
Enable bilateral_slice unittest on windows platform (#29896)
4 years ago
Ren Wei (任卫) 95df0e1447
Add the ipipe log param prefix (#29545)
4 years ago
YUNSHEN XIE 2a01756bf3
remove duplicate ut names (#29809)
4 years ago
Chen Weihang a6072055be
[Complex] Handle complex to real after type promotion (#29855)
4 years ago
Chen Weihang 1a304e6c06
[Complex] Add support for complex grad accumulated (#29889)
4 years ago
taixiurong c7acad9f2f
support some shape for matmul and cast in xpu place (#29900)
4 years ago
Leo Chen 6b258317cb
fix TransferInplaceBack (#29830)
4 years ago
QingshuChen 59b47f3b32
feat: support check_nan_inf for kunlun/xpu device (#29694)
4 years ago
tangwei12 032414ca2a
[Feature] one ps (3/4) (#29604)
4 years ago
jakpiase edc06c6a1b
Added fc + activation fuse pass (currently only gelu, sigmoid and tanh are supported) (#29772)
4 years ago
Wilber 2c0a4a3470
call_statck is turned on default when ON_INFER=ON (#29798)
4 years ago
Wilber ad0b01ffe2
lod operator should not be reused in memory_optimize pass. (#29828)
4 years ago
liym27 97e75ad0f5
[setitem] Support Tensor setitem in static mode (#29708)
4 years ago
YUNSHEN XIE 24ce051a84
remove duplicate ut reload (#29810)
4 years ago
Jacek Czaja c9e874fc8e
[oneDNN] Unit test for checking oneDNN caching (#29606)
4 years ago
Thunderbrook 09b6e71928
heter box (#29734)
4 years ago
Jacek Czaja 7b33720c90
[oneDNN] Tensor copy fix to oneDNN tensors (#29771)
4 years ago
123malin a400b76db7
Roll cuda kernel (#29655)
4 years ago
wuhuanzhou e7ac74c85b
optimize compilation time of argmin/argmax op (#29595)
4 years ago
Zhou Wei 3f83ec61c2
move running unittest on windows to another file (#29815)
4 years ago
chentianyu03 ddfc3d2c2f
change grad elementwise_mul for complex types (#29757)
4 years ago