9667 Commits (8c296dea759fb64a68f21090a968f8f253b84972)

Author SHA1 Message Date
Pei Yang 8c296dea75
fix compile error(cpuid.h not found) on nvidia jetson platforms. test=develop (#24329)
5 years ago
Guo Sheng 4a5de14426
Remove cusolver potrfBatched support on Windows. (#24338)
5 years ago
Tao Luo c1df7048c7
add UT for mkldnn_cache_capacity (#24336)
5 years ago
Feiyu Chan 420707c26e
API/OP (margin_rank_loss, nce, row_conv, positive_negative_pair) erro… (#24246)
5 years ago
joanna.wozna.intel 356f5ee220
[Refactoring] Unify op-dequant squashes (#24277)
5 years ago
liym27 ac9a7eeea4
[Dy2Stat]Support list pop (#24250)
5 years ago
Tao Luo 9eedf05d2f
solve mklml memory leak on windows (#24015)
5 years ago
Zhang Ting b71abeee1b
use 32 bit index to improve activation ops (#24206)
5 years ago
xujiaqi01 1034ca316f
add timeout and http store in communication (#23436)
5 years ago
Guo Sheng 1fc6cc502a
Fix cusolver loader for Windows (#24157)
5 years ago
wawltor d1e1d85881
add the graph batch reader for pslib mode (#24178)
5 years ago
liym27 803559499d
[Dy2Stat]Support LoDTensorArray for slice op (#23091)
5 years ago
Feiyu Chan 78cd3dd507
fix kron_op: when only one input needs gradient, test=develop (#24269)
5 years ago
Li Fuchen 5dc069d050
OP(warpctc, add_position_encoding, scaled_dot_product_attention) error message enhancement (#24261)
5 years ago
liym27 19511dfa25
Fix bug in assign op: support to infer shape for LOD_TENSOR_ARRAY. (#24268)
5 years ago
joanna.wozna.intel b43b46e619
[INT8] Add requant-op squash (#24143)
5 years ago
石晓伟 17ac6e2580
update the analysis predictor for multi-stream support, test=develop (#24046)
5 years ago
hutuxian 3e2bc8715f
Try to fix UT Random Fail (#24223)
5 years ago
lidanqing 61ec30f030
Update QAT INT8 2.0 doc (#24127)
5 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
5 years ago
Tao Luo d7850dcdc0
add noavx_axpy and noavx_axpy_noadd (#24207)
5 years ago
Huihuang Zheng 3047b69f94
Make Allocation Smaller on BuddyAllocatorTest to Decrease CI Failure (#24205)
5 years ago
Chen Weihang 9b851ba216
[dy2static] Add print transformer and unify print format (#24068)
5 years ago
Li Fuchen 077e5a0fe5
Add trace op (#23873)
5 years ago
wangchaochaohu fa43d74a3a
fix the intermediate node of graph for fusion group test=develop (#24184)
5 years ago
Yiqun Liu ecfddebbef
Add the implementation of inverse (#23310)
5 years ago
HappyAngel 8ea325aa53
c++ API ( average_accumulates, tensor_array_to_tensor and average_accumulates) error message enhance. test=develop (#23631)
5 years ago
ShenLiang 0fb9b208ab
Add batch_fc op in contrib (#24017)
5 years ago
liuwei1031 9a93f6aae0
improve efficiency of runtime InferVarType (#22778)
5 years ago
hong bfb60efb1f
change compare forece_cpu default value; test=develop (#23888)
5 years ago
wangchaochaohu 2270864019
Fusion group optimize for cuda codegen(#23940)
5 years ago
ShenLiang 94dfb7d770
opt the postprocess, test=develop (#24155)
5 years ago
Chengmo 28a558e867
Update index sample (#24109)
5 years ago
Zhang Ting ab8f8fa70d
fix example code, test=develop, test=document_fix (#24139)
5 years ago
Feiyu Chan e01262e691
add kron op (#24105)
5 years ago
Jacek Czaja eb411613e9
[DNNL] refine activations Inplace support (#24145)
5 years ago
wangchaochaohu 6bf26ef156
fix warning mac compiler (#24138)
5 years ago
lijianshe02 a398464e77
API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, … (#24118)
5 years ago
Guo Sheng a8c0fb4e86
Add cholesky_op (#23543)
5 years ago
Jacek Czaja 461e6a01ec
[DNNL] activations Inplace support (#24123)
5 years ago
wangchaochaohu 6ba7c3ac92
Reduce the construction time of fuction about profiler (#24117)
5 years ago
Zhang Ting fb0729ee7f
avoid warnings in MAC compile (#24124)
5 years ago
qingqing01 b059fb955d
Add trainable_statistics in attr for batch_norm. (#24072)
5 years ago
arlesniak d31a174f51
added fusing matmul-transpose-reshape pass (#23866)
5 years ago
石晓伟 46f3139c7f
supports loading model from memory, test=develop (#24098)
5 years ago
xiaoting 96ffebef55
fix bicubic, change int to floor (#24063)
5 years ago
Zeng Jinle a67eea9f00
polish code by adding final, test=develop, test=develop (#24114)
5 years ago
Zeng Jinle ab2e284235
fix compilation failure (#24091)
5 years ago
石晓伟 34d7d6aef0
declare the stream::Priority as enum class, test=develop (#24013)
5 years ago
Pei Yang 695a53c874
remove conv_bn_fuse_pass and fc_fuse_pass in trt int8 calibration. test=develop (#23805)
5 years ago