18 Commits (03479469a700ce30edea0fe80a7c14982a6082db)

Author SHA1 Message Date
danleifeng 425279a57b Improve elementwise operators performance in same dimensions. (#19763)
5 years ago
zhouwei25 84c728013c fix the compilation issue on windows caused by mkl_CSRMM (#19533)
6 years ago
Yihua Xu b920395842 Use sparse matrix to implement fused emb_seq_pool operator (#19064)
6 years ago
Yihua Xu 7396788694 Optimize gelu operation with mkl erf.
6 years ago
tensor-tang ee2321debd Revert 15770 develop a6910f900 gelu mkl opt (#15872)
6 years ago
Yihua Xu 676995c86c Optimze Gelu with MKL Erf function (#15770)
6 years ago
tensor-tang 8117725852 add jit kernel hsum, hmax and softmax refer code
6 years ago
peizhilin ed5bd5e586 test=develop
6 years ago
Yu Yang 7b10bf0e60 Use mkl
6 years ago
Jacek Czaja 8bfa1fa9bb - ASUM MKL integration
6 years ago
tensor-tang 1be85d011d add mkl vsqr and vpow
6 years ago
tensor-tang 3dd66390b2 add blas vexp
7 years ago
tensor-tang 0ec1f65cf1 fix blas dot and add cblas scal
7 years ago
tensor-tang a2203d0466 add cblas dot
7 years ago
dzhwinter e23ddf6ae4 status (#12764)
7 years ago
tensor-tang 6644ce79a5 add mklml vmul
7 years ago
tensor-tang 43cee33a23 add mkl packed gemm
7 years ago
tensor-tang f503f12925 enable dynamic load mklml lib on fluid
7 years ago