44 Commits (b6a4349dd40eee17e485e149e09af4b29caa3d66)

Author SHA1 Message Date
wawltor b6a4349dd4 fix the error message for the math dir
5 years ago
ShenLiang c609066074 Add Matmul op (#26411)
5 years ago
Guo Sheng a8c0fb4e86 Add cholesky_op (#23543)
5 years ago
tianshuo78520a 433cef03e5 fix typo word (#22784)
5 years ago
Liufang Sang f0b1518438 add dequantize_abs_max op and modify lookup_table op (#20899)
6 years ago
danleifeng 425279a57b Improve elementwise operators performance in same dimensions. (#19763)
6 years ago
Bob Zhu c670058a8d add support of matmul with multiple head even different width and height (#19708)
6 years ago
Tao Luo 0a46d34538 refine some PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19607)
6 years ago
zhouwei25 84c728013c fix the compilation issue on windows caused by mkl_CSRMM (#19533)
6 years ago
Yihua Xu b920395842 Use sparse matrix to implement fused emb_seq_pool operator (#19064)
6 years ago
Bob Zhu 220eef602e Extend Matmul to support matrix multiplication with multiple heads (#18570)
6 years ago
Zeng Jinle f5641000bb Add a unittest to inplace elementwise_add (#18385)
6 years ago
Yihua Xu 7396788694 Optimize gelu operation with mkl erf.
6 years ago
tensor-tang ee2321debd Revert 15770 develop a6910f900 gelu mkl opt (#15872)
6 years ago
Yihua Xu 676995c86c Optimze Gelu with MKL Erf function (#15770)
6 years ago
Yu Yang 7b10bf0e60 Use mkl
7 years ago
Jacek Czaja 48e1b97e8e - Coding style fixes
7 years ago
Jacek Czaja cf40daee58 - Building fix to softmax for inference
7 years ago
Jacek Czaja 8bfa1fa9bb - ASUM MKL integration
7 years ago
tensor-tang 64f7516aee fix lrn on mac (#14426)
7 years ago
tensor-tang 1be85d011d add mkl vsqr and vpow
7 years ago
tensor-tang cf5ea925c3 fix bugs
7 years ago
tensor-tang 3dd66390b2 add blas vexp
7 years ago
tensor-tang 0ec1f65cf1 fix blas dot and add cblas scal
7 years ago
tensor-tang a2203d0466 add cblas dot
7 years ago
tensor-tang f72ab8961e refine blas gemm
7 years ago
tensor-tang 6644ce79a5 add mklml vmul
7 years ago
tensor-tang 54c95e49f0 fix blas
7 years ago
tensor-tang 43cee33a23 add mkl packed gemm
7 years ago
tensor-tang a916c52579 refine gemm
7 years ago
tensor-tang 961e754c9f mkl split gemm for better perf
7 years ago
tensor-tang 1c5d6c5692 disable xsmm with float16
7 years ago
tensor-tang 64a8e6d20e refine the threshold functions
7 years ago
tensor-tang 6bc1aaaac7 refine the ColMajor replacement
7 years ago
tensor-tang de856da9a6 fix ColMajor and RowMajor replacement
7 years ago
tensor-tang c3941745b3 add libxsmm_gemm
7 years ago
tensor-tang f503f12925 enable dynamic load mklml lib on fluid
7 years ago
Tomasz Patejko e43c8f33cd MKL elementwise add: elementwise_add uses vAdd VML function when MKL is used
7 years ago
yuyang18 66590a0b88 Fix typo in blas_impl.h
7 years ago
Yu Yang 0a13d3c67a Move MatMul to blas_impl.h
7 years ago
Yu Yang ef6ea790dc Clean and extract blas
7 years ago
Yu Yang 815d888468 Clean MatMul
7 years ago
Yu Yang 4db43c6c9f Naive implement cblas
7 years ago
Yu Yang c888e01660 Refactor GEMM in blas
7 years ago