Commit Graph

66 Commits (b5ebca47a352412b01692d01aff7b6f4f371b685)

Author SHA1 Message Date
tensor-tang 8117725852 add jit kernel hsum, hmax and softmax refer code
6 years ago
peizhilin 1e7f83e60a add cuda dso support for windows
6 years ago
peizhilin 40a94a138f remove irrelevant fix for mkl
6 years ago
peizhilin ed5bd5e586 test=develop
6 years ago
Yu Yang 7b10bf0e60 Use mkl
6 years ago
liuhongyu 8daf67f90f fix bugs; test=develop
6 years ago
liuhongyu 968dd3c078 add cudnn 5 support; test=develop
6 years ago
phlrain cf1fe61004 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
Tao Luo ea47685f91
Merge pull request #14646 from jczaja/prv-softmax-mkl-sasum
6 years ago
Jacek Czaja 8bfa1fa9bb - ASUM MKL integration
6 years ago
liuhongyu 05917c3c79 add cudnn lstm; test=develop
6 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
6 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
6 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
chengduozh f7847ca6a3 fix cublas warp error
6 years ago
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
6 years ago
peizhilin 7c8c9dc9bf fix unit test cases
6 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
6 years ago
peizhilin 6e66fadb95 clean up the pre-definitions on windows
6 years ago
qingqing01 fd7e643153
Convolution fusion operator. (#14449)
6 years ago
Wu Yi b32c13dc20
Add cudnn ctc loss (#12366)
6 years ago
tensor-tang 1be85d011d add mkl vsqr and vpow
6 years ago
Qiyang Min 698698f2fa
Merge branch 'develop' into fix_vlog
6 years ago
qingqing01 abe209234f
Exhaustive search for cuDNN conv. (#14286)
6 years ago
minqiyang 0c3227a523 Change the origin VLOG level to 10 times
6 years ago
qingqing01 db8c52da5e Revert " Exhaustive search for cuDNN conv. (#14043)"
6 years ago
qingqing01 ce7d9b0799
Exhaustive search for cuDNN conv. (#14043)
6 years ago
whs 0c319e0b35
Add affine grid generator op (#12238)
6 years ago
dzhwinter 2d00e65819
namespace issue (#13543)
6 years ago
JiabinYang e322fc4e0e add error info for nccl not found
7 years ago
dzhwinter d361624c1d
platform module (#12932)
7 years ago
tensor-tang 3dd66390b2 add blas vexp
7 years ago
tensor-tang 0ec1f65cf1 fix blas dot and add cblas scal
7 years ago
tensor-tang a2203d0466 add cblas dot
7 years ago
dzhwinter e23ddf6ae4
status (#12764)
7 years ago
Tao Luo d04ef276a5
Merge pull request #12745 from tensor-tang/refine/op/elewise_mul
7 years ago
dzhwinter 00463fdfe3
cudnn windows support (#12757)
7 years ago
tensor-tang 6644ce79a5 add mklml vmul
7 years ago
tensor-tang 43cee33a23 add mkl packed gemm
7 years ago
dzhwinter 99a99ec7e3
"remove lapack" (#11966)
7 years ago
Tao Luo 2dae8a4631
Merge pull request #11596 from tensor-tang/refine/mklml/dyload
7 years ago
Yi Wang 2625178add
No NCCL on macOS (#11652)
7 years ago
tensor-tang 28a0ef9522 remove usr local lib when dynamic load lib
7 years ago
tensor-tang 3e73a7a924 add usr local lib to dynamic search path
7 years ago
tensor-tang f503f12925 enable dynamic load mklml lib on fluid
7 years ago
Xin Pan d2afd21021 Remove cuptiFinalize.
7 years ago
yuyang18 53dab95b75 Static DSO handle
7 years ago
yuyang18 c5115950a8 Use static for dlsym
7 years ago
Yu Yang 3d53631bad Make dyload strictly use the same ABI in header
7 years ago
Luo Tao d4682247e1 auto find tensorrt library
7 years ago