Commit Graph

14 Commits (03479469a700ce30edea0fe80a7c14982a6082db)

Author SHA1 Message Date
Yiqun Liu 42b5bec6f9
Integrate NVRTC to support compiling CUDA kernel at runtime (#19422)
6 years ago
chengduozh f7847ca6a3 fix cublas warp error
6 years ago
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
6 years ago
dzhwinter 2d00e65819
namespace issue (#13543)
6 years ago
dzhwinter e23ddf6ae4
status (#12764)
7 years ago
yuyang18 c5115950a8 Use static for dlsym
7 years ago
Yu Yang 3d53631bad Make dyload strictly use the same ABI in header
7 years ago
Kexin Zhao 7ed457e77a Fix cuda 7.5 error with cublas GEMM (#9811)
7 years ago
Yi Wang e185502ebe
Fix cpplint errors with paddle/fluid/platform/dynload (#9715)
7 years ago
Kexin Zhao d00bd9eb72 Update the cuda API and enable tensor core for GEMM (#9622)
7 years ago
kexinzhao 90215b7844
Add float16 GEMM math function on GPU (#8695)
7 years ago
qingqing01 24509f4af9 Fix the grammar in copyright. (#8403)
7 years ago
Yi Wang fc374821dd Correct #include path
7 years ago
Yi Wang 90648f336d Move file to fluid/; Edit CMakeLists.txt
7 years ago