Commit Graph

22 Commits (1a304e6c069391dd543a3f95a8f9b0826c3e7b93)

Author SHA1 Message Date
Chen Weihang 1a304e6c06
[Complex] Add support for complex grad accumulated (#29889)
5 years ago
chentianyu03 8f45d14263
add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199)
5 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
5 years ago
wangchaochaohu 3eacced950
[cuda11 support] add support for cublas load of same function name (parameter diff) (#26963)
5 years ago
Chen Weihang 172d4ecb6c
remove WITH_DSO compile option (#25444)
5 years ago
Yiqun Liu ecfddebbef
Add the implementation of inverse (#23310)
6 years ago
Guo Sheng a8c0fb4e86
Add cholesky_op (#23543)
6 years ago
littletomatodonkey 1c08a2136e
test=develop, add addmm op (#23384)
6 years ago
Yiqun Liu 42b5bec6f9
Integrate NVRTC to support compiling CUDA kernel at runtime (#19422)
6 years ago
chengduozh f7847ca6a3
fix cublas warp error
7 years ago
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
7 years ago
dzhwinter 2d00e65819
namespace issue (#13543)
7 years ago
dzhwinter e23ddf6ae4
status (#12764)
7 years ago
yuyang18 c5115950a8
Use static for dlsym
7 years ago
Yu Yang 3d53631bad
Make dyload strictly use the same ABI in header
8 years ago
Kexin Zhao 7ed457e77a
Fix cuda 7.5 error with cublas GEMM (#9811)
8 years ago
Yi Wang e185502ebe
Fix cpplint errors with paddle/fluid/platform/dynload (#9715)
8 years ago
Kexin Zhao d00bd9eb72
Update the cuda API and enable tensor core for GEMM (#9622)
8 years ago
kexinzhao 90215b7844
Add float16 GEMM math function on GPU (#8695)
8 years ago
qingqing01 24509f4af9
Fix the grammar in copyright. (#8403)
8 years ago
Yi Wang fc374821dd
Correct #include path
8 years ago
Yi Wang 90648f336d
Move file to fluid/; Edit CMakeLists.txt
8 years ago