Commit Graph

19 Commits (3630386a894528404da43a32cb3683f793baf8ad)

Author SHA1 Message Date
dzhwinter ab1097cd8e
Feature/template (#13093)
7 years ago
qingqing01 1f09bc320c
Support data type int8_t . (#12841)
7 years ago
yuyang18 27197290dc matmul support float16/double
7 years ago
Yu Yang ef6ea790dc Clean and extract blas
7 years ago
Yu Yang 815d888468 Clean MatMul
7 years ago
Yu Yang c888e01660 Refactor GEMM in blas
7 years ago
Yu Yang 2a06e307d0 Fix batch_gemm bugs
7 years ago
Kexin Zhao 92913027fc
fix unused var error (#9908)
7 years ago
Kexin Zhao 617e790a59
fix cuda 7.5 compile error (#9885)
7 years ago
Kexin Zhao 7ed457e77a Fix cuda 7.5 error with cublas GEMM (#9811)
7 years ago
Kexin Zhao d00bd9eb72 Update the cuda API and enable tensor core for GEMM (#9622)
7 years ago
Kexin Zhao ed2bc194c5
Merge pull request #9176 from kexinzhao/batch_norm_fp16
7 years ago
Kexin Zhao 39c676e208 initial commit
7 years ago
yangyaming bf3f56e899 Finish adaption for backward.
7 years ago
Kexin Zhao 3b44b849d3 address comments
7 years ago
kexinzhao 90215b7844
Add float16 GEMM math function on GPU (#8695)
7 years ago
qingqing01 24509f4af9 Fix the grammar in copyright. (#8403)
7 years ago
Yi Wang fc374821dd Correct #include path
7 years ago
Yi Wang 90648f336d Move file to fluid/; Edit CMakeLists.txt
7 years ago