Commit Graph

12 Commits (07dc5a1506b4c349b7771f7bec342c11ae0477b1)

Author SHA1 Message Date
Wu Yi 29d9fb53fc
[Feature] multi process multi gpu dist training, boost v100 performance by 20% (#14661)
6 years ago
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
6 years ago
chengduo 2c9839c847
add cuda version display (#13885)
6 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization (#13530)"
6 years ago
chengduo 1d91a49d2f
Some trivial optimization (#13530)
6 years ago
fengjiayi 9f11da5931 Add synchronous TensorCopy and use it in double buffer
7 years ago
Yi Wang 535646cf25 Update (#9717)
7 years ago
Yi Wang 0c43a376e2
Fix cpplint errors with paddle/fluid/platform/gpu_info.* (#9710)
7 years ago
Kexin Zhao 1998d5afa2 add gpu info func to get compute cap
7 years ago
chengduoZH 00e596edbe get max threads of GPU
7 years ago
qingqing01 24509f4af9 Fix the grammar in copyright. (#8403)
7 years ago
Yi Wang 90648f336d Move file to fluid/; Edit CMakeLists.txt
7 years ago