Commit Graph

14 Commits (c308c88d71e183e3ae79075c94f4ee5f72982fa8)

Author SHA1 Message Date
Huihuang Zheng ea6ee76fa9
GPU allocation uses fraction of available memory (#18896)
6 years ago
zhhsplendid 22715487dc add allocator flags
6 years ago
Wu Yi 29d9fb53fc
[Feature] multi process multi gpu dist training, boost v100 performance by 20% (#14661)
6 years ago
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
6 years ago
chengduo 2c9839c847
add cuda version display (#13885)
6 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization (#13530)"
6 years ago
chengduo 1d91a49d2f
Some trivial optimization (#13530)
6 years ago
fengjiayi 9f11da5931 Add synchronous TensorCopy and use it in double buffer
7 years ago
Yi Wang 535646cf25 Update (#9717)
7 years ago
Yi Wang 0c43a376e2
Fix cpplint errors with paddle/fluid/platform/gpu_info.* (#9710)
7 years ago
Kexin Zhao 1998d5afa2 add gpu info func to get compute cap
7 years ago
chengduoZH 00e596edbe get max threads of GPU
7 years ago
qingqing01 24509f4af9 Fix the grammar in copyright. (#8403)
7 years ago
Yi Wang 90648f336d Move file to fluid/; Edit CMakeLists.txt
7 years ago