Commit Graph

38 Commits (3e1404d2087e4ea52e61ba4273638a4fa00e2928)

Author SHA1 Message Date
zhaoyuchen2018 b93870e696
Improve topk performance. () 5 years ago
Zeng Jinle a710ccc0cb
refine error message of allocator again, test=develop () 6 years ago
wangchaochaohu 7695b713e1
gpu info query refine test=develop () 6 years ago
Wilber 751812a674
enable cpu machine to run paddle in gpu lib 6 years ago
Zeng Jinle 708bd9798d
move_flags_to_unified_files_for_management, test=develop () 6 years ago
Zeng Jinle 08fa98f7cc
Fix gpu_info PADDLE_ENFORCE_GT when fraction_of_gpu_memory_to_use=1.0 () 6 years ago
Huihuang Zheng ea6ee76fa9
GPU allocation uses fraction of available memory () 6 years ago
zhouwei25 772e09560e Optimize the content of error reporting information, print error code and official document web sites () 6 years ago
liuwei1031 759530966c
print out error code of cudaGetDeviceProperties if failed () 6 years ago
Huihuang Zheng e4a5332416
Fix a typo in gpu_info.cc () 6 years ago
zhhsplendid 124f1df481 Add flags for init and re-alloc gpu 6 years ago
zhhsplendid 22715487dc add allocator flags 6 years ago
Yiqun Liu 3008fa1261
Add the CUDA kernel for beam_search op () 6 years ago
sneaxiy 9c360cc798 test=develop 6 years ago
sneaxiy 51227bd447 lazy_allocator 6 years ago
Wu Yi 29d9fb53fc
[Feature] multi process multi gpu dist training, boost v100 performance by 20% () 6 years ago
minqiyang a02ce58f2c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog 6 years ago
peizhilin 38715e6fd0 minor fix 6 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog 6 years ago
minqiyang 53433d7f2e Revert the changes of VLOG 6 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build 6 years ago
peizhilin b2f8d4183d Given the different fraction_of_gpu_memory_to_use depends on platform 6 years ago
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH () 6 years ago
peizhilin 7c8c9dc9bf fix unit test cases 6 years ago
minqiyang 0c3227a523 Change the origin VLOG level to 10 times 7 years ago
chengduo 2c9839c847
add cuda version display () 7 years ago
Xin Pan ab798a2832 clarify the fraction_of_gpu_memory flag 7 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization ()" 7 years ago
chengduo 1d91a49d2f
Some trivial optimization () 7 years ago
chenweihang da39d84a48 refine by reviewer's advice 7 years ago
chenweihang 61052cdbc6 polish high frequency enforce error message 7 years ago
fengjiayi 9f11da5931 Add synchronous TensorCopy and use it in double buffer 7 years ago
Yi Wang 0c43a376e2
Fix cpplint errors with paddle/fluid/platform/gpu_info.* () 7 years ago
Kexin Zhao 1998d5afa2 add gpu info func to get compute cap 7 years ago
chengduoZH 00e596edbe get max threads of GPU 7 years ago
qingqing01 24509f4af9 Fix the grammar in copyright. () 7 years ago
Yi Wang fc374821dd Correct #include path 7 years ago
Yi Wang 90648f336d Move file to fluid/; Edit CMakeLists.txt 7 years ago