chengduo
|
00b9e9a135
|
Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
* refine cublas
test=develop
* code refine
* refine cublas
* add GEMM_EX
* add enable_cublas_tensor_op_math doc and add cublasCall
test=develop
* fix CublasCall for cuda version
test=develop
* fix error
test=develop
* fix GEMM_EX to be compatible with gcc 4.8
test=develop
* add GEMM_EX
test=develop
* make compatible with gcc 4.8
test=develop
|
6 years ago |
minqiyang
|
0c3227a523
|
Multiply the original VLOG levels by 10
Fix code to support cpplint syntax check
test=develop
|
7 years ago |
chengduo
|
2c9839c847
|
add cuda version display (#13885)
test=develop
|
7 years ago |
Xin Pan
|
ab798a2832
|
clarify the fraction_of_gpu_memory flag
test=develop
|
7 years ago |
typhoonzero
|
a4f7696a18
|
Revert "Some trivial optimization (#13530)"
This reverts commit 1d91a49d2f.
|
7 years ago |
chengduo
|
1d91a49d2f
|
Some trivial optimization (#13530)
* some trivial opt
* remove the fix of lod_tensor and shrink_rnn_memory_op
* refine ShrinkRNNMemoryOp
test=develop
|
7 years ago |
chenweihang
|
da39d84a48
|
refine per reviewer's advice
|
7 years ago |
chenweihang
|
61052cdbc6
|
polish high-frequency enforce error messages
|
7 years ago |
fengjiayi
|
9f11da5931
|
Add synchronous TensorCopy and use it in double buffer
|
7 years ago |
Yi Wang
|
0c43a376e2
|
Fix cpplint errors with paddle/fluid/platform/gpu_info.* (#9710)
* Fix cpplint errors with paddle/fluid/platform/gpu_info.*
* Update
|
7 years ago |
Kexin Zhao
|
1998d5afa2
|
add gpu info func to get compute cap
|
7 years ago |
chengduoZH
|
00e596edbe
|
get max threads of GPU
|
7 years ago |
qingqing01
|
24509f4af9
|
Fix the grammar in copyright. (#8403)
|
7 years ago |
Yi Wang
|
fc374821dd
|
Correct #include path
|
7 years ago |
Yi Wang
|
90648f336d
|
Move file to fluid/; Edit CMakeLists.txt
|
7 years ago |