24 Commits (83a2fb1f08714d12728292924ea0e07f72451987)

Author SHA1 Message Date
ronnywang e03e46730c
[ROCM] fix gather_op, sigmoid_cross_entropy_with_logits_op, test=develop (#31467)
4 years ago
Qi Li 4d647ec137
[ROCM] update fluid platform for rocm (part5), test=develop (#31315)
4 years ago
liu zhengxi ae2be49f40
Add cublas_handle() to expose cublas_handle to ops (#31157)
4 years ago
Zhong Hui 16fe11d71e
fix softmax cross entropy integer overflow (#30590)
4 years ago
Qi Li 93c1d9e761
[ROCM] update fluid platform for rocm39 (part3), test=develop (#30913)
4 years ago
Huihuang Zheng 1cbb282d77
Add Retry Logic to CublasHandlerHolder
5 years ago
Huihuang Zheng 4c4d4ba5e0
Modify CublasHandleHolder to Fix Random Unittest Failure. test=develop (#29617)
5 years ago
AshburnLee efea540ca9
Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Chen Weihang 35ff1568e9
Add error message for cublas inItizalize failed (#21995)
6 years ago
Zeng Jinle cdb3d27985
Fix warn of gcc8 (#21205)
6 years ago
Tao Luo 75d1571995
refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603)
6 years ago
sneaxiy ed409ac9f4
Revert "Revert "Remove op handle lock""
7 years ago
Zeng Jinle dacfaaa966
Revert "Remove op handle lock"
7 years ago
sneaxiy d0a8a1e950
remove_op_handle_lock
7 years ago
dzhwinter eb6f9dd5de
Feature/cuda9 cudnn7 (#10140)
7 years ago
Yu Yang 40e3fe173c
Make cuda_helper.h Pass cpplint
7 years ago
chengduo b1224da8d9
Move reduceSum to elementwise_op_function.h (#9773)
7 years ago
chengduoZH 90dc33b5ff
Add todo for reduceSum
7 years ago
chengduoZH b8938b448c
refine Sum
7 years ago
chengduoZH a82883922e
follow comments
7 years ago
qingqing01 24509f4af9
Fix the grammar in copyright. (#8403)
7 years ago
Yi Wang 90648f336d
Move file to fluid/; Edit CMakeLists.txt
7 years ago