Commit Graph

53 Commits (9df7bc2c5ac76bb5a0641fd0b87bb2f5f89940cb)

Author SHA1 Message Date
chengduo 46d01d798e
Revert "Revert "Remove workspace_handle in conv_cudnn (#15186)"" (#15290)
6 years ago
chengduozh 358e657f68 Revert "Remove workspace_handle in conv_cudnn (#15186)"
6 years ago
chengduo 064512aa47
Remove workspace_handle in conv_cudnn (#15186)
6 years ago
sneaxiy ed409ac9f4 Revert "Revert "Remove op handle lock""
6 years ago
Zeng Jinle dacfaaa966 Revert "Remove op handle lock"
6 years ago
sneaxiy d0a8a1e950 remove_op_handle_lock
6 years ago
sneaxiy d25395fc98 remove tensor core lock
6 years ago
chengduo b9fb03cf54
Move GetTensor to tensor_util (#15011)
6 years ago
chengduo 79bd6dfa18
[Feature] Add Temporary Allocator (#14875)
6 years ago
sneaxiy ca84c2ca8f merge develop
6 years ago
Yu Yang 7604b1ad51 Fix Eigen macro when using GPU
6 years ago
sneaxiy c47c451a00 fix bug
6 years ago
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
6 years ago
Yu Yang 0d6718fcbd Pass compile
6 years ago
Yu Yang c774bcbd2d Merge device_context
7 years ago
sneaxiy faac8a76ce remove unnecessary codes
7 years ago
sneaxiy 7ff320f8cc merge develop
7 years ago
Yu Yang 90d9e5aee8
feat(platform): lazy initialization of devicecontext in pool (#14067)
7 years ago
Sylwester Fraczek 2098b42584 review fixes (Teamcity fails)
7 years ago
sneaxiy 5be6f762d0 remove_lock_in_some_ops
7 years ago
Brian Liu a53e8a8da6 Update MKLDNN integration framework to support Paddle multi-instances
7 years ago
chengduozh 82d2903b63 Fix fast ParallelExe bug
7 years ago
chengduo 2c9839c847
add cuda version display (#13885)
7 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization (#13530)"
7 years ago
chengduo 1d91a49d2f
Some trivial optimization (#13530)
7 years ago
sneaxiy 612e1a3155 modification
7 years ago
sneaxiy d0b2453ecd merge develop
7 years ago
sneaxiy 24ea39c4c6 feature/eager_delete_tensor
7 years ago
fengjiayi 82a1b35b9b Revert "Revert "Add CudnnHolder and use it in Conv and ConvTranspose op""
7 years ago
guochaorong 151e169eb7
Revert "Add CudnnHolder and use it in Conv and ConvTranspose op"
7 years ago
fengjiayi 1f36a4c27c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CudnnHolder
7 years ago
fengjiayi b0aca8824d make CudnnHolder thread safe
7 years ago
luotao1 7169f9378c fix mkldnn include format
7 years ago
fengjiayi 15cc9128be fix compile error
7 years ago
fengjiayi 04bfd5c10c add CudnnHolder to manage cudnn_handle and workspace
7 years ago
yuyang18 2d0e5592b5
Use std::map for Place <--> DeviceContext
7 years ago
chengduo da556ed6d4
enhance ParallelExecutor stable (#11637)
7 years ago
chengduoZH c99fca5f90 Add No Mutex
7 years ago
yuyang18 a1254a86ba Add lock to record_event.
7 years ago
yuyang18 7cf8b656a2 Remove lock in device context
7 years ago
Yu Yang 6b20b35589 Fix Transformer Hang Problem
7 years ago
Yi Wang 8dbd9c394e
Fix part of the cpplint errors in fluid/platform (#9802)
7 years ago
chengduoZH ab601c19c3 Add CUDAPinnedPlace
7 years ago
chengduoZH 18eb77303d add CUDAPinnedPlace
7 years ago
Yu Yang 1d8fe2a220 Enhance device context pool (#9293)
7 years ago
Kexin Zhao c88f58dbd8 add comment
7 years ago
Kexin Zhao 3b44b849d3 address comments
7 years ago
chengduo 84aea8a8a1
Merge pull request #8669 from chengduoZH/feature/concat_op
7 years ago
pzelazko-intel 8c71adaa8c MKLDNN conv2d kernel added (#8451)
7 years ago
chengduoZH 131ec276ed fix bug for big number; float->double and code refine
7 years ago