Commit Graph

107 Commits (e1a7a880576f072bf27cbda568bcc4a5bcfb25fd)

Author SHA1 Message Date
sneaxiy d231e55065 merge develop
7 years ago
qingqing01 abe209234f
Exhaustive search for cuDNN conv. (#14286)
7 years ago
Zhaolong Xing ba8b5619a3
Revert "cherry picked windows patches."
7 years ago
Yu Yang c774bcbd2d Merge device_context
7 years ago
Yu Yang 057a682ee9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
dzhwinter 2835e04409 merge develop branch. test=develop
7 years ago
qingqing01 db8c52da5e Revert " Exhaustive search for cuDNN conv. (#14043)"
7 years ago
qingqing01 ce7d9b0799
Exhaustive search for cuDNN conv. (#14043)
7 years ago
Zeng Jinle 8ac2242b6e
Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe
7 years ago
sneaxiy faac8a76ce remove unnecessary codes
7 years ago
Yu Yang ff9e531bd9
style(platform): disable warning when cuda cc not matched (#14029)
7 years ago
sneaxiy 7ff320f8cc merge develop
7 years ago
dzhwinter 1ace55c8ee merge develop branch
7 years ago
Yu Yang 90d9e5aee8
feat(platform): lazy initialization of devicecontext in pool (#14067)
7 years ago
dzhwinter bf2e4cb188 cleard. staged
7 years ago
dzhwinter ebfe5a02b3 merge develop branch
7 years ago
Yu Yang c01696f8c2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
Sylwester Fraczek 2098b42584 review fixes (Teamcity fails)
7 years ago
sneaxiy 5be6f762d0 remove_lock_in_some_ops
7 years ago
Brian Liu a53e8a8da6 Update MKLDNN integration framework to support Paddle multi-instances
7 years ago
Yu Yang 461f71a90b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
chengduozh 82d2903b63 Fix fast ParallelExe bug
7 years ago
sneaxiy 2002e71da8 fix pinned allocator
7 years ago
chengduo 2c9839c847
add cuda version display (#13885)
7 years ago
Yu Yang 15076c325e Add comments and polish code style
7 years ago
Yu Yang 29f66c2408 Polish code
7 years ago
Yu Yang 8e3fdc6e65 Fix SetDevice on init
7 years ago
Yu Yang 58ed412f68 refactor(memory): rewrite memory allocation and make it extentable
7 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization (#13530)"
7 years ago
chengduo 1d91a49d2f
Some trivial optimization (#13530)
7 years ago
sneaxiy d0b2453ecd merge develop
7 years ago
sneaxiy 24ea39c4c6 feature/eager_delete_tensor
7 years ago
dzhwinter 85f8dd1c77 debug version
7 years ago
fengjiayi 7b577b92e0 fix a memory bug in CudnnHolder
7 years ago
fengjiayi 82a1b35b9b Revert "Revert "Add CudnnHolder and use it in Conv and ConvTranspose op""
7 years ago
guochaorong 151e169eb7
Revert "Add CudnnHolder and use it in Conv and ConvTranspose op"
7 years ago
fengjiayi 0236966b68 follow commits
7 years ago
fengjiayi 5398e1a3a6 fix bugs
7 years ago
fengjiayi f79ca23115 fix bugs
7 years ago
fengjiayi c501826f42 use framework::RWLock
7 years ago
fengjiayi b0aca8824d make CudnnHolder thread safe
7 years ago
fengjiayi 15cc9128be fix compile error
7 years ago
fengjiayi 407ff0bdbc use CudnnHolder in conv_cudnn_op
7 years ago
fengjiayi 04bfd5c10c add CudnnHolder to manage cudnn_handle and workspace
7 years ago
yuyang18 2d0e5592b5
Use std::map for Place <--> DeviceContext
7 years ago
yuyang18 7cf8b656a2 Remove lock in device context
7 years ago
Yu Yang 6b20b35589 Fix Transformer Hang Problem
7 years ago
Yi Wang 8dbd9c394e
Fix part of the cpplint errors in fluid/platform (#9802)
7 years ago
chengduoZH ab601c19c3 Add CUDAPinnedPlace
7 years ago
Yu Yang 1d8fe2a220 Enhance device context pool (#9293)
7 years ago
Kexin Zhao 3b44b849d3 address comments
7 years ago
chengduo 84aea8a8a1
Merge pull request #8669 from chengduoZH/feature/concat_op
7 years ago
pzelazko-intel 8c71adaa8c MKLDNN conv2d kernel added (#8451)
7 years ago
chengduoZH 131ec276ed fix bug for big number; float->double and code refine
7 years ago
qingqing01 24509f4af9 Fix the grammar in copyright. (#8403)
7 years ago
Yi Wang fc374821dd Correct #include path
7 years ago
Yi Wang 90648f336d Move file to fluid/; Edit CMakeLists.txt
7 years ago