Commit Graph

41 Commits (686eaf20ba8c4769cb26be0183bf0485e2e69f8a)

Author SHA1 Message Date
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH () 6 years ago
Yu Yang 0d6718fcbd Pass compile 6 years ago
Yu Yang c774bcbd2d Merge device_context 7 years ago
sneaxiy faac8a76ce remove unnecessary codes 7 years ago
sneaxiy 7ff320f8cc merge develop 7 years ago
Yu Yang 90d9e5aee8
feat(platform): lazy initialization of devicecontext in pool () 7 years ago
Sylwester Fraczek 2098b42584 review fixes (Teamcity fails) 7 years ago
sneaxiy 5be6f762d0 remove_lock_in_some_ops 7 years ago
Brian Liu a53e8a8da6 Update MKLDNN integration framework to support Paddle multi-instances 7 years ago
chengduozh 82d2903b63 Fix fast ParallelExe bug 7 years ago
chengduo 2c9839c847
add cuda version display () 7 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization ()" 7 years ago
chengduo 1d91a49d2f
Some trivial optimization () 7 years ago
sneaxiy 612e1a3155 modification 7 years ago
sneaxiy d0b2453ecd merge develop 7 years ago
sneaxiy 24ea39c4c6 feature/eager_delete_tensor 7 years ago
fengjiayi 82a1b35b9b Revert "Revert "Add CudnnHolder and use it in Conv and ConvTranspose op"" 7 years ago
guochaorong 151e169eb7
Revert "Add CudnnHolder and use it in Conv and ConvTranspose op" 7 years ago
fengjiayi 1f36a4c27c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CudnnHolder 7 years ago
fengjiayi b0aca8824d make CudnnHolder thread safe 7 years ago
luotao1 7169f9378c fix mkldnn include format 7 years ago
fengjiayi 15cc9128be fix compile error 7 years ago
fengjiayi 04bfd5c10c add CudnnHolder to manage cudnn_handle and workspace 7 years ago
yuyang18 2d0e5592b5
Use std::map for Place <--> DeviceContext 7 years ago
chengduo da556ed6d4
enhance ParallelExecutor stable () 7 years ago
chengduoZH c99fca5f90 Add No Mutex 7 years ago
yuyang18 a1254a86ba Add lock to record_event. 7 years ago
yuyang18 7cf8b656a2 Remove lock in device context 7 years ago
Yu Yang 6b20b35589 Fix Transformer Hang Problem 7 years ago
Yi Wang 8dbd9c394e
Fix part of the cpplint errors in fluid/platform () 7 years ago
chengduoZH ab601c19c3 Add CUDAPinnedPlace 7 years ago
chengduoZH 18eb77303d add CUDAPinnedPlace 7 years ago
Yu Yang 1d8fe2a220 Enhance device context pool () 7 years ago
Kexin Zhao c88f58dbd8 add comment 7 years ago
Kexin Zhao 3b44b849d3 address comments 7 years ago
chengduo 84aea8a8a1
Merge pull request from chengduoZH/feature/concat_op 7 years ago
pzelazko-intel 8c71adaa8c MKLDNN conv2d kernel added () 7 years ago
chengduoZH 131ec276ed fix bug for big number; float->double and code refine 7 years ago
qingqing01 24509f4af9 Fix the grammar in copyright. () 7 years ago
Yi Wang fc374821dd Correct #include path 7 years ago
Yi Wang 90648f336d Move file to fluid/; Edit CMakeLists.txt 7 years ago