Commit Graph

210 Commits (936f0546e3aaa772514bba721167e897769ec2a9)

Author SHA1 Message Date
qingqing01 5ba231d80b
Merge pull request #6374 from reyoung/feature/remove_device_context_finish
7 years ago
Yang Yu 6b9567e0ac Remove DeviceContext::Finish
7 years ago
Yu Yang f291abfc53
Add HasCUDNN to detect if CUDNN is installed or not (#6349)
7 years ago
QI JUN 96a5f96cc1
fix bug in gpu default memory allocating policy (#6268)
7 years ago
QI JUN d066b07f14 change GPU memory allocating policy (#6159)
7 years ago
chengduo e50f35706a code refine (#6164)
7 years ago
Yu Yang 8ac02279f2
Fix the proformance problem of enforce (#6085)
7 years ago
武毅 4ecbab42d8
Fix compile on cudnn7 (#5982)
7 years ago
dangqingqing 696b0253e5 Refine paddle/v2/fluid/profiler.py.
7 years ago
dangqingqing 623f62a7dc Add cuda profiler tools and expose it in Python.
7 years ago
dangqingqing 322d69f209 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into nvprof
7 years ago
dangqingqing 6cf2dcbc1f Add cuda profiler tools.
7 years ago
武毅 a06bec1287
Conv cudnn 3d (#5783)
7 years ago
Qiao Longfei c9172c1cb3
Make enforce target (#5889)
7 years ago
Yu Yang c077a6d57c
Feature/support int64 for sum (#5832)
7 years ago
chengduoZH dec61ab6df Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_pool3d
7 years ago
chengduoZH 0bc2f41da9 remove conflict
7 years ago
chengduoZH 7e91da41e7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_pool3d
7 years ago
wanghaox 0968c7cd6b Update code and fix conflicts.
7 years ago
dzhwinter e97b89873a
"fix accuracy kernel bug" (#5673)
7 years ago
chengduoZH 74912c7d4e fix data layout
7 years ago
dangqingqing 884ce5d5a2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cmake_speed
7 years ago
chengduoZH ec1e2fc938 add cudnn_pool3d unit test
7 years ago
chengduoZH a93a59ec7d add cudnn 3d unit test
7 years ago
Yang Yu 174050277a Fix GPU Compile on Linux
7 years ago
dangqingqing 524ccba4fe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cmake_speed
7 years ago
dangqingqing f5e367655e Use G++ to compile some cu operators.
7 years ago
emailweixu 2378679a9e Fix a dead lock bug for dyload/nccl.h when nccl lib cannot be loaded (#5533)
7 years ago
Yang Yu 3187451ae7
CompareOp's kernel device type is decided by input tensor place
7 years ago
qingqing01 58db07b7bb Check errors for the cuda kernel calls. (#5436)
7 years ago
QI JUN afd1e844fd
remove unused code (#5219)
7 years ago
Dong Zhihong 16a39d24f3 fix conflict
7 years ago
Qiao Longfei 56b723c40d Cudnn batch norm op (#5067)
7 years ago
Dong Zhihong 0990c87bf6 checkin nccl operator
7 years ago
Yu Yang 94e741d6f0 Use external project for NCCL (#5028)
7 years ago
Yu Yang 43c6ff212e Feature/nccl dso (#5001)
7 years ago
Markus Kliegl 164898277c MatMul operator (#4856)
7 years ago
武毅 a3ccbdb3b6 Cudnn conv op (#4195)
7 years ago
Yang Yang(Tony) c3bf332666 Merge pull request #4537 from QiJune/executor_impl
7 years ago
Luo Tao 871a3f6e76 remove unused PADDLE_ONLY_CPU comment
7 years ago
Yang Yang e51557130e clean up for review
7 years ago
qijun 1f5192a27b fix executor gpu unittest
7 years ago
qijun 39f75a13a4 Merge remote-tracking branch 'baidu/develop' into executor_impl
7 years ago
Yi Wang 880b874b47 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into paddle_only_cpu
7 years ago
Yi Wang 2b204f048b Rename platform::GetDeviceCount into platform::GetCUDADeviceCount
7 years ago
qijun e02cc571cf Merge remote-tracking branch 'baidu/develop' into executor_impl
7 years ago
qijun fe10e86dd5 fix gpu build error
7 years ago
Yi Wang 4558807c48 Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU
7 years ago
Yu Yang 84500f9487 Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU`
7 years ago
qijun cb198fa7b6 merge baidu/develop
7 years ago