Commit Graph

189 Commits (1578c20aaf474ecbb3c3d082be9964a9fce26fa6)

Author SHA1 Message Date
wanghaox 0968c7cd6b Update code and fix conflicts.
7 years ago
dzhwinter e97b89873a "fix accuracy kernel bug" (#5673)
7 years ago
dangqingqing 884ce5d5a2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cmake_speed
7 years ago
Yang Yu 174050277a Fix GPU Compile on Linux
7 years ago
dangqingqing 524ccba4fe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cmake_speed
7 years ago
dangqingqing f5e367655e Use G++ to compile some cu operators.
7 years ago
emailweixu 2378679a9e Fix a dead lock bug for dyload/nccl.h when nccl lib cannot be loaded (#5533)
7 years ago
Yang Yu 3187451ae7 CompareOp's kernel device type is decided by input tensor place
7 years ago
qingqing01 58db07b7bb Check errors for the cuda kernel calls. (#5436)
7 years ago
QI JUN afd1e844fd remove unused code (#5219)
7 years ago
Dong Zhihong 16a39d24f3 fix conflict
7 years ago
Qiao Longfei 56b723c40d Cudnn batch norm op (#5067)
7 years ago
Dong Zhihong 0990c87bf6 checkin nccl operator
7 years ago
Yu Yang 94e741d6f0 Use external project for NCCL (#5028)
7 years ago
Yu Yang 43c6ff212e Feature/nccl dso (#5001)
7 years ago
Markus Kliegl 164898277c MatMul operator (#4856)
7 years ago
武毅 a3ccbdb3b6 Cudnn conv op (#4195)
7 years ago
Yang Yang(Tony) c3bf332666 Merge pull request #4537 from QiJune/executor_impl
7 years ago
Luo Tao 871a3f6e76 remove unused PADDLE_ONLY_CPU comment
7 years ago
Yang Yang e51557130e clean up for review
7 years ago
qijun 1f5192a27b fix executor gpu unittest
7 years ago
qijun 39f75a13a4 Merge remote-tracking branch 'baidu/develop' into executor_impl
7 years ago
Yi Wang 880b874b47 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into paddle_only_cpu
7 years ago
Yi Wang 2b204f048b Rename platform::GetDeviceCount into platform::GetCUDADeviceCount
7 years ago
qijun e02cc571cf Merge remote-tracking branch 'baidu/develop' into executor_impl
7 years ago
qijun fe10e86dd5 fix gpu build error
7 years ago
Yi Wang 4558807c48 Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU
7 years ago
Yu Yang 84500f9487 Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU`
7 years ago
qijun cb198fa7b6 merge baidu/develop
7 years ago
qijun 395051512d remove device context manager
7 years ago
qijun 6c4d1f551d refine codes
7 years ago
qijun 023ed5eb39 merge baidu/develop
7 years ago
qijun b5dbe88b5a follow comments
7 years ago
dzhwinter 8acc010691 Merge branch 'develop' into macro
7 years ago
dongzhihong 5423cb3e57 format
7 years ago
Yu Yang 8fd845e0fa Unify Map in OpDescBind
7 years ago
chengduoZH df59889984 remove conflict
7 years ago
qijun b611a479fc fix gpu build error
7 years ago
qijun 7a6fcc7d30 move EigenDeviceConverter to device_context.h
7 years ago
Yu Yang f2feb33384 Follow comments
7 years ago
Yu Yang 3a5693e0a8 Add Skeleton of Double support
7 years ago
chengduoZH 3c0f079333 remove conflict and fix InferShape function
7 years ago
Yu Yang bc30ba19ed Merge pull request #4375 from reyoung/feature/use_bool_for_enforce
7 years ago
chengduoZH 30a586df0c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into Add_pool_op
7 years ago
Qiao Longfei d0ad82cff1 fix nv_library (#4370)
7 years ago
Yu Yang 699dbe3be9 Use `bool` for PADDLE_ENFORCE, not int
7 years ago
Yu Yang ba1f5b5c58 Sync computation when Python invoke `run`
7 years ago
chengduoZH 0417e4e4bf fix framework::LoDTensor => Tensor
7 years ago
dangqingqing 41a2321a0e Refine platform::Transform function and fix prelu_op testing.
8 years ago
Yu Yang 87e4e25db1 Change Transform API
8 years ago