Commit Graph

5591 Commits (a02a68dc6daecbd3d5b17b57db03e0b3f916646e)

Author SHA1 Message Date
dzhwinter 7389ea98ea "add NCCL multi-GPU design doc"
7 years ago
dzhwinter 988a4a66bd Merge remote-tracking branch 'origin/develop' into feature/nccl_doc
7 years ago
hedaoyuan ef45af113a
Merge pull request #6408 from hedaoyuan/fix_bug
7 years ago
qingqing01 5ba231d80b
Merge pull request #6374 from reyoung/feature/remove_device_context_finish
7 years ago
Luo Tao 1d301731ac refine the gen_docs in build.sh
7 years ago
hedaoyuan e1247d8015 Fix compile error in android.
7 years ago
Yiqun Liu 00b64f6679
Add a c-api interface to initialize the thread environment of Paddle (#5773)
7 years ago
Qiao Longfei 36fcc95cab
Nmt decoder train (#6367)
7 years ago
Luo Tao 3b1529d2e5 add print_operators_doc in travis ci
7 years ago
Abhinav Arora 113c026d12
Swish activation operator (#6358)
7 years ago
QI JUN 3a0a4586a3
refine GPU memory allocation policy (#6373)
7 years ago
tensor-tang c096130a94
Merge pull request #6291 from tensor-tang/check_env
7 years ago
tensor-tang 32cc0db151 check if cmake has been installed
7 years ago
Tao Luo 8193cc55c4
Merge pull request #6371 from Yancey1989/install_dmidecode
7 years ago
tensor-tang f62dbebe20 follow comments and add limited version of dmidecode
7 years ago
Yang Yu 6b9567e0ac Remove DeviceContext::Finish
7 years ago
Yu Yang f291abfc53
Add HasCUDNN to detect if CUDNN is installed or not (#6349)
7 years ago
Tao Luo a34fc8b36b
Merge pull request #6213 from tensor-tang/mkldnn_lrn
7 years ago
Yancey1989 8dacb4050b install dmidecode in product docker image
7 years ago
whs e09e21beee
Merge pull request #6188 from wanghaoshuang/conv_fix
7 years ago
Abhinav Arora e557611f39
Fix equations in sequence_pool op (#6355)
7 years ago
Tao Luo de80c5696a
Merge pull request #6100 from guoshengCS/enhance-include-pool
7 years ago
Abhinav Arora 0d4f050955
Fix equation in logical or op (#6315)
7 years ago
QI JUN 7910d96ab8
build paddle_python before build paddle_docs (#6337)
7 years ago
gongweibao c7e739f542
Add LRN efficient GPU implement. (#5894)
7 years ago
kexinzhao 1d1555e2e5
Merge pull request #5716 from kexinzhao/float16
7 years ago
whs c22cf594f6
Merge pull request #6333 from wanghaoshuang/fix_type
7 years ago
qingqing01 5a1a04f648
Merge pull request #6326 from jacquesqiao/fix-int-overflow
7 years ago
chengduo b30e8bc00c
Merge pull request #6324 from chengduoZH/profiling/refine_elementwise_add
7 years ago
wanghaoshuang 6173f91cce uncomment code
7 years ago
wanghaoshuang 83537c7ada Fix warning about comparison of integers of different signs
7 years ago
chengduoZH 8711a9a22a refine code
7 years ago
qiaolongfei d303f7ae4f fix int overflow
7 years ago
Yu Yang 229c2e7883
Feature/while op sentiment analysis (#6282)
7 years ago
kexinzhao 94a36b8cc7
fix clip op doc operation (#6314)
7 years ago
kexinzhao dbf205002d
fix read and write tensor array op (#6312)
7 years ago
kexinzhao a5167ce0b4
fix lod_array_lengh op equation (#6307)
7 years ago
kexinzhao 002a7b4d01
fix scatter op equation (#6304)
7 years ago
kexinzhao 16822fb702
fix latex equation for clip by norm op (#6302)
7 years ago
kexinzhao 2c1270e40c
fix maxout op latex equation (#6303)
7 years ago
Abhinav Arora 1d04b19ce8
Fix the rendering of latex equation for adamax op (#6294)
7 years ago
Qiao Longfei 161128bada
add chunk eval layer (#6296)
7 years ago
tensor-tang 62c00e00eb add dimms locator info
7 years ago
qingqing01 1a8f20c6d1
Merge pull request #6284 from qingqing01/cuda_sync
7 years ago
tensor-tang e670453518 add script to check the cpu env
7 years ago
Qiao Longfei 45c8a88a3e
add crf_decoding layer (#6274)
7 years ago
qingqing01 e760641a3a
Merge pull request #6233 from qingqing01/momentum_op
7 years ago
dangqingqing 4e451a34db Remove the cuda stream synchronization between each operator.
7 years ago
chengduo 36444461dc
Merge pull request #6229 from chengduoZH/profiling/updata_elementwise_op
7 years ago
dangqingqing 62acf7990f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into momentum_op
7 years ago