Commit Graph

122 Commits (3356fb3c6eab41cdfcd745a09f182547822bb3b2)

Author SHA1 Message Date
chengduoZH 0cc635497c merge develop
7 years ago
chengduoZH b8f7fa97b6 replace __shfl with __shfl_sync
7 years ago
chengduoZH 90d73c79c3 fix shfl_sync for CUDA8.0
7 years ago
dzhwinter eb6f9dd5de
Feature/cuda9 cudnn7 (#10140)
7 years ago
xzl 4747b2c92d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_upsample_layer
7 years ago
qingqing01 24509f4af9 Fix the grammar in copyright. (#8403)
7 years ago
dzhwinter b9b75377a2
Feature/hooks (#7513)
7 years ago
xzl 94f65c3415 handle confilct
8 years ago
guosheng fe6af6b6ac Enhance the AvgPooling to support optional exclude-mode
8 years ago
xzl 7dc584f5c4 add upsample layer
8 years ago
xzl 0b9c4cd7e5 fix comments
8 years ago
xzl ba2e5de662 pull develop and fix conflict
8 years ago
xzl a54565ea01 delete mask pool interface from poolprojection
8 years ago
武毅 cdd1da3424
Hack auc for dense vector labels (#5274)
8 years ago
xzl afa690243e add cuda and cpu pool_forward_with_mask impl
8 years ago
Luo Tao cc7a7557a2 set CudnnPooling don't include padded values in default
8 years ago
Luo Tao 73da97207e fix AvgPoolBackward and using CaffeMode in test_AvgPoolFwdBwd
8 years ago
Luo Tao 8067a42bef refine avg-pooling, which is exclusive. refine related code.
8 years ago
chengduoZH 6374831966 fix conflict
8 years ago
chengduoZH d747c5d511 fix layers_test.py
8 years ago
chengduoZH 43da7675b3 fix conflict
8 years ago
chengduoZH 4dcc533206 fix conflict
8 years ago
chengduoZH 860bf192c9 Add maxPoolIdx
8 years ago
wanghaoshuang e3342ff8e7 Fix android build error.
8 years ago
wanghaoshuang d8cd67dd1e Make cudnn convolution layer and projection support for dilation.
8 years ago
chengduoZH 2377d71947 Add3DPooling
8 years ago
chengduoZH d5768ebc89 fix above comments
8 years ago
chengduoZH 0a7516d193 fix col2vol vol2col kernel
8 years ago
chengduoZH cfb86c4e23 Add vol2col and col2vol cuda kernel
8 years ago
dangqingqing 7da1db053b update cuda kernel.
8 years ago
dangqingqing da7b9a5eb3 Remove the warning in hl_batch_norm_forward_inference function.
8 years ago
dangqingqing 73192bb12a add a batch norm inference kernel.
8 years ago
liaogang 1d4fa2433c ClangFormat for proto and cuda
8 years ago
Yibing Liu abcb1e1023 add the check of cudnn version in cudnnBatchNorm
8 years ago
dangqingqing d6ca03eb21 Fix out of memory.
8 years ago
dangqingqing cc5f0951ec Fix bug for WarpCTCLayer.
8 years ago
hedaoyuan 1ab2e44243 Merge pull request #2449 from hedaoyuan/ImageExpandFunction
8 years ago
hedaoyuan 41e1484eb2 Fix hl_sequence_avg_forward.
8 years ago
hedaoyuan 09d712d6ae Remove useless code(Matrix::convExpand and Matrix::convShrink).
8 years ago
liaogang 6237f6f57a revert clang-format
8 years ago
liaogang f27fd9dc28 follow comments
8 years ago
liaogang 665cc0e7b2 remove redundant mutex
8 years ago
liaogang 5b8fe87faf dlopen lapacke api and remove gfotran
8 years ago
Luo Tao 53da530d90 package avg_gpu_backward
8 years ago
Peng Li 36dbfe86a7 Change ctcStatus_t to hl_warpctc_status_t to keep consistency
8 years ago
Peng Li 448e60b5fe Fix macro bug in hl_warpctc_warp.cc to support double precision
8 years ago
Liang Zhao 8fded24c75 implement top k classification error in class matrix
8 years ago
xutianbing a948eea3ed clean unused code.
8 years ago
Haonan 2558c3f15a revisions according to reviews
8 years ago
liaogang 80c1679284 Fix cudart bugs before initMain
8 years ago