Commit Graph

496 Commits (02e17396c24f0deb11826e37a579a69dc41ca382)

Author SHA1 Message Date
sneaxiy 64ad051b9a merge develop
7 years ago
sneaxiy c47c451a00 fix bug
7 years ago
Yihua Xu 669191c9cc Implement conv3d with mkldnn library (test=develop)
7 years ago
Hongyu Liu 4f71a6ee2c
Merge pull request #14622 from PaddlePaddle/add_cudnn_lstm
7 years ago
Yibing Liu c7382df80f
Print assert failure id in lookup_table_op (#14698)
7 years ago
sneaxiy 096673f675 refactor eager deletion
7 years ago
phlrain cf1fe61004 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
Tao Luo 20120d9c97
Merge pull request #14608 from jczaja/prv-conv2d-transpose-mkldnn
7 years ago
Tao Luo ea47685f91
Merge pull request #14646 from jczaja/prv-softmax-mkl-sasum
7 years ago
minqiyang a02ce58f2c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
7 years ago
Tao Luo 4ec9de0122
Merge pull request #14628 from Sand3r-/mgallus/mkldnn-elementwise_mul
7 years ago
Clementine 6c71c1f8f9 Add activation gelu (#14569)
7 years ago
Michal Gallus 9455be0ba5 EltwiseMul: Extract StringToFormat to MKLDNN helper
7 years ago
Jacek Czaja 8bfa1fa9bb - ASUM MKL integration
7 years ago
liuhongyu 05917c3c79 add cudnn lstm; test=develop
7 years ago
peizhilin 38715e6fd0 minor fix
7 years ago
Jacek Czaja fb24690a58 - conv2d transpose MKL-DNN
7 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
7 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
7 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin b2f8d4183d Given the different fraction_of_gpu_memory_to_use depends on platform
7 years ago
Yu Yang 26af9cf90c
Merge pull request #14565 from chengduoZH/fix_cublas_warp_error
7 years ago
chengduozh f7847ca6a3 fix cublas warp error
7 years ago
luotao1 e21edb26f6 add Set/GetCPUNumThreads api
7 years ago
peizhilin 445fff24dc add the bigobj option to NVCC compile
7 years ago
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
7 years ago
peizhilin 7c8c9dc9bf fix unit test cases
7 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
7 years ago
peizhilin 6e66fadb95 clean up the pre-definitions on windows
7 years ago
peizhilin 67562a6fcd Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin 703b26e697 add profiler, parallel_executor back
7 years ago
chengduo a8d3aaae2a
print output log warning (#14497)
7 years ago
peizhilin 3a72a634cf Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin ee0fd78c81 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yu Yang f1a392a5fe
Merge pull request #13804 from sneaxiy/rewrite_allocation
7 years ago
qingqing01 fd7e643153
Convolution fusion operator. (#14449)
7 years ago
Yu Yang 98bbfc17be Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
peizhilin c59d3e83bc test case fix
7 years ago
peizhilin 8580b7a130 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Wu Yi b32c13dc20
Add cudnn ctc loss (#12366)
7 years ago
peizhilin d1a1fafc4c code style
7 years ago
peizhilin 162f2d4109 disable the openblas multi-thread on windows since no support
7 years ago
Yu Yang c8f6e70ab4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
peizhilin d1429ac4a5 add recordio support
7 years ago
Yu Yang 0d6718fcbd Pass compile
7 years ago
peizhilin be332a13bc Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yu Yang d93b2d0365 Refine code
7 years ago
peizhilin 1a9008c420 code style fix
7 years ago
tensor-tang 1be85d011d add mkl vsqr and vpow
7 years ago
peizhilin 13bfee1f85 Merge branch 'windows/build' into windows/online
7 years ago
peizhilin 7840d181c9 fix style issue
7 years ago
peizhilin dc339b78d7 fix code style
7 years ago
sneaxiy d231e55065 merge develop
7 years ago
peizhilin 9b558a8035 Merge branch 'windows/build' into windows/online
7 years ago
peizhilin 7638f0afb3 simplify the logic
7 years ago
peizhilin 6f9c70acb7 Merge branch 'windows/build' into windows/online
7 years ago
peizhilin ca60e1d34d Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin 4bd0c4c5ee test=develop
7 years ago
peizhilin 4b1f1a8787 fix merge issue
7 years ago
Yu Yang 6ae0b91b39 Clean LockGuardPtr
7 years ago
peizhilin 52f7644f53 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Qiyang Min 698698f2fa
Merge branch 'develop' into fix_vlog
7 years ago
qingqing01 abe209234f
Exhaustive search for cuDNN conv. (#14286)
7 years ago
Yu Yang b59a9bfb7c Clean buffered_allocator
7 years ago
Yu Yang fdc689142c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
minqiyang 87450b9ad4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
7 years ago
peizhilin 4ffa92d4f0 Merge branch 'develop' into windows/build
7 years ago
peizhilin 45125ba538 fix share library issue
7 years ago
Zhaolong Xing ba8b5619a3
Revert "cherry picked windows patches."
7 years ago
minqiyang fcc0452c8b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
7 years ago
minqiyang 0c3227a523 Change the origin VLOG level to 10 times
7 years ago
peizhilin 869487a2b7 Merge remote-tracking branch 'origin/develop' into windows/build
7 years ago
dzhwinter 234a1d9248 Merge remote-tracking branch 'origin/develop' into windows/debug
7 years ago
Yu Yang c774bcbd2d Merge device_context
7 years ago
Yu Yang 057a682ee9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
tensor-tang c9730d33d9 fix run error on mac
7 years ago
Qiao Longfei 2921f8a79c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
7 years ago
dzhwinter 2835e04409 merge develop branch. test=develop
7 years ago
qingqing01 db8c52da5e Revert " Exhaustive search for cuDNN conv. (#14043)"
7 years ago
qingqing01 ce7d9b0799
Exhaustive search for cuDNN conv. (#14043)
7 years ago
Zeng Jinle 8ac2242b6e
Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe
7 years ago
sneaxiy 8684553633 stream callback support in cuda 10
7 years ago
sneaxiy faac8a76ce remove unnecessary codes
7 years ago
Yu Yang ff9e531bd9
style(platform): disable warning when cuda cc not matched (#14029)
7 years ago
Qiao Longfei 59fbfbfbf7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
7 years ago
Qiao Longfei 9e4e9e9b6e clean rpc server profiler
7 years ago
tensor-tang c3cbf0b8ef
Merge pull request #14185 from tpatejko/tpatejko/mkldnn-conv-residual-data-reorder
7 years ago
peizhilin 71d7980f69 fix build issue 1
7 years ago
tensor-tang 6b49ee42c3
Merge pull request #14239 from tensor-tang/fix/avx
7 years ago
tensor-tang e09a7c793d remove the warning log since do not have avx2, avx512 flags
7 years ago
tensor-tang f524c1b62b throw error when mismatch cpu version
7 years ago
peizhilin 9d67c1fb69 cpu build support
7 years ago
dzhwinter 60f70b174d test=develop
7 years ago
sneaxiy 7ff320f8cc merge develop
7 years ago
Kaipeng Deng daed473d4a
Merge pull request #14089 from heavengate/pool_exclude
7 years ago
whs 0c319e0b35
Add affine grid generator op (#12238)
7 years ago
dzhwinter 0a180584e6 clean cmake. test=develop
7 years ago
dzhwinter 1ace55c8ee merge develop branch
7 years ago
Tomasz Patejko 8899d42265 MKLDNN conv residual data: primitive reuse interface used. Reorder done when formats are different
7 years ago
Yu Yang 90d9e5aee8
feat(platform): lazy initialization of devicecontext in pool (#14067)
7 years ago