Commit Graph

463 Commits (81520a24cf0edb065231ddeecea803a8f0149eeb)

Author SHA1 Message Date
Yu Yang 81520a24cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
7 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
7 years ago
Yu Yang 8175983ef9 Merge pull request #14814 from reyoung/feature/gprof
7 years ago
Yu Yang 5e60906996 Fix compile error
7 years ago
Yu Yang 7604b1ad51 Fix Eigen macro when using GPU
7 years ago
Yu Yang b22d638d8f Speed up SizeOfType
7 years ago
sneaxiy 66182abda6 add cuda cudnn version check
7 years ago
Zeng Jinle add98c9e7d Merge pull request #14745 from sneaxiy/fix_eigen_deallocate
7 years ago
Tao Luo 54fcafb5f6 Merge pull request #14707 from yihuaxu/develop_4f71a6ee2_conv3d_mkldnn_opt
7 years ago
sneaxiy 0f96c2e80f fix thread-safety bug
7 years ago
Yihua Xu 65dbc7cca4 Merge branch 'develop' into develop_4f71a6ee2_conv3d_mkldnn_opt
7 years ago
tensor-tang 4a93db9288 remove jit namespace
7 years ago
sneaxiy 900765224c fix deallocate bug
7 years ago
liuhongyu 773dc73fbf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu 8daf67f90f fix bugs; test=develop
7 years ago
Xin Pan 052cc5f538 Merge pull request #14725 from ZongwuYang/my-cool-stuff
7 years ago
Wu Yi 29d9fb53fc [Feature] multi process multi gpu dist training, boost v100 performance by 20% (#14661)
7 years ago
liuhongyu 968dd3c078 add cudnn 5 support; test=develop
7 years ago
ZongwuYang 1560eb4a6d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my-cool-stuff
7 years ago
ZongwuYang deb04809bd test=develop
7 years ago
Yihua Xu 669191c9cc Implement conv3d with mkldnn library (test=develop)
7 years ago
Hongyu Liu 4f71a6ee2c Merge pull request #14622 from PaddlePaddle/add_cudnn_lstm
7 years ago
Yibing Liu c7382df80f Print assert failure id in lookup_table_op (#14698)
7 years ago
phlrain cf1fe61004 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
Tao Luo 20120d9c97 Merge pull request #14608 from jczaja/prv-conv2d-transpose-mkldnn
7 years ago
Tao Luo ea47685f91 Merge pull request #14646 from jczaja/prv-softmax-mkl-sasum
7 years ago
minqiyang a02ce58f2c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
7 years ago
Tao Luo 4ec9de0122 Merge pull request #14628 from Sand3r-/mgallus/mkldnn-elementwise_mul
7 years ago
Clementine 6c71c1f8f9 Add activation gelu (#14569)
7 years ago
Michal Gallus 9455be0ba5 EltwiseMul: Extract StringToFormat to MKLDNN helper
7 years ago
Jacek Czaja 8bfa1fa9bb - ASUM MKL integration
7 years ago
liuhongyu 05917c3c79 add cudnn lstm; test=develop
7 years ago
peizhilin 38715e6fd0 minor fix
7 years ago
Jacek Czaja fb24690a58 - conv2d transpose MKL-DNN
7 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
7 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
7 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin b2f8d4183d Given the different fraction_of_gpu_memory_to_use depends on platform
7 years ago
Yu Yang 26af9cf90c Merge pull request #14565 from chengduoZH/fix_cublas_warp_error
7 years ago
chengduozh f7847ca6a3 fix cublas warp error
7 years ago
luotao1 e21edb26f6 add Set/GetCPUNumThreads api
7 years ago
peizhilin 445fff24dc add the bigobj option to NVCC compile
7 years ago
chengduo 00b9e9a135 Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
7 years ago
peizhilin 7c8c9dc9bf fix unit test cases
7 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
7 years ago
peizhilin 6e66fadb95 clean up the pre-definitions on windows
7 years ago
peizhilin 67562a6fcd Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin 703b26e697 add profiler, parallel_executor back
7 years ago
chengduo a8d3aaae2a print output log warning (#14497)
7 years ago
peizhilin 3a72a634cf Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago