Commit Graph

9328 Commits (53bd51e3f4e00d06006e49765d4af1ba952e99b0)
 

Author SHA1 Message Date
Tao Luo 1f00a70bb9
Merge pull request #5721 from tensor-tang/omp
7 years ago
chengduoZH c359e39b59 add double type kernel
7 years ago
Tao Luo 9b56074083
Merge pull request #5705 from tensor-tang/mkldnn_concat
7 years ago
ranqiu 2e1cd3313d Update dot_prod_layer
7 years ago
dangqingqing 94e86897fa update code and fix conflicts.
7 years ago
tensor-tang f5df46e1a4 rename all Mkldnn to MKLDNN
7 years ago
Tao Luo ba86885456
Merge pull request #5345 from luotao1/ProtoDataProvider
7 years ago
chengduoZH 0bc2f41da9 remove conflict
7 years ago
tensor-tang 40450401a6 change macro, can use omp when paddle use mklml
7 years ago
Tao Luo 3375e3e27b
Merge pull request #5714 from tensor-tang/refine
7 years ago
QI JUN a391a44dd0
remove v2 framework (#5722)
7 years ago
caoying03 dfc5d1f19a add the l2 distance layer.
7 years ago
guosheng aa83e19e24 Remove lstm_op including in gru_op
7 years ago
dangqingqing 082bc7af56 Use CUDA_ARCH_NAME=All in the paddle/scripts/docker/build.sh and remove 20 21(20) in cmake/cuda.cmake.
7 years ago
ranqiu aa2507187e add dot_prod_layer
7 years ago
tensor-tang 1578c20aaf add the missing macro PADDLE_USE_MKLDNN
7 years ago
Luo Tao e01b09410d remove test_CompareTwoOpts
7 years ago
Qiao Longfei 0ce38b77f2
correct optimizer import (#5699)
7 years ago
Luo Tao c1931468e4 Merge branch 'develop' into ProtoDataProvider
7 years ago
chengduo 06d155b539
Merge pull request #5718 from chengduoZH/Add_pad_and_stride_for_conv_transpose_op
7 years ago
tensor-tang cece368d6f Merge remote-tracking branch 'upstream/develop' into refine
7 years ago
tensor-tang 00a51557ee
Merge pull request #5717 from Yancey1989/fix_framework_proto_error
7 years ago
Tao Luo ba9f252945
Merge pull request #5715 from tensor-tang/check
7 years ago
chengduoZH eb07047699 add padding
7 years ago
Yancey1989 d13c3a98ce fix no framework proto file
7 years ago
tensor-tang 23a674c98a switch the flag
7 years ago
Yiqun Liu c808fbbfcb
Support the build for multiple architectures at one cmake command (iOS). (#5677)
7 years ago
tensor-tang defd7ec641 mkldnn only need one trainer
7 years ago
tensor-tang 363f690d79 expose only one WITH_MKL to user, covering WITH_MKLDNN and WITH_MKLML
7 years ago
tensor-tang 8496eab45a make mklml necessary when with_mkldnn
7 years ago
dangqingqing 3d080f3ad5 Refine cmake about CUDA to automatically detect GPU arch by default.
7 years ago
guosheng afd1f36186 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into fix-H0-GRUOp-dev
7 years ago
hedaoyuan bf712b5cef
Merge pull request #5711 from hedaoyuan/inference
7 years ago
Tao Luo 6f22e6c491
Merge pull request #5704 from luotao1/openblas
7 years ago
Yan Chunwei 186581d2cc
add is empty op (#5639)
7 years ago
wanghaox 06dc89e17b Merge branch 'sub_sequence_op' of https://github.com/wanghaox/Paddle into sub_sequence_op
7 years ago
wanghaox a0b7a07eb6 fix some comments
7 years ago
hedaoyuan b34f21bdf7 The pool_limit_size need be zero in mobile inference.
7 years ago
qingqing01 9cf6036533
Merge pull request #5707 from qingqing01/cos_sim_op
7 years ago
peterzhang2029 55c2c73f3c
Merge pull request #5703 from qingqing01/op_debug
7 years ago
Yang Yang(Tony) 18f0c40a97 feature/while_grad_op (#5554)
7 years ago
wanghaox 294570f5dd Merge branch 'sub_sequence_op' of https://github.com/wanghaox/Paddle into sub_sequence_op
7 years ago
wanghaox 49a594217b fix some typos
7 years ago
dangqingqing 24819df055 Fix cos_sim_op in debug mode.
7 years ago
qingqing01 01fa4cc73d
Remove the unused code.
7 years ago
tensor-tang 88feb517d4 Merge remote-tracking branch 'upstream/develop' into mkldnn_concat
7 years ago
tensor-tang 739858c889 add python interface for mkldnn_concat
7 years ago
tensor-tang 19c989ac15 fix error and pass unit test
7 years ago
Luo Tao 49ac0480e6 fix a lot of warnings -Wunused-but-set-variable and Wunused-variable when compile openBlas
7 years ago
dangqingqing 0321e1f861 Fix bilinear_tensor_product_op in debug mode.
7 years ago