Commit Graph

2669 Commits (af2f5fc8249bd449a3d90df6e5cce1a63f9c244c)

Author SHA1 Message Date
Michal Gallus 49b09327f6 MKLDNN elementwise_mul: Reorder on non-nchw input, fallback on non-16 divisable fm
6 years ago
Michal Gallus d14858e4ba MKLDNN elementwise_mul: Parallelize mul
6 years ago
Michal Gallus ed31936ba1 MKLDNN elementwise_mul: Support NCHW, update UT
6 years ago
Tomasz Patejko 700bcbf74f MKLDNN elementwise_mul: h and w loops implemented in xbyak
6 years ago
Tomasz Patejko ad09facafe MKLDNN elementwise_mul: CPU tests initially refactored. MKLDNN mul test for broadcast added
6 years ago
Tomasz Patejko 2d73ad180a MKLDNN elementwise_mul: simple xbyak version for AVX512
6 years ago
Tomasz Patejko 213ec37d6a MKLDNN elementwise_add: simple initial implementation of the operator for MKLDNN format
6 years ago
Wu Yi a2d9b34417
Refine operator cmake (#14413)
6 years ago
tensor-tang 7f17e561d7
Merge pull request #14423 from tensor-tang/fea/jit/act
6 years ago
Jiabin Yang 28bd5b7bad fix space_to_depth_op unicode problem (#14430)
6 years ago
Jacek Czaja 513bb6c151 Squashing MKL based softmax for inference
6 years ago
nhzlx 9b64aac41f add macro for pool2dDirectCUDAFunctor
6 years ago
whs 1722678258
Make nce support more distribution. (#13549)
6 years ago
nhzlx 83f8c403a7 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into fix_avg_pool_trt_bug
6 years ago
nhzlx b969116988 fxi avg pool trt bug and fix cpplint
6 years ago
tensor-tang 1f00723fa3 exp, sigmoid, tanh jitcode support more size
6 years ago
Qiyang Min d971d5b875
Merge pull request #14431 from velconia/fix_expand_op_dim_in_compile_time
6 years ago
Wu Yi b32c13dc20
Add cudnn ctc loss (#12366)
6 years ago
tensor-tang 8cda7b3d20 Merge remote-tracking branch 'ups/develop' into fea/jit/act
6 years ago
tensor-tang e2d6eddd32 remove ComputeDeprecated
6 years ago
tensor-tang 64f7516aee
fix lrn on mac (#14426)
6 years ago
Yu Yang c8f6e70ab4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
tensor-tang f65ddff8d1 unify act jitcode of relu, exp, sigmoid and tanh
6 years ago
tensor-tang 6a159071b6 add vtanh jitcode of size 8
6 years ago
tensor-tang 046374bcd1 add vsigmoid jitcode of size 8
6 years ago
minqiyang 560b29ccb7 Polish code
6 years ago
minqiyang 21d6e8e8c8 Polish code
6 years ago
minqiyang 50b6e4c6bc Fix expand grad op infer shape
6 years ago
Sylwester Fraczek 8a1eeec579 add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278)
6 years ago
chengduo 82773477ae
Add selu (#14415)
6 years ago
minqiyang 30147d7f58 Fix expand op incorrect infer shape
6 years ago
Yihua Xu 03ccb9a461 Optimize the stack operator
6 years ago
tensor-tang ee2a7f1b8c refine exp and fix error on avx
6 years ago
tensor-tang 1e06a32a0d add vexp jitcode of size 8
6 years ago
tensor-tang 2354409601
Merge pull request #14374 from tensor-tang/fea/jit/act
6 years ago
Tao Luo 5ef123c778 Merge branch 'develop' into dam_fc
6 years ago
dzhwinter d3aed98d86
Merge pull request #14320 from wopeizl/windows/online
6 years ago
Tao Luo d3e63e6e04
Merge pull request #14412 from jczaja/prv-dam-softmax
6 years ago
peizhilin be332a13bc Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Jacek Czaja b361579f09 - Softmax for Inference is enabled when ON_INFER is set
6 years ago
Tao Luo 980a6753a8 fix typo to pass the ci
6 years ago
Tao Luo 8f301f4618
Merge pull request #14381 from qingqing01/manylinux_v5_fix
6 years ago
peizhilin 1a9008c420 code style fix
6 years ago
Tao Luo e0d4e04bdd fix some compiler warning
6 years ago
Tao Luo 8ea13e336a add in_num_col_dims for fc
6 years ago
Tao Luo 9eb0ab1db3
Merge pull request #14384 from tensor-tang/refine/lrn
6 years ago
peizhilin 30ddc07a7e Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Qiao Longfei e65cbd3b06
Merge pull request #14387 from jacquesqiao/lookup_sparse_table_add_test_mode
6 years ago
Qiao Longfei 6cf8f24b1b
Merge pull request #14389 from jacquesqiao/fix_sgd_op_optimize_sparse_table
6 years ago
Xin Pan 10ab177f89
Merge pull request #14403 from PaddlePaddle/revert-14337-prv-dam-softmax
6 years ago
Yan Chunwei 9f252e0032
Combine Inference Analysis with IR (#13914)
6 years ago
Tao Luo 5b9c62faee
Revert "Softmax op optimization for inference "
6 years ago
Tao Luo 6490bb2765
Merge pull request #14337 from jczaja/prv-dam-softmax
6 years ago
chengduo 9f68e9a7fe
fix auc op (#14385)
6 years ago
Dang Qingqing d219818434 Fix compiling in cuDNN v5.
6 years ago
Qiao Longfei efb5c03f60 sgd_op optimize selected rows do not enforce id < height
6 years ago
Qiao Longfei 7aa8b2ccf2 optimize code
6 years ago
Qiao Longfei 8d205c853c add is_test for lookup_sparse_table
6 years ago
tensor-tang b4dfba1779 refine lrn_op cpu forward and speedup
6 years ago
tensor-tang 1be85d011d add mkl vsqr and vpow
6 years ago
ruri 4a55fb5f5b Add density_prior_box_op (#14226)
6 years ago
tensor-tang 0043c42b3e add vrelu jitcode
6 years ago
peizhilin 61fa5218b9 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yibing Liu bd2943788b
Fix gather & stack op (#14355)
6 years ago
Yu Yang 8f9bfad246
perf(compile): speed up reduce_op compile by splitting files (#14294)
6 years ago
sneaxiy d231e55065 merge develop
6 years ago
Jacek Czaja 03299ed46c - Fix to linking for GPU builds of softmax inference
6 years ago
Jacek Czaja 0756343767 - Fix GPU compilation
6 years ago
Jacek Czaja d332326847 - Added unit tests for softmax is_test=True op
6 years ago
Jacek Czaja c1fccc29c1 - Noise adding removed for Test phase of softmax
6 years ago
peizhilin 7638f0afb3 simplify the logic
6 years ago
peizhilin d01a26280e Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Xin Pan ff28b1ffc0
Merge pull request #14071 from barrierye/add_similarity_focus_op
6 years ago
li099 688ed60116 Add lod tensor array to tensor op (#13990)
6 years ago
peizhilin e23061e0dc Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
chengduo 6c6e638550
Add InferVarType for some op (#14201)
6 years ago
peizhilin 1eec5a428f Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Kaipeng Deng 0b38822624
Merge pull request #14345 from heavengate/fix_grid_sampler
6 years ago
peizhilin ca60e1d34d Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin 52f7644f53 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Qiyang Min 698698f2fa
Merge branch 'develop' into fix_vlog
6 years ago
qingqing01 abe209234f
Exhaustive search for cuDNN conv. (#14286)
6 years ago
Yu Yang b59a9bfb7c Clean buffered_allocator
6 years ago
Kaipeng Deng f215534ecf
Merge pull request #14205 from heavengate/nearest_interp
6 years ago
dengkaipeng 72108d8dbe fix win compile error: EigenTenor * float unsupport. test=develop
6 years ago
Yu Yang 26fb34c365 Merge develop tiny fix
6 years ago
Yu Yang fdc689142c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
tensor-tang 22125ebaef
Merge pull request #14321 from tensor-tang/fea/jit/vscal
6 years ago
Tao Luo 34e9e59f4a
Merge pull request #14333 from kbinias/change-hardcoded-format-and-bump-mkldnn-version
6 years ago
minqiyang 87450b9ad4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago
peizhilin 41b423d41b remove duplicate
6 years ago
peizhilin dcfab11193 merge from develop
6 years ago
peizhilin 4ffa92d4f0 Merge branch 'develop' into windows/build
6 years ago
chengduo c5b6573a5a
Fix input<tensor> (#14208)
6 years ago
Krzysztof Binias f1c1acf1ac Changed hardcoded format to any in convolution and bumped MKL-DNN version to 0.17-rc
6 years ago
Tao Luo 813e54efbd
Merge pull request #14328 from PaddlePaddle/revert-14046-windows/debug
6 years ago
minqiyang 3db9fad764 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago
Xin Pan b03a44e062
Merge pull request #14026 from JiabinYang/add_reorg_op
6 years ago
Zhaolong Xing ba8b5619a3
Revert "cherry picked windows patches."
6 years ago
minqiyang fcc0452c8b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago