Commit Graph

684 Commits (569951c418fb3c9f82cbdde9fda3910cc7033bff)

Author SHA1 Message Date
tensor-tang 102d93712e Merge remote-tracking branch 'ups/develop' into jit/seqpool
7 years ago
tensor-tang 0145f40f45 use height from params of jitcode
7 years ago
Qiao Longfei 3e1b914fcb update gru op forward kernel
7 years ago
乔龙飞 Qiao Longfei e1679b8847
Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
7 years ago
tensor-tang c50060bb26 add jitcode impl and use it
7 years ago
tensor-tang 142bb41748 add seqpool jitkernel test and benchmark
7 years ago
tensor-tang e58a569c6c use seqpool jitkernel
7 years ago
sneaxiy d0a8a1e950 remove_op_handle_lock
7 years ago
sneaxiy d25395fc98 remove tensor core lock
7 years ago
Qiao Longfei 25d44d40ac sum op support empty selected rows as input
7 years ago
sneaxiy b56aca82e9 merge develop
7 years ago
chengduo b9fb03cf54
Move GetTensor to tensor_util (#15011)
7 years ago
minqiyang f4e7a47381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
7 years ago
wopeizl b117a5f208
Merge pull request #14931 from wopeizl/windows/mkl
7 years ago
chengduo 79bd6dfa18
[Feature] Add Temporary Allocator (#14875)
7 years ago
peizhilin 9e60c58666 Merge remote-tracking branch 'upstream/develop' into windows/mkl
7 years ago
tensor-tang 1aaec571c2 fix enum style
7 years ago
minqiyang 6467eda534 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
7 years ago
sneaxiy 045dc12766 merge develop
7 years ago
tensor-tang d53c4756ad clean code and remove unused files
7 years ago
peizhilin 0b4f742e8a fix the build issue
7 years ago
peizhilin 1cc9d59838 disable xbyak on windows
7 years ago
peizhilin 07c7eaabb4 Merge remote-tracking branch 'upstream/develop' into windows/mkl
7 years ago
tensor-tang 893957f711 Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
tensor-tang 6648995f53 fix build
7 years ago
JiabinYang 3b7b2e1ded Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
7 years ago
sneaxiy a500dfa579 rewrite ddim
7 years ago
JiabinYang b5fa916413 fix bug after merge reyoung optimization, test=develop
7 years ago
peizhilin fa135bbf52 Fix the mkl build script on windows
7 years ago
Yu Yang 2803cf5776
Merge pull request #14868 from reyoung/feature/refine_w2v
7 years ago
peizhilin b601f2de8d include the mkl fix only
7 years ago
peizhilin 5a6d7fe2ff add mkl,ctc support for windows
7 years ago
JiabinYang 656040c726 merge reyoung optimization
7 years ago
wopeizl fa78fc60be
Merge pull request #14907 from wopeizl/windows/avx
7 years ago
tensor-tang 64a90b2f1c use vadd, vaddrelu, lstm and gru jitkernel
7 years ago
Yu Yang 740e1626ce Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_w2v
7 years ago
minqiyang bd0067b26c Polish code
7 years ago
Yu Yang bacf1d2399 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
7 years ago
peizhilin 01dd9061a0 add avx support for windows
7 years ago
Kaipeng Deng dc76e4b0f1
Merge pull request #14701 from heavengate/adaptive_pool
7 years ago
tensor-tang 5e97be7ba7 enable jitkernel mkl vexp, vsigmoid and vtanh
7 years ago
minqiyang 5fea8cd478 Add sorted_result parameter to SelectedRows Functor
7 years ago
tensor-tang ae17926987 enable jitkernel mkl vmul, vadd and vscal
7 years ago
minqiyang da796dfe05 Remove BinarySearch from Adam Op
7 years ago
tensor-tang 00d3afbcc9 add gru refer functions, test and benchmark
7 years ago
tensor-tang bf9302f950 add lstm, peephole refer and test
7 years ago
Yu Yang 7b10bf0e60 Use mkl
7 years ago
tensor-tang bf951fa737 add refer vrelu, videntity, vexp, vsigmoid, vtanh and test and benchmark
7 years ago
Yu Yang 15550a2753 Polish code
7 years ago
JiabinYang 50fce87905 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
7 years ago
tensor-tang e9216e82f9 add refer vscal, vaddbias and test and benchmark
7 years ago
tensor-tang 417d031f90 add refer vadd, vaddrelu, vsub and tests and benchmark
7 years ago
JiabinYang c2e851f7b2 test=develop, remove sparse bias and add prefetch and related tests
7 years ago
Yu Yang be11375661 Refine code
7 years ago
Yu Yang 8d9401152e Refine w2v
7 years ago
Yu Yang 81520a24cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
7 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
7 years ago
Yu Yang eeca721a99
Merge pull request #14842 from reyoung/feature/refine_eigen_tensor
7 years ago
Yihua Xu acc6ae49b1 Fix the issue to run on AVX2 and AVX512F machines (#14851)
7 years ago
tensor-tang adc7ba2edd Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
dengkaipeng a81fabd327 fix doc errors. test=develop
7 years ago
dengkaipeng 266c6856c9 add adaptive pool 2d & 3d. test=develop
7 years ago
dengkaipeng eab4745965 add adaptive mode for pool.
7 years ago
Yu Yang 7604b1ad51 Fix Eigen macro when using GPU
7 years ago
gongweibao f1fb64b17f
Add reduce sparse tensor feature. (#14757)
7 years ago
tensor-tang fab0ee8757 Merge remote-tracking branch 'ups/develop' into refine/jitkernel
7 years ago
Houjiang Chen c6b39a0099
Merge pull request #14714 from NHZlX/add_prelu_gpu
7 years ago
tensor-tang b523787f9f remove jit namespace
7 years ago
tensor-tang 4a93db9288 remove jit namespace
7 years ago
tensor-tang 77236e33fc init jitkernel
7 years ago
JiabinYang b387a19410 optimize op with blas
7 years ago
nhzlx f75815b78c add prelu gpu inference
7 years ago
Tao Luo ea47685f91
Merge pull request #14646 from jczaja/prv-softmax-mkl-sasum
7 years ago
Jacek Czaja 48e1b97e8e - Coding style fixes
7 years ago
minqiyang 9d7c3b18c0 Polish code
7 years ago
minqiyang a02ce58f2c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
7 years ago
Jacek Czaja cf40daee58 - Building fix to softmax for inference
7 years ago
JiabinYang 87648f8edf merge develop, test=develop
7 years ago
wopeizl db9284ecde
Merge pull request #14617 from wopeizl/windows/online
7 years ago
JiabinYang c3c3c0b33c polish code, test=develop
7 years ago
Jacek Czaja 8bfa1fa9bb - ASUM MKL integration
7 years ago
tangwei12 56a4912b76
Make NCE_OP more efficient and support SelectedRows (#14469)
7 years ago
peizhilin 6a85dd3278 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin 38715e6fd0 minor fix
7 years ago
JiabinYang 7e4bd695e6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
7 years ago
JiabinYang b10df8bcfa refine code and add none bias ut, test=develop
7 years ago
Qiyang Min 30e47bce8b
Merge branch 'develop' into revert_vlog
7 years ago
tensor-tang 7a91271436
Merge branch 'develop' into fea/jit/rnn
7 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
7 years ago
JiabinYang 2f6b529aff refine code and comments, test=develop
7 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
7 years ago
tensor-tang 1f0291a51e add comments and follow comments
7 years ago
tensor-tang 557229bd39 Merge remote-tracking branch 'ups/develop' into fea/jit/rnn
7 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
JiabinYang 02d68051db add sparsed bias grad, test=develop
7 years ago
luotao1 e21edb26f6 add Set/GetCPUNumThreads api
7 years ago
JiabinYang 42470f14b7 test=develop
7 years ago
tensor-tang 6a7f83d45d enable gru jitcode and refine act and lstm jitcode
7 years ago
tensor-tang 686eaf20ba Merge remote-tracking branch 'ups/develop' into fea/jit/rnn
7 years ago
JiabinYang 0fca16847c temp
7 years ago
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
7 years ago
peizhilin dfbac60398 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin 7c8c9dc9bf fix unit test cases
7 years ago
tensor-tang 0c5ed5f6fc enable peephole jitcode
7 years ago
JiabinYang 3c6102a367 test=develop
7 years ago
tensor-tang e3b61cf52b init gru jitcode and fix lstm jitcode
7 years ago
tensor-tang 0f25446574 Merge remote-tracking branch 'ups/develop' into fea/jit/rnn
7 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
7 years ago
peizhilin bef475c92b Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Jiabin Yang f7b55de9e5
Merge branch 'develop' into enhance_hierachical_sigmod_op
7 years ago
tensor-tang 3562051302 add gru refer code and remove redundant avx code
7 years ago
Zhaolong Xing ad349e770f
Merge pull request #14452 from NHZlX/fix_avg_pool_trt_bug
7 years ago
tensor-tang f913860873 jitkernel lstm refer support peephole
7 years ago
tensor-tang 2f9b5f2383
Merge branch 'develop' into fea/jit/rnn
7 years ago
JiabinYang 014e50c284 test=develop
7 years ago
peizhilin 67562a6fcd Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
tensor-tang 10fb4ceefc
Merge pull request #14351 from tpatejko/tpatejko/mkldnn-elementwise_mul
7 years ago
tensor-tang ce31deb7e9 refine refer code and add lstm refer code
7 years ago
nhzlx e62872df8b fix conflicts
7 years ago
tensor-tang c2cfb03a72 add lstm jitcode
7 years ago
peizhilin 25adf970b2 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
tensor-tang 7aa3aff338
Merge pull request #14465 from tensor-tang/fea/jit/exp
7 years ago
Tao Luo 1b894e495f
Merge pull request #14437 from jczaja/prv-softmax-mkl
7 years ago
peizhilin 3a72a634cf Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yihua Xu f4c869d872 Optimize the layer_norm operator with AVX intrinsic function (#14417)
7 years ago
Yu Yang 98bbfc17be Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
Jacek Czaja 9b0eae3023 - Removing partial specialization of sotmax for inference for GPU
7 years ago
peizhilin a3e952f41d add the jit back
7 years ago
tensor-tang a19b3225a1 fix jitcode small size
7 years ago
tensor-tang 4dbdfa60ef sigmoid and tanh support all size
7 years ago
tensor-tang ccb8963705 refine exp jitcode with all size
7 years ago
tensor-tang d3eae8f61b refine relu and fix addrelu test
7 years ago
tensor-tang 4e67fe6a12 refine act and vxx with all size
7 years ago
tensor-tang ba3eaed7a7 exp support all size
7 years ago
tensor-tang 1ffce8c0ae fix build error on noavx
7 years ago
Michal Gallus c69c41604e MKLDNN elementwise_mul: Move Kernel to KernelPool to avoid segfaults
7 years ago
tensor-tang 7f17e561d7
Merge pull request #14423 from tensor-tang/fea/jit/act
7 years ago
Jacek Czaja 513bb6c151 Squashing MKL based softmax for inference
7 years ago
nhzlx 9b64aac41f add macro for pool2dDirectCUDAFunctor
7 years ago
whs 1722678258
Make nce support more distribution. (#13549)
7 years ago
nhzlx 83f8c403a7 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into fix_avg_pool_trt_bug
7 years ago
nhzlx b969116988 fxi avg pool trt bug and fix cpplint
7 years ago
tensor-tang 1f00723fa3 exp, sigmoid, tanh jitcode support more size
7 years ago
tensor-tang 8cda7b3d20 Merge remote-tracking branch 'ups/develop' into fea/jit/act
7 years ago
tensor-tang e2d6eddd32 remove ComputeDeprecated
7 years ago
tensor-tang 64f7516aee
fix lrn on mac (#14426)
7 years ago
Yu Yang c8f6e70ab4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
tensor-tang f65ddff8d1 unify act jitcode of relu, exp, sigmoid and tanh
7 years ago
tensor-tang 6a159071b6 add vtanh jitcode of size 8
7 years ago
tensor-tang 046374bcd1 add vsigmoid jitcode of size 8
7 years ago