Commit Graph

602 Commits (e402c0ec7d813264d76841fc4972ebc631f7696e)

Author SHA1 Message Date
tensor-tang d59f733551 refine softmax and use with cache
6 years ago
Yiqun Liu 3008fa1261
Add the CUDA kernel for beam_search op (#15020)
6 years ago
tangwei12 5cfc40dea8
nce add check sample lables, test=develop (#15463)
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
zhaozhehao e2ba9668b4 Tree conv op (#15217)
6 years ago
Qiao Longfei 4d15515c40 fix gru_gpu_kernel test=develop
7 years ago
Qiao Longfei 4feae25378 fix build problem test=develop
7 years ago
Qiao Longfei 4c7be265d3 update avx gru grad kernel test=develop
7 years ago
Qiao Longfei 9b16e54064 update gru_grad_op
7 years ago
Qiao Longfei e477d789a1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
7 years ago
Wu Yi fd85418329
[Feature] support mix precision training for resnet (#14899)
7 years ago
Qiao Longfei d0e3b24002 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
7 years ago
tensor-tang 223c61ca5e
Merge pull request #15170 from tensor-tang/jit/seqpool
7 years ago
Qiao Longfei c3b9edf958 follow comment test=develop
7 years ago
Qiao Longfei b16e832d4d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
7 years ago
sneaxiy ed409ac9f4 Revert "Revert "Remove op handle lock""
7 years ago
Zeng Jinle dacfaaa966 Revert "Remove op handle lock"
7 years ago
Zeng Jinle f3a13512fc
Merge pull request #15139 from sneaxiy/remove_op_handle_lock
7 years ago
tensor-tang 102d93712e Merge remote-tracking branch 'ups/develop' into jit/seqpool
7 years ago
tensor-tang 0145f40f45 use height from params of jitcode
7 years ago
Qiao Longfei 3e1b914fcb update gru op forward kernel
7 years ago
乔龙飞 Qiao Longfei e1679b8847
Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
7 years ago
tensor-tang c50060bb26 add jitcode impl and use it
7 years ago
tensor-tang 142bb41748 add seqpool jitkernel test and benchmark
7 years ago
tensor-tang e58a569c6c use seqpool jitkernel
7 years ago
sneaxiy d0a8a1e950 remove_op_handle_lock
7 years ago
sneaxiy d25395fc98 remove tensor core lock
7 years ago
Qiao Longfei 25d44d40ac sum op support empty selected rows as input
7 years ago
sneaxiy b56aca82e9 merge develop
7 years ago
chengduo b9fb03cf54
Move GetTensor to tensor_util (#15011)
7 years ago
minqiyang f4e7a47381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
7 years ago
wopeizl b117a5f208
Merge pull request #14931 from wopeizl/windows/mkl
7 years ago
chengduo 79bd6dfa18
[Feature] Add Temporary Allocator (#14875)
7 years ago
peizhilin 9e60c58666 Merge remote-tracking branch 'upstream/develop' into windows/mkl
7 years ago
tensor-tang 1aaec571c2 fix enum style
7 years ago
minqiyang 6467eda534 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
7 years ago
sneaxiy 045dc12766 merge develop
7 years ago
tensor-tang d53c4756ad clean code and remove unused files
7 years ago
peizhilin 0b4f742e8a fix the build issue
7 years ago
peizhilin 1cc9d59838 disable xbyak on windows
7 years ago
peizhilin 07c7eaabb4 Merge remote-tracking branch 'upstream/develop' into windows/mkl
7 years ago
tensor-tang 893957f711 Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
tensor-tang 6648995f53 fix build
7 years ago
JiabinYang 3b7b2e1ded Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
7 years ago
sneaxiy a500dfa579 rewrite ddim
7 years ago
JiabinYang b5fa916413 fix bug after merge reyoung optimization, test=develop
7 years ago
peizhilin fa135bbf52 Fix the mkl build script on windows
7 years ago
Yu Yang 2803cf5776
Merge pull request #14868 from reyoung/feature/refine_w2v
7 years ago
peizhilin b601f2de8d include the mkl fix only
7 years ago
peizhilin 5a6d7fe2ff add mkl,ctc support for windows
7 years ago