Commit Graph

66 Commits (0823a7bc8b6c46a866d1e54f8cb96ccaab192bf2)

Author SHA1 Message Date
xuezhong fb261793b9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
6 years ago
peizhilin 061299be87 fix dependency
6 years ago
xuezhong 58ad40cc15 add sample_logits op
6 years ago
tensor-tang d59f733551 refine softmax and use with cache
6 years ago
Yiqun Liu 3008fa1261
Add the CUDA kernel for beam_search op (#15020)
6 years ago
zhaozhehao e2ba9668b4 Tree conv op (#15217)
6 years ago
tensor-tang e58a569c6c use seqpool jitkernel
6 years ago
tensor-tang 64a90b2f1c use vadd, vaddrelu, lstm and gru jitkernel
6 years ago
tensor-tang fab0ee8757 Merge remote-tracking branch 'ups/develop' into refine/jitkernel
7 years ago
tensor-tang 77236e33fc init jitkernel
7 years ago
nhzlx f75815b78c add prelu gpu inference
7 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
7 years ago
Yihua Xu f4c869d872 Optimize the layer_norm operator with AVX intrinsic function (#14417)
7 years ago
Yu Yang 98bbfc17be Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
whs 1722678258
Make nce support more distribution. (#13549)
7 years ago
Yu Yang c8f6e70ab4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
peizhilin 41b423d41b remove duplicate
7 years ago
peizhilin dcfab11193 merge from develop
7 years ago
peizhilin 4ffa92d4f0 Merge branch 'develop' into windows/build
7 years ago
peizhilin 869487a2b7 Merge remote-tracking branch 'origin/develop' into windows/build
7 years ago
Yu Yang 057a682ee9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
dzhwinter 2835e04409 merge develop branch. test=develop
7 years ago
tensor-tang b81e1b655e fix jit on mac
7 years ago
peizhilin 9d67c1fb69 cpu build support
7 years ago
tensor-tang a3377f7b0a refine jitcode and add vmul jitcode implementation
7 years ago
tensor-tang a53b1b0b1b refine and init jitkernel vmul
7 years ago
tensor-tang 2139b9f677 add jit gencode
7 years ago
dzhwinter 316765839d add back jit simd instructions. stage.
7 years ago
dzhwinter bf2e4cb188 cleard. staged
7 years ago
Yu Yang c01696f8c2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
tensor-tang 21487d78bf add crf decode jit kernel
7 years ago
Qiyang Min d0fdcb2f6d
Merge pull request #14048 from velconia/change_sequence_pool_to_cpu
7 years ago
minqiyang e2a348cd10 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into change_sequence_pool_to_cpu
7 years ago
minqiyang 047fa2f9aa Add unit-test for sequence_pooling functor
7 years ago
tensor-tang 032c3a07e3 Merge remote-tracking branch 'ups/develop' into refine/jit/gru
7 years ago
chengduo a7497653d0
Refine Split op (#13967)
7 years ago
tensor-tang 640e789d3d add fusion gru jit kernel
7 years ago
Yu Yang 461f71a90b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
tensor-tang 36588b3365 fix illegal instruction of rnn1 and text
7 years ago
tensor-tang 6447155dac
Merge pull request #13851 from tensor-tang/fea/jitkernel_peephole
7 years ago
minqiyang 8ec748cfa0 Accelerate SelectedRows Functors:
7 years ago
tensor-tang 9131a35676 replace the lstm compute with jitkernel
7 years ago
tensor-tang 2a00969165 optimize lstm jitkernel keq8
7 years ago
Yu Yang 8e3fdc6e65 Fix SetDevice on init
7 years ago
tensor-tang 2d0ff6a3c2 add vexp and unit test
7 years ago
tensor-tang eeff268a6c clean and refine kernels
7 years ago
tensor-tang c260bf942d init jit kernel
7 years ago
tensor-tang bc9971dd6c fix deps
7 years ago
tensor-tang 612ba41aee add simple lstm compute
7 years ago
dzhwinter 34757efb8e fix windows compile
7 years ago