647 Commits (0823a7bc8b6c46a866d1e54f8cb96ccaab192bf2)

Author SHA1 Message Date
Krzysztof Binias 0823a7bc8b Optimize the sequence padding op (#17403)
6 years ago
zhaoyuchen2018 8a2caacdbc improve gru unit performance. (#16338)
6 years ago
Kaipeng Deng a71d8fdb87 Softmax_cross_entropy op add axis (#16806)
6 years ago
Yibing Liu 3c375751f8 Support seq len equal to 0 in sequence ops (#16935)
6 years ago
Kevin c474e7ddf5 fix overflow by int32 mul test=develop (#16794)
6 years ago
Qiao Longfei faae1b4170 fix cpplint test=develop
6 years ago
Qiao Longfei 0a8ff2ecd4 add cpu_merge_add_multi_noduplicated_test test=develop
6 years ago
Qiao Longfei 920a960974 optimize merge add if input rows of all selected rows is not duplicated
6 years ago
Qiao Longfei baf02328b2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Kaipeng Deng 54474637ae Merge pull request #16057 from heavengate/softmax_axis
6 years ago
Qiao Longfei 30618409db Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
dengkaipeng 90bd038d35 fix format. test=develop
6 years ago
phlrain 1580be5d6c fix sequence pad; test=develop
6 years ago
dengkaipeng 93701dba50 add jit kernel for softmax axis. test=develop
6 years ago
dengkaipeng 6c64182709 refine softmax kernel. test=develop
6 years ago
phlrain 802b33489a remove resize then seq num == 1; test=develop
6 years ago
sneaxiy 5a92e4c097 revert revert 16144
6 years ago
Zeng Jinle a91964c8fe Revert "PaddingRNN model memory optimize"
6 years ago
Zeng Jinle 0b49e43d3a Merge pull request #16144 from sneaxiy/rnn_mem_opt
6 years ago
sneaxiy b26e9bd232 refine code
6 years ago
tensor-tang 6ff230a624 Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
tensor-tang 14a764c930 simplify the jitkernel templates and tests
6 years ago
Yiqun Liu 5bde120243 Make parent_idx a dispensable output for beam_search op to support models saved by older paddle version. (#16106)
6 years ago
tensor-tang 802f362ac4 unify the kernelfuncs cache and add unit test
6 years ago
Qiao Longfei fab1b54d99 Merge branch 'add-communicator' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Qiao Longfei 3691a46fa3 improve communicator
6 years ago
Yiqun Liu 87248281f7 Fix error in CUDA kernel of beam_search. (#15957)
6 years ago
Yihua Xu 7396788694 Optimize gelu operation with mkl erf.
6 years ago
xuezhong 1dad36f6aa Merge pull request #15609 from xuezhong/add_sample_logits_op
6 years ago
tensor-tang ee2321debd Revert 15770 develop a6910f900 gelu mkl opt (#15872)
6 years ago
xuezhong 81870723c6 Merge pull request #15605 from xuezhong/fix_bug_for_lstmp
6 years ago
Yihua Xu 676995c86c Optimze Gelu with MKL Erf function (#15770)
6 years ago
xuezhong f2262d7336 update comment
6 years ago
xuezhong fb261793b9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
6 years ago
xuezhong fb9a6a2bc6 pass test for lstm op
6 years ago
xuezhong 2ba256df40 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
peizhilin 061299be87 fix dependency
6 years ago
xuezhong 4028943125 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
tensor-tang a6a1a92ef7 Merge pull request #15586 from tensor-tang/jit/cache
6 years ago
xuezhong 4c98c2ccc3 remove debug print
6 years ago
xuezhong 58ad40cc15 add sample_logits op
6 years ago
xuezhong 880836329d add cell clip and proj clip, fix bug for h0
6 years ago
Yiqun Liu 16d54f7f23 Return parent_idx in beam_search op (#15520)
6 years ago
tensor-tang a18c0d4242 cache fc kernel
6 years ago
tensor-tang 6e1ee7fb57 cache softmax kernel func
6 years ago
tensor-tang d59f733551 refine softmax and use with cache
6 years ago
Yiqun Liu 3008fa1261 Add the CUDA kernel for beam_search op (#15020)
6 years ago
tangwei12 5cfc40dea8 nce add check sample lables, test=develop (#15463)
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
zhaozhehao e2ba9668b4 Tree conv op (#15217)
6 years ago