Commit Graph

305 Commits (016bf51e3f88baeb5dfcaf30b4f67497217efe4a)

Author SHA1 Message Date
sneaxiy 92a2817a2b test=develop
7 years ago
chengduo a7497653d0
Refine Split op (#13967)
7 years ago
sneaxiy a9d7a9d720 test=develop
7 years ago
tensor-tang 23fc896bc2 Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
7 years ago
tensor-tang e5ce965952 refine and add eltadd_relu unit test
7 years ago
tensor-tang 7cb19a5976 fuse elementwise_add and relu
7 years ago
sneaxiy ac2eba4457 test=develop
7 years ago
tensor-tang b139b687de Merge remote-tracking branch 'ups/develop' into fix/jit/exp
7 years ago
tensor-tang 748435586a clean code exp avx
7 years ago
tensor-tang b4751a34a5 fix illegal instruction of rnn2
7 years ago
tensor-tang 36588b3365 fix illegal instruction of rnn1 and text
7 years ago
tensor-tang e69328c3bc fix warning and mac compile
7 years ago
tensor-tang 6447155dac
Merge pull request #13851 from tensor-tang/fea/jitkernel_peephole
7 years ago
sneaxiy 4b4af84e67 test=develop
7 years ago
Zeng Jinle 93606c2c2c
Merge pull request #13689 from sneaxiy/sparse_rmsprop
7 years ago
sneaxiy 5cedfb60c8 test=develop
7 years ago
Qiyang Min cab29828a5
Merge pull request #13829 from velconia/accelerate_sequence_pool_op
7 years ago
Qiyang Min e37c9e6732
Merge pull request #13828 from velconia/accelerate_selected_rows_functor
7 years ago
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
7 years ago
tensor-tang 8e182170ba refine and replace lstm peephole kernel
7 years ago
Dun 5f2e837847 optimize depthwise conv by register memory (#13778)
7 years ago
minqiyang 3f6ec90060 Polish code
7 years ago
tensor-tang 7ef2699e18 init peephole runtime kernel
7 years ago
minqiyang 0385b0a1ea Accelerate SequencePool Op on SUM mode
7 years ago
minqiyang 8ec748cfa0 Accelerate SelectedRows Functors:
7 years ago
tensor-tang 3ee8f2c6cf thread local jit kernels
7 years ago
tensor-tang 9131a35676 replace the lstm compute with jitkernel
7 years ago
tensor-tang b55c247678 add lstm compute unit test
7 years ago
tensor-tang 2a00969165 optimize lstm jitkernel keq8
7 years ago
tensor-tang f2adaf1c3e add vrelu and lstm kernel
7 years ago
tensor-tang e6d8aca3bf refine code and fix
7 years ago
tensor-tang ea7dc9cbf6 Merge remote-tracking branch 'ups/develop' into fea/jitkernel
7 years ago
tensor-tang 2513b2cc4e fix bug vtanh
7 years ago
tensor-tang cf8c8e72bd add vtanh and unit test
7 years ago
tensor-tang b37fe30417
Merge pull request #13690 from wangguibao/fix_cpu_lstm_compute_cc
7 years ago
dzhwinter 26771f41ba
"fix compile error" (#13579)
7 years ago
tensor-tang d10a9df7b8 add vaddbias and unit test
7 years ago
tensor-tang 3c8b651187 add vsigmoid avx implementations and unit test
7 years ago
tensor-tang 55e44761fb refine code and init vsigmoid
7 years ago
wangguibao 1940bc2d83 Avoid multiple definitions of lstm_compute_ctht when linking libpaddle_fluid.so
7 years ago
sneaxiy 584c3f048f fix sparse rmsprop
7 years ago
Dun 161c3e31f7 Optimization of Kernels that related to DeepLabv3+ (#13534)
7 years ago
tensor-tang 2d0ff6a3c2 add vexp and unit test
7 years ago
tensor-tang b3c63f40fa add vscal and unit test
7 years ago
tensor-tang 0987f2b4d9 add vadd unit test
7 years ago
tensor-tang 3d928d4f9d refine and seepdup
7 years ago
tensor-tang 77fc42d2d1 Merge remote-tracking branch 'ups/develop' into fea/jitkernel
7 years ago
tensor-tang 2937314d8e refine vmul and test
7 years ago
tensor-tang 6c986e127a fix macro and add vmul unit test
7 years ago
Yu Yang 0be1582df0
Merge pull request #13525 from reyoung/fix_mixed_vector
7 years ago