Commit Graph

2191 Commits (d26e4507dac94e0de3a24816541f06082770bc35)

Author SHA1 Message Date
Qiao Longfei d26e4507da init ctr data
7 years ago
tensor-tang 6447155dac Merge pull request #13851 from tensor-tang/fea/jitkernel_peephole
7 years ago
Xin Pan 7fb5b66ac2 Merge pull request #13916 from panyx0718/fix2
7 years ago
tangwei12 fa2ab3346c fill constant add infervarshape, lookuptable clone lr var (#13830)
7 years ago
Xin Pan 342e436158 Make Var::GetMutable robust
7 years ago
Yan Chunwei 7a751b83ac fix isfinite_op sprintf (#13850)
7 years ago
Qiyang Min e3a64fca44 Merge pull request #13835 from velconia/fix_reshape_op
7 years ago
Yibing Liu 46b0b7903c Merge pull request #13856 from kuke/seq_unpad_op
7 years ago
Zeng Jinle 93606c2c2c Merge pull request #13689 from sneaxiy/sparse_rmsprop
7 years ago
Qiao Longfei 681226e97c Merge pull request #13864 from jacquesqiao/py-reader-add-test-mode
7 years ago
Xin Pan 288a112ffd Revert "Revert "Revert "Make variable::GetMutable robust"""
7 years ago
sneaxiy 5cedfb60c8 test=develop
7 years ago
Xin Pan fededdda20 Merge pull request #13872 from panyx0718/fix2
7 years ago
Qiao Longfei 60030e8678 change the use of FLAGS_reader_queue_speed_test_mode
7 years ago
Qiyang Min cab29828a5 Merge pull request #13829 from velconia/accelerate_sequence_pool_op
7 years ago
Xin Pan ddb76d0d09 Make GetMutable more robust
7 years ago
minqiyang d9b202e717 Move tensor copy src_ptr and dst_ptr check to TensorCopy function
7 years ago
chengduo 8e2fdc54b1 Add check for opt op (#13840)
7 years ago
Qiyang Min e37c9e6732 Merge pull request #13828 from velconia/accelerate_selected_rows_functor
7 years ago
sneaxiy 84d9300365 test=develop
7 years ago
Qiao Longfei c61e16b181 add reader_queue_speed_test_mode_flag test
7 years ago
Qiao Longfei 8686f7c68e add reader_queue_speed_test_mode flag for speed test
7 years ago
chengduo 9c77b65c06 Fix layers.uniform_random (#13823)
7 years ago
Yibing Liu cbe4292516 Add sequence unpad op
7 years ago
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
7 years ago
tensor-tang 8e182170ba refine and replace lstm peephole kernel
7 years ago
Dun 5f2e837847 optimize depthwise conv by register memory (#13778)
7 years ago
minqiyang 3f6ec90060 Polish code
7 years ago
tensor-tang 7ef2699e18 init peephole runtime kernel
7 years ago
Qiyang Min f99ea99e36 Merge pull request #13720 from velconia/fix_grad_clip
7 years ago
minqiyang f40848828d Polish code
7 years ago
minqiyang e2e82bde32 Accelerate Reshape op
7 years ago
minqiyang 0385b0a1ea Accelerate SequencePool Op on SUM mode
7 years ago
minqiyang 8ec748cfa0 Accelerate SelectedRows Functors:
7 years ago
whs 7e651c8641 Fix truncated norm (#13785)
7 years ago
minqiyang 1456b8ec7d Add unittest for clip_by_norm_op with SelectedRows
7 years ago
Tao Luo fd0dd07ab4 Merge pull request #13726 from jczaja/prv-fused_embedding_fc_lstm-ut
7 years ago
chengduo e1761709f8 Set the right shape of selected_rows (#13723)
7 years ago
tensor-tang 3ee8f2c6cf thread local jit kernels
7 years ago
tensor-tang 9131a35676 replace the lstm compute with jitkernel
7 years ago
tensor-tang b55c247678 add lstm compute unit test
7 years ago
qingqing01 6094a72308 Fix bug in reduce_op caused by PR #13534 (#13748)
7 years ago
dzhwinter a46e30aa6d enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688)
7 years ago
tensor-tang 2a00969165 optimize lstm jitkernel keq8
7 years ago
tensor-tang f2adaf1c3e add vrelu and lstm kernel
7 years ago
Jacek Czaja 9f15d8817e - Cleanup as suggessted by reviewers
7 years ago
Jacek Czaja ae8b4717cc - Cleaning fused_embedding_fc_lstm op
7 years ago
Jacek Czaja fd31b54cf1 - Removed disabled code
7 years ago
Jacek Czaja f9da2d6416 - Removed disabled diagnostic code
7 years ago
Jacek Czaja 809dbc5c17 - Added file for fused_embedded_fc_lstm_op unit test
7 years ago