Commit Graph

2249 Commits (f3729db6e03d5e290020d3cc74cfb50572902c4c)

Author SHA1 Message Date
Yu Yang a6c8d6b9a2 Revert "Merge pull request #13431 from chengduoZH/refine_lod"
6 years ago
Zeng Jinle 7f1e312677
Merge pull request #13456 from sneaxiy/refine_sparse_adam
6 years ago
Yu Yang b5996fa124 Fix unstable selected_rows_functor_test.cu
6 years ago
sneaxiy d775087dc7 Merge develop
6 years ago
sneaxiy a29b4227eb fix sparse gradient clip
6 years ago
Yihua Xu 87086b1386 Refine activation for GRU operator (#13275)
6 years ago
chengduo d402234ba8
Feature/op_fuse_pass (#12440)
6 years ago
nhzlx a0a6789347 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
6 years ago
Qingsheng Li 21ec93aa7e
[WIP]Sequence Scatter Op (#12625)
6 years ago
nhzlx 202e0a1ee1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
6 years ago
nhzlx 4c52be07dd fix ut error
6 years ago
Jiabin Yang 60d163ab17
Merge pull request #13473 from JiabinYang/mac/fix_compile
6 years ago
nhzlx dcc09dce35 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
6 years ago
nhzlx 94a57f1d83 add trt config to arguments
6 years ago
chengduo fd8d83e68a
Fix the nested dyn_rnn (#13417)
6 years ago
whs cf128231c6
Add truncated gaussian initializer. (#13000)
6 years ago
Zeng Jinle 642cf6ca2f
Merge pull request #13418 from sneaxiy/dam_save_memory
6 years ago
JiabinYang 9d2d3096c5 fix mac compile error
6 years ago
Dun 253f618ac7 loosen the restriction of output_size in conv2d_transpose (#12292)
6 years ago
Yu Yang 155ebbb9dc
Merge pull request #13449 from chengduoZH/speed_up_lod_tensor_to_array
6 years ago
chengduo 6757a31552
[Accelerate] Refine seq_softmax_op (#13421)
7 years ago
Yibing Liu 9cc1937d43
Merge pull request #13400 from kuke/fix_seq_pad
7 years ago
Yu Yang 2c31ea9293
Merge pull request #13424 from chengduoZH/refine_seq_concat
7 years ago
Yu Yang 5996e224fa
Merge pull request #13430 from chengduoZH/refine_seq_pool
7 years ago
sneaxiy fd3e32ea7d Merge develop
7 years ago
sneaxiy b6f61faf13 fix adam
7 years ago
chengduoZH 6534f8527a Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_lod
7 years ago
chengduo 72dd6b37d9
Add sequence_expand_as_op (#13420)
7 years ago
chengduoZH 7b464d68a0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into speed_up_lod_tensor_to_array
7 years ago
chengduoZH 24459501fe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_seq_concat
7 years ago
chengduoZH f92b07f0b5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_seq_pool
7 years ago
Yu Yang e39899da16 Fix bug in test_detection_map_op
7 years ago
sneaxiy 0718113a9c modification
7 years ago
chengduoZH 6d0370d27a fix warning
7 years ago
chengduoZH 0d7519171e speed up lod_tensor to array and array to lod_tensor
7 years ago
sneaxiy d9942cd122 Merge develop
7 years ago
gongweibao 0c8c0d943f
fix macunittest (#13434)
7 years ago
Yu Yang cd69e1dd2a
Merge pull request #13339 from chengduoZH/debug_while_op
7 years ago
chengduoZH b57a2b989b remove test_seq_concat_op
7 years ago
chengduoZH cdb9605bad refine
7 years ago
chengduoZH 284519561d add doc
7 years ago
chengduoZH cacf549e8a refine seq_pool
7 years ago
chengduoZH e7940141ce refine seq_concat
7 years ago
sneaxiy a58a528436 modification
7 years ago
sneaxiy 7116cc12fb Merge develop
7 years ago
sneaxiy abf9832c12 tiny change to save memory
7 years ago
Tao Luo f15def6403
Merge pull request #12945 from jerrywgz/fix_maxout
7 years ago
gongweibao 239a83860b
Fix VarHandle return bug. (#13354)
7 years ago
tensor-tang 7c8730824a
Merge pull request #13396 from tensor-tang/refine/op/lstm
7 years ago
Dun e0bca5f866 Implement slice grad operator. #8130 (#12330)
7 years ago
Tao Luo 03dc7b79e9
Merge pull request #12966 from jerrywgz/fix_rnn_memory_helper
7 years ago
Yibing Liu 1c87558c4a Revert changes for debug
7 years ago
Yibing Liu 8edf60ceda Merge branch 'develop' of upstream into fix_seq_pad
7 years ago
Wu Yi 437debf40e Fix mac ci dist (#13393)
7 years ago
Tao Luo 40c54db301
Merge pull request #13338 from bingyanghuang/bingyang/seq_pool_memcpy
7 years ago
tensor-tang e09cf031a8 refine src and header
7 years ago
tensor-tang 3db1e41e12 Merge remote-tracking branch 'ups/develop' into refine/op/lstm
7 years ago
Tao Luo bdbf1bc820
Merge pull request #12953 from jerrywgz/fix_prelu
7 years ago
Michał Gallus 8cbefd1a43 Fuse Conv+BN+SkipConnectionAdd+ReLU with transpiler temporarily (#13350)
7 years ago
Qiyang Min f00081a419
Merge pull request #13381 from velconia/make_mac_pass_build
7 years ago
Yibing Liu 9e3245e3bd Merge branch 'develop' of upstream into fix_seq_pad
7 years ago
Yibing Liu f6595811a1 Get sequence length in sequence_pad op & fix sequence_mask op
7 years ago
bingyanghuang 76553c5a6d fix travis-ci
7 years ago
tensor-tang bc9971dd6c fix deps
7 years ago
Xingyuan Bu 9e2e893f59 Enhence generate_proposal_labels_op and fix some bug. (#13239)
7 years ago
tensor-tang ff858d35ed fix bug and enable on batch mode as well
7 years ago
tensor-tang 8dea07f209 fix comopile
7 years ago
tensor-tang 612ba41aee add simple lstm compute
7 years ago
Zhaolong Xing c9995289f1
Merge pull request #13124 from NHZlX/fix_subgraph_bug
7 years ago
Tao Luo 24e61d305b
Merge pull request #13378 from chuanqi129/group_conv
7 years ago
chuanqiw 1052a793bc support group convolution layer with mkldnn.
7 years ago
velconia bb9ec4b25f Polish code
7 years ago
gongweibao 3a3f28f99b
add (#13377)
7 years ago
velconia 926f5f43a9 fix redundant args of lambda and remove exception of destructor
7 years ago
nhzlx 329a8c5283 merge develop
7 years ago
nhzlx 49bafc05bf fix comments and set name for trt layer and ITensor
7 years ago
Bai Yifan e69d9c845b code fix (#13365)
7 years ago
tensor-tang b0b5f515a9 Merge remote-tracking branch 'ups/develop' into refine/infershape
7 years ago
gongweibao 8cee9f6176
Fix rpcclient's wait action in aync env. (#13307)
7 years ago
tensor-tang 43d30547c5 Merge remote-tracking branch 'ups/develop' into refine/infershape
7 years ago
tensor-tang 8bb824bb93 refine infershape hasinput and hasoutput
7 years ago
Jacek Czaja dfbd1cc3c1
Merge pull request #13209 from Sand3r-/mgallus/conv-relu-fuse
7 years ago
Krzysztof Binias 2ed7982d09
Merge pull request #13327 from kbinias/kbinias/conv-weights-converted-once
7 years ago
tensor-tang c4394bc543 Merge remote-tracking branch 'ups/develop' into refine/infershape
7 years ago
tensor-tang 8a1abe54d7 clean fusion infershape code
7 years ago
tensor-tang 916f42bcbf refine fusion gru infershape
7 years ago
tensor-tang a5556d4417 refine attentionlstm infershape
7 years ago
Krzysztof Binias accdecc681 Correcting Lint errors
7 years ago
bingyanghuang 83394bab3e modified by luotao's suggestion
7 years ago
Michal Gallus 5d34ef61cb Fuse MKLDNN's Conv + ReLU
7 years ago
nhzlx 49b5b3c5b3 merge develop
7 years ago
nhzlx 03ff4f6892 fix subgraph bug!
7 years ago
tensor-tang e0436ad8bb refine fusion lstm infershape
7 years ago
Krzysztof Binias 1ce9e9dc30 Renaming decision variable
7 years ago
chengduoZH cc18fffb90 add nest while_op
7 years ago
Bai Yifan faf8ad2436
Add ignore_index in cross_entropy op (#13217)
7 years ago
bingyanghuang 1454cd54aa pre-commit check
7 years ago
bingyanghuang 7429067ab3 clean code
7 years ago
bingyanghuang cdbc5e7353 Add some comments
7 years ago
bingyanghuang 53185fde11 Rewrite sequence pooling last and first mode with memcpy and clean code
7 years ago
guochaorong 76e9227467
Merge pull request #13199 from JiayiFeng/fix_CudnnHolder_bug
7 years ago
Krzysztof Binias 1658958fe6 Reusing converted weights
7 years ago
Yan Xu d117bbc313
Merge pull request #13291 from Yancey1989/reset_vars_on_pserver
7 years ago
qingqing01 a39eba77eb
Implement norm_op by CUDA instead of Eigen. (#13273)
7 years ago
Yancey1989 32b94a7d13 cache var types
7 years ago
Yancey1989 580f55fa0f update by comment
7 years ago
Yang Yu 8331e835a8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_CudnnHolder_bug
7 years ago
Yancey1989 6edfae4234 reset received vars on pserver
7 years ago
tensor-tang 40dbd97f8e Merge remote-tracking branch 'ups/develop' into refine/op/peephole
7 years ago
Qiyang Min b805751598
Merge pull request #13223 from velconia/open_python35_CI
7 years ago
Yu Yang 34e467dcab
Merge pull request #13232 from reyoung/feature/fix_layer_norm
7 years ago
chengduo 886852557f
Refine reshape_grad and transpose_grad (#13074)
7 years ago
tensor-tang 3eb55f0643 Merge remote-tracking branch 'ups/develop' into refine/op/peephole
7 years ago
tensor-tang d7ac1cc836 refine seq when bs is large
7 years ago
tensor-tang 9dd5a177a5 refine batch mode and peephole
7 years ago
Qiao Longfei 6e03f7900f
Add centered mode rmsprop (#13161)
7 years ago
Yan Chunwei 9df2d8b5ba
test/add text-classification test (#13081)
7 years ago
tensor-tang f10710b0ca move seq peephole if out of loop
7 years ago
tensor-tang 2f3b498949 refine fusion seq lstm peephole
7 years ago
tangwei12 d1e2efae6b
reimplement auc in fluid (#13167)
7 years ago
Yu Yang f57d706aa7 Use double to reduce
7 years ago
tensor-tang 5f586e2223 Merge remote-tracking branch 'ups/develop' into refine/op/fusion_lstm
7 years ago
Brian Liu 04272c0d41 Enable lstm peephole (#13160)
7 years ago
fengjiayi 56750e6a3e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_CudnnHolder_bug
7 years ago
Qiao Longfei cdd14f17f1
fix async mode handle COMPLETE_MESSAGE (#13212)
7 years ago
minqiyang 8059445fb5 Fix fake_quantize_op
7 years ago
tensor-tang 78d9ad5712 fusion gru enfore only used
7 years ago
tensor-tang 555083ae2a enforce only used
7 years ago
fengjiayi db5e3dd767 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_CudnnHolder_bug
7 years ago
Jiabin Yang d091dd02a0 fix mac compile error 0903 (#13184)
7 years ago
Yu Yang cda7842e26 Revert "Revert "Add Python Callstacks when Op::Run error (#12759)""
7 years ago
qingqing01 9557cc218d
Refine and fix some code for faster-rcnn. (#13135)
7 years ago
fengjiayi 82a1b35b9b Revert "Revert "Add CudnnHolder and use it in Conv and ConvTranspose op""
7 years ago
guochaorong 151e169eb7
Revert "Add CudnnHolder and use it in Conv and ConvTranspose op"
7 years ago
Chen Weihang 3b6090e80b
Merge pull request #12887 from chenwhql/sequence_enumerate_op
7 years ago
tensor-tang 1cc35f3642
Merge pull request #13118 from tensor-tang/optimize/op/fusion_lstm
7 years ago
dzhwinter 6fb28796f5
memory (#13143)
7 years ago
dzhwinter e722f68318
fix windows compile (#13147)
7 years ago
dzhwinter f05520060e
fix style (#13142)
7 years ago
dzhwinter 856c26faef
fix elementwise (#13146)
7 years ago
fengjiayi 653c8ded7d
Merge pull request #13078 from JiayiFeng/dev_CudnnHolder
7 years ago
tensor-tang 20659fc905
Merge pull request #13107 from tensor-tang/optimize/op/fusion_gru
7 years ago
tensor-tang 93c034ee51 Merge remote-tracking branch 'ups/develop' into optimize/op/fusion_lstm
7 years ago
tensor-tang c7adb99ae0 follow comment and refine code
7 years ago
tensor-tang 83f4bc4ecf follow comment and refine code
7 years ago
tensor-tang f38905a6e5 Merge remote-tracking branch 'ups/develop' into optimize/op/fusion_gru
7 years ago
tangwei12 fbdd4f8c0f
Merge pull request #13101 from zenghsh3/develop
7 years ago
tensor-tang 9838bacb35
Merge branch 'develop' into optimize/op/fusion_lstm
7 years ago
qingqing01 9bd933d3fb
Improve and fix fake_quantize_op (#13092)
7 years ago
Tao Luo 3fe0575b62
Merge pull request #13148 from dzhwinter/windows/math_compile
7 years ago
chenweihang 7ddbbcb0b5 doc: refine API and doc
7 years ago
dzhwinter 34757efb8e fix windows compile
7 years ago
tensor-tang c44108803a refine prelu
7 years ago
chenweihang b081363bae Merge branch 'sequence_enumerate_op' of https://github.com/chenwhql/Paddle into sequence_enumerate_op
7 years ago
chenweihang 0b7d82befb doc: refine English description
7 years ago
dzhwinter b11332a07b
"fix style" (#13094)
7 years ago
dzhwinter ab1097cd8e
Feature/template (#13093)
7 years ago
tensor-tang 80edd7ef29 enable run with fuse pass
7 years ago
fengjiayi f79ca23115 fix bugs
7 years ago
tensor-tang a79a77eeb5 refine and clean code
7 years ago
tensor-tang c459fb5be0 add fusion lstm batch mode
7 years ago
whs e10aa80f03
Add pad2d op. (#12950)
7 years ago
tensor-tang 7bdd11d88e Merge branch 'develop' into optimize/op/fusion_gru
7 years ago
fengjiayi 1f36a4c27c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CudnnHolder
7 years ago
fengjiayi b0aca8824d make CudnnHolder thread safe
7 years ago
tensor-tang 596213906b add gru seq mode forward
7 years ago
zenghsh3 d7495838b3 refine
7 years ago
zenghsh3 04a05d1d58 merged
7 years ago
zenghsh3 08b73b68c4 fix bug of sampling_id_op
7 years ago
tensor-tang b0d36c4c3d add cross vec to speedup gru
7 years ago
tensor-tang 038c16eed2 save intermediate data to out buffer
7 years ago
Xingyuan Bu 0a97d24b41 Faster RCNN Generate Proposal Labels (#12616)
7 years ago
fengjiayi d5f74b7308 use CudnnHolder in conv_transpose_cudnn_op
7 years ago
fengjiayi 407ff0bdbc use CudnnHolder in conv_cudnn_op
7 years ago
chengduo 3bd1d22a7d
Enhance fused_elementwise_activation_op (#12837)
7 years ago
tensor-tang 2d0ddf8c41 refine cpu gru batch mode
7 years ago
tensor-tang 70d3981220 add cpu vec bias sub
7 years ago
jerrywgz 85fe65ae61 modified error info for maxout op
7 years ago
Chen Weihang b98b744067
Merge branch 'develop' into sequence_enumerate_op
7 years ago
Yan Chunwei 902f19b46a
fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006)
7 years ago
Xingyuan Bu 2ad5d91ef8 Faster RCNN Generate Proposals (#12056)
7 years ago
tensor-tang 89d6d69ce4
Merge pull request #12781 from tensor-tang/feature/op/fusion_gru
7 years ago
tensor-tang d941192e74 fix gcc53 on cpu vec (#13020)
7 years ago
tensor-tang 2328a69157
Merge pull request #13012 from tensor-tang/refine/seq2batch
7 years ago
Xin Pan 2bb15f437c
Merge pull request #12791 from panyx0718/ir3
7 years ago
Qiao Longfei a22309afe8
clean useless check code in auc_op (#13023)
7 years ago
Yu Yang 8965cee89f
Polish PrintOp (#12895)
7 years ago
chengduo 7ad39c4077
Enhance pad_constant_like_op (#12999)
7 years ago
qingqing01 0353eddb51
Improve fake_dequantize_op. (#12877)
7 years ago
Qiao Longfei 11e01d9b2d
Scale support selectedrows (#12960)
7 years ago
fengjiayi 7b84c580e2
Merge pull request #12824 from JiayiFeng/dev_sequence_padding_op
7 years ago
tensor-tang fd4f7c3ab5 refine seq2batch
7 years ago
Wu Yi 0ee6fed05b
Refine dist rpc deps (#12899)
7 years ago
fengjiayi 7e0c9f50ae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_sequence_padding_op
7 years ago
Zeng Jinle 599a32641b
Merge pull request #12971 from sneaxiy/unstack_op
7 years ago
Tao Luo 26cac36bfd
Merge pull request #12515 from kbinias/kbinias/bnorm-fwd-reuse
7 years ago
tensor-tang a481c5e98c Merge remote-tracking branch 'ups/develop' into feature/op/fusion_expand_concat_fc
7 years ago
tensor-tang 49c31febb5 fix typo and op test
7 years ago
fengjiayi 9cb455fa7d update function
7 years ago
Krzysztof Binias fb4b4f8d57 Refactor code
7 years ago
Krzysztof Binias 50d3e6e96b Reusing primitives for forward Batch Norm operator
7 years ago
Zeng Jinle ef7bd03a03
Merge pull request #12964 from sneaxiy/fix_concat_sync
7 years ago
sneaxiy 52a480bb98 Merge develop
7 years ago
tensor-tang 02909335e9 rename fusion seq_concat_fc to fusion seqexpand_concat_fc
7 years ago
Xin Pan 1a67061fee graph to program pass
7 years ago
qingqing01 1f09bc320c
Support data type int8_t . (#12841)
7 years ago
chenweihang 00b30b9938 doc: unified infershape format
7 years ago
chenweihang 0c4697f8cd fix: change to enumerate by sentence
7 years ago
tensor-tang c45cee0349 refine infershape and forward
7 years ago
sneaxiy 24264bc0b8 Merge develop
7 years ago
dzhwinter 0153c21d83 add unstack_op
7 years ago
tensor-tang c7c2506733 add forward implementation
7 years ago
jerrywgz 6033c1a278 Add error info & remove data sharing between input and output in rnn_memory_helper_op
7 years ago
chengduo 3e1050a2e8
Add pad_constant_like_op (#12943)
7 years ago
dzhwinter 6cc7870517 fix concat synchronization bug
7 years ago
tensor-tang 954b0e113f init fusion seq expand concat fc op
7 years ago
tensor-tang c488ee96a7 Merge remote-tracking branch 'ups/develop' into refine/op/fusion_lstm
7 years ago
tensor-tang e61cf3214d complete reverse seq
7 years ago
Chen Weihang 4ec12496dd
Merge branch 'develop' into sequence_enumerate_op
7 years ago
tensor-tang 4b28fab8c9 enable more acts
7 years ago
tensor-tang 607c41952e compute gates
7 years ago
Qiao Longfei 3c58b87b45
fix auc layer and add check for auc op (#12954)
7 years ago
jerrywgz 835573bbf2 add error_info prelu_op
7 years ago
Yibing Liu c1488b1796
Merge pull request #12940 from sneaxiy/stack_op
7 years ago
dzhwinter eca4563e5d
operators module (#12938)
7 years ago
tensor-tang 6be273cbdb add seq mode lstm
7 years ago
tensor-tang 36363292c3
Merge pull request #12904 from tensor-tang/refine/jit
7 years ago
jerrywgz bc7503c85e modified error_info for maxout_op
7 years ago
Zeng Jinle d189d4dbab
Merge pull request #12884 from sneaxiy/sequence_mask_op
7 years ago
sneaxiy 3b38e5a4fc speed up stack_op
7 years ago
tensor-tang 7bdaf09664 Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
Tao Luo 989cc2a4f4
Merge pull request #12913 from luotao1/concat
7 years ago
Tao Luo 8650f6ffae
Merge pull request #12898 from luotao1/expand
7 years ago
Qiao Longfei 52948a0b50
Merge pull request #12909 from jacquesqiao/fix-sparse-update-bug
7 years ago
tensor-tang ba943d38e3 make runtime avx act
7 years ago
tensor-tang 3462c29940 refine add bias with avx
7 years ago
tangwei12 ef6445ee39
Merge pull request #12908 from seiriosPlus/fill_constant_selectedrows
7 years ago
tensor-tang bb9f98e10d add inplace test
7 years ago
tensor-tang f269614bcd further optimize tanh with avx and mkl
7 years ago
chenweihang 733ea0d29b adjust infershape details
7 years ago
luotao1 e999c74cff Merge branch 'develop' into concat
7 years ago
luotao1 b61cf7ac4f Merge branch 'develop' into expand
7 years ago
luotao1 2b4edacca0 enhance the forward of concat op
7 years ago
Tao Luo 3e3b5f4fda
Merge pull request #12675 from Sand3r-/fix-conv-mkldnn-0.15
7 years ago
tensor-tang 7a4924cd44 further optimize sigmoid with avx and avx512
7 years ago
qiaolongfei fcf20eed0f fix sparse update bug
7 years ago
tangwei12 ca22586818 code optimize
7 years ago
Xin Pan 557be6fc58
Merge pull request #12902 from PaddlePaddle/revert-12736
7 years ago
tensor-tang 6bd89ba5b6 fix typo
7 years ago
Chen Weihang 2969aba14f
Merge branch 'develop' into sequence_enumerate_op
7 years ago