Commit Graph

10520 Commits (5b5fa37fb98bfa05f23e5ad508f6dbf3e7ec9f93)

Author SHA1 Message Date
luotao1 b3cd2ae88b Merge branch 'develop' into ner_ut2
7 years ago
Xin Pan c709a04ae2
Merge pull request #13018 from panyx0718/name
7 years ago
tensor-tang 596213906b add gru seq mode forward
7 years ago
Yan Chunwei af15f6f038
fea/refine fuse (#13076)
7 years ago
Yu Yang 819af27d63
Merge pull request #13079 from reyoung/feature/fix_bug
7 years ago
zenghsh3 d7495838b3 refine
7 years ago
zenghsh3 04a05d1d58 merged
7 years ago
luotao1 07cb64adc0 add unit-test for chinese_ner
7 years ago
qiaolongfei 583e21e60e fix mac compie
7 years ago
zenghsh3 62f8d59d8d Merge branch 'develop' of https://github.com/zenghsh3/Paddle into develop
7 years ago
zenghsh3 08b73b68c4 fix bug of sampling_id_op
7 years ago
Tao Luo 0148d8d2be
Merge pull request #13098 from luotao1/fix_mkldnn_include
7 years ago
tensor-tang b0d36c4c3d add cross vec to speedup gru
7 years ago
Wu Yi 0b8067c0dc
fix dist train reduce mode (#13068)
7 years ago
yuyang18 555f03d8f0 Fix bug when loss@GRAD is reused.
7 years ago
luotao1 7169f9378c fix mkldnn include format
7 years ago
Xin Pan 823c4f87be
Merge pull request #13058 from panyx0718/infer
7 years ago
Jiabin Yang cceffca6bf
Update api_impl.cc
7 years ago
Jiabin Yang a9be969cf9
Update attention_lstm_fuse_pass.cc
7 years ago
Jiabin Yang 5d5b70ad79
Update CMakeLists.txt
7 years ago
JiabinYang 7c7d3d6172 Fix mac
7 years ago
tensor-tang 038c16eed2 save intermediate data to out buffer
7 years ago
jerrywgz 1a88baae22 add rpn_target_assign api test (#13013)
7 years ago
Xingyuan Bu 0a97d24b41 Faster RCNN Generate Proposal Labels (#12616)
7 years ago
Yan Chunwei cfa6bbb755
move nodeid from graph to node (#13065)
7 years ago
fengjiayi d5f74b7308 use CudnnHolder in conv_transpose_cudnn_op
7 years ago
Yibing Liu f88a8ba99a
Merge pull request #12793 from kuke/wrap_squeezes
7 years ago
fengjiayi 15cc9128be fix compile error
7 years ago
fengjiayi 407ff0bdbc use CudnnHolder in conv_cudnn_op
7 years ago
fengjiayi 04bfd5c10c add CudnnHolder to manage cudnn_handle and workspace
7 years ago
chengduo 3bd1d22a7d
Enhance fused_elementwise_activation_op (#12837)
7 years ago
Xin Pan 5adf118ab5 polish
7 years ago
tensor-tang 2d0ddf8c41 refine cpu gru batch mode
7 years ago
tensor-tang 70d3981220 add cpu vec bias sub
7 years ago
Xin Pan c558f059ad fix
7 years ago
Xin Pan 4794d9cf70 use fast RunPrepareContext for inference
7 years ago
Xin Pan 7dff0b11c6 fix
7 years ago
tangwei12 515a756a3c
Merge pull request #13048 from TomorrowIsAnOtherDay/develop
7 years ago
Xin Pan 0aa8135b11
Merge pull request #13045 from panyx0718/ir5
7 years ago
Xin Pan d1058003fe fix style
7 years ago
Xin Pan 51ef0ad766 allow to use name_scope for debugging and visiualization
7 years ago
Chen Weihang b98b744067
Merge branch 'develop' into sequence_enumerate_op
7 years ago
Wu Yi 9ae55dd70d
fix dist transpile with memopt (#12974)
7 years ago
robot 31c90692f7 Merge branch 'develop' of https://github.com/TomorrowIsAnOtherDay/Paddle into develop
7 years ago
robot e1f17a1e4f add python op for sampling_id
7 years ago
Yan Chunwei 902f19b46a
fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006)
7 years ago
Xin Pan c720ef972b fix graph_to_program_test to be platform-independent
7 years ago
Xingyuan Bu 2ad5d91ef8 Faster RCNN Generate Proposals (#12056)
7 years ago
tensor-tang 89d6d69ce4
Merge pull request #12781 from tensor-tang/feature/op/fusion_gru
7 years ago
tensor-tang d941192e74 fix gcc53 on cpu vec (#13020)
7 years ago
Xin Pan a08f6a17f0
Merge pull request #12886 from panyx0718/ir5
7 years ago
tensor-tang 2328a69157
Merge pull request #13012 from tensor-tang/refine/seq2batch
7 years ago
Xin Pan ec5cb3762c
Merge branch 'develop' into ir5
7 years ago
Xin Pan 2bb15f437c
Merge pull request #12791 from panyx0718/ir3
7 years ago
Qiao Longfei a22309afe8
clean useless check code in auc_op (#13023)
7 years ago
Yu Yang 8965cee89f
Polish PrintOp (#12895)
7 years ago
Yibing Liu 13509da69e Merge upstream to branch wrap_squeezes
7 years ago
Yibing Liu 03f6292bef Update API spec
7 years ago
chengduo 7ad39c4077
Enhance pad_constant_like_op (#12999)
7 years ago
qingqing01 0353eddb51
Improve fake_dequantize_op. (#12877)
7 years ago
Qiao Longfei 11e01d9b2d
Scale support selectedrows (#12960)
7 years ago
fengjiayi 7b84c580e2
Merge pull request #12824 from JiayiFeng/dev_sequence_padding_op
7 years ago
tensor-tang fd4f7c3ab5 refine seq2batch
7 years ago
Wu Yi 0ee6fed05b
Refine dist rpc deps (#12899)
7 years ago
Zhaolong Xing 3a0b6f97cc
Merge pull request #12942 from NHZlX/add_concat_op_converter
7 years ago
fengjiayi 7e0c9f50ae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_sequence_padding_op
7 years ago
Wu Yi 9b2b49ff26
test fix release branch api check (#12977)
7 years ago
Zeng Jinle 599a32641b
Merge pull request #12971 from sneaxiy/unstack_op
7 years ago
Tao Luo 0b77518f3a
Merge pull request #12982 from Sand3r-/mutable-data-check
7 years ago
Tao Luo 26cac36bfd
Merge pull request #12515 from kbinias/kbinias/bnorm-fwd-reuse
7 years ago
Michal Gallus b6d261dff5 Enforce requested size of tensor to be sufficiently large
7 years ago
tensor-tang a481c5e98c Merge remote-tracking branch 'ups/develop' into feature/op/fusion_expand_concat_fc
7 years ago
tensor-tang 49c31febb5 fix typo and op test
7 years ago
fengjiayi 9cb455fa7d update function
7 years ago
Krzysztof Binias fb4b4f8d57 Refactor code
7 years ago
Krzysztof Binias 50d3e6e96b Reusing primitives for forward Batch Norm operator
7 years ago
Zeng Jinle ef7bd03a03
Merge pull request #12964 from sneaxiy/fix_concat_sync
7 years ago
sneaxiy 52a480bb98 Merge develop
7 years ago
tensor-tang 02909335e9 rename fusion seq_concat_fc to fusion seqexpand_concat_fc
7 years ago
Xin Pan cf547e2714 fix program_desc feed/fetch names' order.
7 years ago
Xin Pan 08352fe56a fix
7 years ago
Xin Pan 6fdb7f5348 add test
7 years ago
Xin Pan 880cb8c4c3 clean
7 years ago
Xin Pan 1a67061fee graph to program pass
7 years ago
qingqing01 1f09bc320c
Support data type int8_t . (#12841)
7 years ago
chenweihang 0c4697f8cd fix: change to enumerate by sentence
7 years ago
tensor-tang c45cee0349 refine infershape and forward
7 years ago
sneaxiy 33b4def10a add api.spec
7 years ago
sneaxiy 24264bc0b8 Merge develop
7 years ago
dzhwinter 0153c21d83 add unstack_op
7 years ago
tensor-tang c7c2506733 add forward implementation
7 years ago
nhzlx 478eeabdd4 refine uttest of api_tensorrt_subgraph_engine
7 years ago
chengduo 3e1050a2e8
Add pad_constant_like_op (#12943)
7 years ago
dzhwinter 6cc7870517 fix concat synchronization bug
7 years ago
tensor-tang 954b0e113f init fusion seq expand concat fc op
7 years ago
nhzlx 791aa7f49d merge develop
7 years ago
dzhwinter d361624c1d
platform module (#12932)
7 years ago
tensor-tang c488ee96a7 Merge remote-tracking branch 'ups/develop' into refine/op/fusion_lstm
7 years ago
tensor-tang e61cf3214d complete reverse seq
7 years ago
Chen Weihang 4ec12496dd
Merge branch 'develop' into sequence_enumerate_op
7 years ago
tensor-tang 4b28fab8c9 enable more acts
7 years ago
tensor-tang 607c41952e compute gates
7 years ago
dzhwinter 4fcc293617
memory module (#12931)
7 years ago
Qiao Longfei 3c58b87b45
fix auc layer and add check for auc op (#12954)
7 years ago
Yibing Liu 43b2438aee Merge branch 'develop' of upstream into wrap_squeezes
7 years ago
Yibing Liu c1488b1796
Merge pull request #12940 from sneaxiy/stack_op
7 years ago
Yibing Liu efa6f0df4e Remove inplace args in unsqueeze & squeeze ops
7 years ago
Yibing Liu 26710ccd08 Update api spec accordingly
7 years ago
dzhwinter 04b1e4dcea
tensor module windows support (#12934)
7 years ago
dzhwinter c790d57cd4
data_type (#12933)
7 years ago
Qiao Longfei d0b713493e
enhance DebugStringEx (#12949)
7 years ago
dzhwinter eca4563e5d
operators module (#12938)
7 years ago
tensor-tang 6be273cbdb add seq mode lstm
7 years ago
tensor-tang 36363292c3
Merge pull request #12904 from tensor-tang/refine/jit
7 years ago
nhzlx 368dbc2835 merge develop
7 years ago
nhzlx 3de4556659 concat op && map cnn model support
7 years ago
Zeng Jinle d189d4dbab
Merge pull request #12884 from sneaxiy/sequence_mask_op
7 years ago
sneaxiy 3b38e5a4fc speed up stack_op
7 years ago
Qiyang Min 038e2817f0
Merge pull request #12836 from velconia/disable_prelu_test_local
7 years ago
tensor-tang 7bdaf09664 Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
Tao Luo 989cc2a4f4
Merge pull request #12913 from luotao1/concat
7 years ago
Tao Luo 8650f6ffae
Merge pull request #12898 from luotao1/expand
7 years ago
Qiao Longfei 52948a0b50
Merge pull request #12909 from jacquesqiao/fix-sparse-update-bug
7 years ago
tensor-tang ba943d38e3 make runtime avx act
7 years ago
tensor-tang 3462c29940 refine add bias with avx
7 years ago
tangwei12 ef6445ee39
Merge pull request #12908 from seiriosPlus/fill_constant_selectedrows
7 years ago
tensor-tang bb9f98e10d add inplace test
7 years ago
tensor-tang f269614bcd further optimize tanh with avx and mkl
7 years ago
chenweihang 733ea0d29b adjust infershape details
7 years ago
luotao1 e999c74cff Merge branch 'develop' into concat
7 years ago
luotao1 b61cf7ac4f Merge branch 'develop' into expand
7 years ago
luotao1 2b4edacca0 enhance the forward of concat op
7 years ago
Tao Luo 3e3b5f4fda
Merge pull request #12675 from Sand3r-/fix-conv-mkldnn-0.15
7 years ago
tensor-tang 7a4924cd44 further optimize sigmoid with avx and avx512
7 years ago
Xin Pan abeb71c895 small fix of op_desc
7 years ago
qiaolongfei fcf20eed0f fix sparse update bug
7 years ago
tangwei12 ca22586818 code optimize
7 years ago
Xin Pan 557be6fc58
Merge pull request #12902 from PaddlePaddle/revert-12736
7 years ago
tensor-tang 6bd89ba5b6 fix typo
7 years ago
minqiyang 6c775a0a06 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into disable_prelu_test_local
7 years ago
Chen Weihang 2969aba14f
Merge branch 'develop' into sequence_enumerate_op
7 years ago
chenweihang 219a2369da feat: wrap sequence enumerate op
7 years ago
tensor-tang e3bb98eb38 optimize relu with avx and avx512
7 years ago
minqiyang 0eccd59425 Keep APISpec the same with Python2
7 years ago
guochaorong 1f270275a6 Revert "Add Python Callstacks when Op::Run error (#12759)"
7 years ago
guochaorong b1fc238694 Revert "Disable in_place in batch_norm API. (#12736)"
7 years ago
tensor-tang 25976fe736 optimize the sigmoid and tanh
7 years ago
minqiyang 3fd169daed Resume all tests
7 years ago
tensor-tang 2eb46c2b06 add cpu vec test
7 years ago
sneaxiy 1083e99520 Merge develop
7 years ago
tensor-tang 0d46f518ae refine avx condition and warning
7 years ago
minqiyang 23bfdf9987 Port APISpec check
7 years ago
tensor-tang f0f06992c1
Merge pull request #12878 from tensor-tang/feature/op/attention_lstm
7 years ago
luotao1 83f4edabe9 remove broadcast in sequence_expand
7 years ago
sneaxiy 5ea7bf88ba
Merge pull request #12872 from sneaxiy/stack_op
7 years ago
minqiyang 2aac36b3f9 For test
7 years ago
Tao Luo ef2da86b4f
Merge pull request #12885 from luotao1/test_ditu_rnn
7 years ago
minqiyang 13686c4474 Change to debug case
7 years ago
sneaxiy e895c98f0a add support to max_len is None
7 years ago
fengjiayi 0fb5e351c1 update API.spec
7 years ago
minqiyang 709c37023a Polish code
7 years ago
fengjiayi f4a4a4cbd9 add op comment and python layer
7 years ago
tangwei12 acdd95d5ca bug fix
7 years ago
chenweihang d2e5395b97 feat: add sequence enumerate op
7 years ago
luotao1 9c7fde45a7 enhance test_analyzer to profile ditu inference demo
7 years ago
chengduo 8ad9055804
Add is_test for while_op (#12874)
7 years ago
minqiyang 80e3ce411d For test
7 years ago
Wu Yi c6f212a339
Merge pull request #12847 from typhoonzero/add_scope_info_in_graphviz
7 years ago
Wu Yi b8da70c370
Resovle multi gpu async deps (#12828)
7 years ago
sneaxiy 64464cb1fa Merge develop
7 years ago
qingqing01 79918a8442 add sequence_mask_op for DAM model
7 years ago
Xin Pan 4a4567fc3d
Merge pull request #12876 from panyx0718/ir5
7 years ago
Yu Yang b2df17003f
Add Python Callstacks when Op::Run error (#12759)
7 years ago
Yu Yang 17fcc4f5d0
Merge pull request #12864 from reyoung/feature/process_lod_grad
7 years ago
Xin Pan 698c926ce5 copy program and fix op_desc
7 years ago
minqiyang 8b8f6487d9 Add debug info for fetch feed
7 years ago
tensor-tang 4e538db14d refine jit space
7 years ago
tensor-tang 5ca0bb9aad support more activation type and remove some comments
7 years ago
sneaxiy ba168bd2d2 modify API.spec
7 years ago
tensor-tang d9bf73f3ab Merge remote-tracking branch 'ups/develop' into feature/op/fusion_gru
7 years ago
tensor-tang dd938d0b94 fix bugs and pass op test
7 years ago
tensor-tang ec59f0d454 add cpu vec
7 years ago
tensor-tang cf5ea925c3 fix bugs
7 years ago
tensor-tang 6ed20474d4 refine attention lstm infershape
7 years ago
tensor-tang 508548f897 implement attention lstm cpu forward
7 years ago
tensor-tang 9affc36c89 init attention lstm
7 years ago
tensor-tang 3dd66390b2 add blas vexp
7 years ago
tensor-tang 0ec1f65cf1 fix blas dot and add cblas scal
7 years ago
tensor-tang a2203d0466 add cblas dot
7 years ago
tensor-tang f72ab8961e refine blas gemm
7 years ago
qingqing01 f5d5d7b2d9
Disable in_place in batch_norm API. (#12736)
7 years ago
sneaxiy c73c5ed573 use for_range
7 years ago
Xin Pan b548ecbc2b add stack_op
7 years ago
Yu Yang eb8fd853bc Fix sequence_softmax_cudnn op
7 years ago
Yu Yang 3768677980 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/process_lod_grad
7 years ago
Tao Luo decda738b0
fea/anakin compile with demo (#12772)
7 years ago
Yu Yang 2a36ad1a96 Handle LoD for concat & seq_softmax ops
7 years ago
Yu Yang 211d81863d Process elemwise grad op's lod. mul_op's lod
7 years ago
Yan Chunwei 9ee698e605
enhance/ditu rnn with fc fuse (#12831)
7 years ago
Xin Pan 78415f326d
Merge pull request #12838 from panyx0718/infer
7 years ago
fengjiayi ce182d9037 bug fix
7 years ago
Xin Pan a2c0e52f3e speed up while_op
7 years ago
typhoonzero dd7a79158b add scope info in graphviz debug
7 years ago
tensor-tang 6f78fd7d1e fuse fc in gru
7 years ago
tensor-tang 300180cc26 init fusion gru op
7 years ago
Zhaolong Xing 21ba32b065
Merge pull request #12843 from NHZlX/fix_ssa_bug_for_trt
7 years ago
Michał Gallus cd32ddac12 Fuse Convolution and Eltwise Add into MKLDNN's Conv+Bias (#12669)
7 years ago
nhzlx c999895e93 merge develop
7 years ago
nhzlx 276950291a 1. fix ssa bug with batchnorm, 2. refine the trt
7 years ago
Yan Chunwei 896a37b6e3
fea/link ir to inference analysis and fc fuse support (#12789)
7 years ago
dzhwinter e23ddf6ae4
status (#12764)
7 years ago
Tao Luo d04ef276a5
Merge pull request #12745 from tensor-tang/refine/op/elewise_mul
7 years ago
tangwei12 cbc6e6eb97
Merge pull request #12247 from seiriosPlus/dis_ckpt_fix
7 years ago
Qiyang Min 72965226e6
Merge pull request #12818 from velconia/fix_python3_CI_job
7 years ago
minqiyang 656c77e712 Resume cicheck
7 years ago
minqiyang e1492f19e1 Change the sequence of ci check
7 years ago
tangwei12 44bade8b17 fix api spec
7 years ago
Zhaolong Xing 470335e8c4
Merge pull request #12786 from NHZlX/add_batch_norm_trt_converter
7 years ago
Qingsheng Li 3d11d018e0
Fix scatter_op python API (#12742)
7 years ago
nhzlx ff052c0e6f merge develop
7 years ago
nhzlx c6a5c4b0c0 add comments for execute in ut_helper
7 years ago
minqiyang 50d66a0790 Fix prelu_op
7 years ago
minqiyang beb93bb901 Fix ut bug for graph_test
7 years ago
Tao Luo 8f9f414a14
Merge pull request #12805 from tensor-tang/fix/op/elewise_add
7 years ago
tensor-tang e955361267
Merge pull request #12737 from tensor-tang/feature/op/fusion_lstm
7 years ago
tensor-tang 82bb9170fb Merge remote-tracking branch 'ups/develop' into fix/op/elewise_add
7 years ago
tangwei12 99f74be561
Merge pull request #12802 from seiriosPlus/inference_teeny_mistakes
7 years ago
Tao Luo 2ae885e224
Merge pull request #12811 from luotao1/tensorrt_compiler_bug
7 years ago
Chen Weihang 57b34d9196
Merge pull request #12808 from chenwhql/remove_inplace_param_in_squeeze_and_unsqueeze
7 years ago
Xin Pan daf464af68
Merge pull request #12807 from panyx0718/fix
7 years ago
luotao1 808e5b1748 fix tensorrt compiler bug
7 years ago
Yihua Xu 084d4a9e9e Optimize CRF Decoding with AVX/AVX2/AVX512F instruction (#12767)
7 years ago
fengjiayi 34b209cffa Complete sequence_padding GPU kernel
7 years ago
dzhwinter 00463fdfe3
cudnn windows support (#12757)
7 years ago
Xin Pan 4a4c469f61 add test
7 years ago
qingqing01 c62f68cb94
Fix bug in conditional_block_op. (#12246)
7 years ago
nhzlx 1bf9d9e90c fix comments
7 years ago
chenweihang bc471b6ac4 refactor: remove inplace parameter from squeeze and unsqueeze op
7 years ago
Xin Pan 7473d5f735 fix program_desc constructor
7 years ago
tensor-tang 0507f7bc3c fix SEGV elementwise add at debug mode
7 years ago
tangwei12 cfb12f09bf fix some teeny mistakes
7 years ago
Yu Yang c6af7201e9
Merge pull request #12692 from reyoung/feature/fast_executor
7 years ago
Xin Pan e525aa232e
Merge pull request #12780 from panyx0718/ir4
7 years ago
Tao Luo 7decbaaa13
Merge pull request #12762 from luotao1/anakin_cuda_env
7 years ago
nhzlx 324dd16816 merge develop
7 years ago
yuyang18 b8029fd650 Follow comments
7 years ago
tangwei12 ca1e18c04a
Merge pull request #12469 from seiriosPlus/sum_op_dim_fix
7 years ago
Xin Pan 1d3343240e fix
7 years ago
nhzlx 144b20c160 add batch norm op converter
7 years ago
nhzlx 14311bb094 merge develop
7 years ago