Commit Graph

1420 Commits (efe9d44ae6c5e7f33c2a9a1435c3465e1a4e2a1b)

Author SHA1 Message Date
Tomasz Patejko 347bf90412 MKLDNN conv + elementwise_add fusion: bias is also handled
7 years ago
Tomasz Patejko bf95ac36a7 MKLDNN conv + elementwise_add fusion: further reformatting
7 years ago
Tomasz Patejko cbe122ae2e MKLDNN conv + elementwise_add fusion: correcting formatting
7 years ago
Tomasz Patejko 2a251bbf27 MKLDNN conv + elementwise_add fusion: some refactoring: consts, function calls instead of constant values
7 years ago
Tomasz Patejko b8e54ab5cc MKLDNN conv + elementwise_add fusion: parameter name changed to ResidualData
7 years ago
Tomasz Patejko 27573ece03 MKLDNN conv + elementwise_add fusion: trailing spaces removed
7 years ago
Tomasz Patejko 7f5c8a95e8 MKLDNN conv + elementwise_add fusion: arguments are replaced for many parameters in operator
7 years ago
Tomasz Patejko 5996bd39e8 MKLDNN conv + elementwise_add fusion: graph is corrected based on actual argument name, not formal argument name
7 years ago
Tomasz Patejko 41f3d78fdf MKLDNN conv + elementwise_add fusion: output and elemwise param share data in conv primitive. Output is properly allocated
7 years ago
Tomasz Patejko 07a62ddc08 MKLDNN conv + elementwise_add fusion: inputs in pass modified. Support for new conv parameter. UTs corrected
7 years ago
Tomasz Patejko 42f569fdfd MKLDNN conv + elementwise_add fusion: use_mkldnn attribute added
7 years ago
Tomasz Patejko 441d3a4726 MKLDNN conv + elementwise_add: added some refactoring in the pass
7 years ago
Tomasz Patejko 38b7b34b1c MKLDNN conv + elementwise_add fusion: added reachability tests, inputs and outputs in graph nodes are transformed
7 years ago
Tomasz Patejko 16eaaf3fbe MKLDNN conv + elementwise_add fusion: added one more UT, found and corrected bugs in pass
7 years ago
Tomasz Patejko 604bad08bc MKLDNN conv + elementwise_add fusion: implementation of patterns refarctored, applied to graph. UTs added
7 years ago
Tomasz Patejko 9ce343f868 MKLDNN conv + elementwise_add fusion: initial implementation of patterns
7 years ago
tensor-tang 40f8456a4f refine fuse pattern and attr
7 years ago
tensor-tang cbbacb2534 Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
7 years ago
tensor-tang 603ba5e01d add seqconv eltadd relu pass
7 years ago
Michal Gallus c504a5a1b7 Adjust Conv+bias to placement pass
7 years ago
Michal Gallus d7509d63f1 Conv+Bias: Support non-null bias
7 years ago
Michal Gallus 582f59c190 Conv+Bias fuse
7 years ago
Wojciech Uss c3b70aece9 Add MKL-DNN placement pass (#13958)
7 years ago
Xin Pan 909e1341bd Merge pull request #13966 from panyx0718/fix4
7 years ago
Tao Luo 7b11162ab5 Merge pull request #13949 from PaddlePaddle/wojtuss/unique-patterns-request-comment
7 years ago
tangwei12 48982e9dc7 fix lookuptable in reduce strategy
7 years ago
Xin Pan 9a819265eb fix
7 years ago
tangwei12 0e722c5ea2 fix lookuptable in reduce strategy
7 years ago
Wojciech Uss 55fd136ab0 Added comment with request for enhancement
7 years ago
Xin Pan 6de08b5eef set default timeout to avoiding blocking CI
7 years ago
Tao Luo 6a4e9230ed Merge branch 'develop' into mkldnn_test
7 years ago
Xin Pan 6a54c3de1f Merge pull request #13928 from panyx0718/doc
7 years ago
Tao Luo e47f4186ae fix some compiler warning
7 years ago
sneaxiy 3419d04c3f test=develop
7 years ago
Xin Pan 7fb5b66ac2 Merge pull request #13916 from panyx0718/fix2
7 years ago
Yan Chunwei 6809238d97 fix analysis predictor profile (#13896)
7 years ago
Xin Pan abbfb60ca9 remove unused codes
7 years ago
Xin Pan 342e436158 Make Var::GetMutable robust
7 years ago
Qiyang Min e3a64fca44 Merge pull request #13835 from velconia/fix_reshape_op
7 years ago
dzhwinter e41a3fcd68 fix update to develop hang problem.
7 years ago
Zeng Jinle af91d41ab8 Merge pull request #13852 from sneaxiy/feature/eager_delete_tensor
7 years ago
Xin Pan 288a112ffd Revert "Revert "Revert "Make variable::GetMutable robust"""
7 years ago
Tao Luo c26f2b21eb Merge pull request #13813 from sfraczek/sfraczek/conv-bn-fuse-pass-full-eigen
7 years ago
dzhwinter 804dd7da04 merge conflict. both linux and windows pass.
7 years ago
dzhwinter 962061f0a3 windows fix
7 years ago
wanghaoshuang b12f7c2399 compile in linux.
7 years ago
Sylwester Fraczek 50c5e9b0c6 reshape_2d used from ddim.h
7 years ago
minqiyang aeec82acd5 Add unittest for reshape op
7 years ago
Xin Pan ddb76d0d09 Make GetMutable more robust
7 years ago
minqiyang 24c9fbdba3 Polish code
7 years ago
sneaxiy d3ed070e10 test=develop
7 years ago
minqiyang d9b202e717 Move tensor copy src_ptr and dst_ptr check to TensorCopy function
7 years ago
sneaxiy fb6201e93e test=develop
7 years ago
wanghaoshuang 3ae9645084 compile in linux
7 years ago
Xin Pan d852be7c48 Revert "Make variable::GetMutable robust"
7 years ago
Xin Pan ae8b1c32bf Merge pull request #13821 from panyx0718/fix
7 years ago
Sylwester Fraczek 55d6950a1a rewrite conv_bn fuse pass to eigen
7 years ago
Xin Pan 228506618b Avoid GetMutable implicitly reset Var Type.
7 years ago
sneaxiy d87569134c test=develop
7 years ago
Tao Luo 9b11a17502 Revert "[MKLDNN] Pass: Fuse Conv + Bias"
7 years ago
Tao Luo ce248a15d9 Merge pull request #13368 from Sand3r-/mgallus/conv-bias-pass
7 years ago
Tao Luo 16b1beb244 Merge pull request #13486 from sfraczek/sfraczek/conv-bn-fuse-pass
7 years ago
Michal Gallus 40b17be4b0 Pass: Fuse Conv + Bias
7 years ago
Sylwester Fraczek 3fcca40909 eigen sqrt fix and change 1e-5 to epsilon
7 years ago
chengduo e1761709f8 Set the right shape of selected_rows (#13723)
7 years ago
dzhwinter a46e30aa6d enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688)
7 years ago
Xin Pan 943e4deb23 Merge pull request #13750 from panyx0718/fix
7 years ago
Sylwester Fraczek 78f98294c2 conv bn fuse pass
7 years ago
Xin Pan d2079b1ddb clean unused code and small optimize
7 years ago
Xin Pan ab798a2832 clarify the fraction_of_gpu_memory flag
7 years ago
Tao Luo d770b9bda3 Merge pull request #13663 from luotao1/resnet50_ut
7 years ago
dzhwinter 32c260cd1f "fix operators cmake" (#13581)
7 years ago
Tao Luo 6ef6a9180a Merge pull request #13727 from Sand3r-/mgallus/enable-mkldnn-naive-exe
7 years ago
Michal Gallus 09d9d77a8f Enable MKLDNN in Naive Executor
7 years ago
sneaxiy 9606b37ce4 test=develop
7 years ago
Yu Yang 186b2b138d Revert "Merge pull request #13201 from reyoung/revert_callstack" (#13697)
7 years ago
Tao Luo a89afd4c22 Merge pull request #13685 from luotao1/naive_cmake
7 years ago
chengduo d6747a9ac2 make check_graph choosable (#13674)
7 years ago
Xin Pan 425a882165 Merge pull request #13643 from panyx0718/ir2
7 years ago
luotao1 a989a4e7c2 refine paddle_inference_helper.h
7 years ago
Xin Pan 642905958a fix compile error
7 years ago
tensor-tang ede4b230be Merge pull request #13553 from jczaja/prv-fused_embedding_fc_lstm_op
7 years ago
Xin Pan 6746b1fdf3 add missing header
7 years ago
Xin Pan ddd60581b7 clean up channel
7 years ago
Tao Luo 83ca657f96 Merge branch 'develop' into resnet50_ut
7 years ago
Jacek Czaja e202f33aa9 - Yet another clarification to comment
7 years ago
Tao Luo 21ee30595b clean some CMakeLists
7 years ago
Jacek Czaja 1df69f7c9d - Fix to comment
7 years ago
Xin Pan d24f1f0aa4 Current scope needs to be thread-safe for training
7 years ago
Yan Chunwei c8744d118d fea/infer executor and concurrency performance issue bug fix (#13451)
7 years ago
chengduo 5175b3cb2b Add GraphChecker (#13580)
7 years ago
Xin Pan 7cd2761736 Merge pull request #13416 from panyx0718/ir
7 years ago
Yu Yang 0be1582df0 Merge pull request #13525 from reyoung/fix_mixed_vector
7 years ago
Jacek Czaja d5114c60b0 - Reviewers suggesstions to fused_embedding_fc_lstm_op
7 years ago
Jacek Czaja 7ab5626dee - Added initial pass for embedding-fc-lstm
7 years ago
Yan Chunwei 9e8d372ff4 hide attention lstm fuse (#13615)
7 years ago
Wu Yi 10dffc68cd Merge pull request #13618 from typhoonzero/revert_13530
7 years ago
tangwei12 97cf1eb6d7 Add distributed unit tests about text_classification/simnet-bow/ctr (#12812)
7 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization (#13530)"
7 years ago
tensor-tang 6938e6cf06 Merge pull request #13603 from tensor-tang/refine/peephole
7 years ago
Wu Yi 16e73e0d28 hide operator API (#12543)
7 years ago
tensor-tang 209e9c3db1 refine peephole
7 years ago
chengduo 1d91a49d2f Some trivial optimization (#13530)
7 years ago
Tao Luo d0000082c4 Merge pull request #13552 from sfraczek/sfraczek/conv-relu-update
7 years ago
Sylwester Fraczek e5d1bd1e93 remove unused variable nodes2delete
7 years ago
Sylwester Fraczek a49aa4dac9 make bias unnecessary for ConvRelu fuse
7 years ago
Sylwester Fraczek 493ef0c8df do not remove conv node just rewire the output
7 years ago
Sylwester Fraczek 667b661786 updated the test
7 years ago
Xin Pan f117feab0c modify comments
7 years ago
Yu Yang 7119d6c3cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_callstack
7 years ago
Xin Pan c2b3838cf5 add some comments
7 years ago
Xin Pan 0d9ee0dced fix resolve conflicts
7 years ago
Xin Pan b43e49fa31 fix
7 years ago
Xin Pan 36c2a9af27 pass builder allow cutomize pass in python.
7 years ago
dzhwinter 97636a9fcf "fix link error" (#13545)
7 years ago
chengduo cdf3a4c244 Fix concat_op InferShape (#13513)
7 years ago
Zeng Jinle f855c05fff Merge pull request #13520 from sneaxiy/enhance_eager_delete
7 years ago
Yu Yang 3043f51b3a Merge pull request #13511 from reyoung/fix_ce
7 years ago
Wu Yi 29c63d180f [Feature] dist op role and lr op role, to support memory optimize with dist training (#13220)
7 years ago
Yu Yang e1913bc5d8 Fix MixedVector
7 years ago
sneaxiy 0a36ef3c18 enhance eager deletion
7 years ago
Yu Yang 6d2c6f96f1 Revert "Revert "Merge pull request #13431 from chengduoZH/refine_lod""
7 years ago
Yu Yang a6c8d6b9a2 Revert "Merge pull request #13431 from chengduoZH/refine_lod"
7 years ago
chengduo d402234ba8 Feature/op_fuse_pass (#12440)
7 years ago
Xin Pan 23ba766252 Merge pull request #13475 from panyx0718/ir5
7 years ago
Xin Pan 759dae0493 Merge pull request #13461 from panyx0718/ir3
7 years ago
Xin Pan 0bd7a67eaf avoid creating dangling ir::Node.
7 years ago
Zeng Jinle 642cf6ca2f Merge pull request #13418 from sneaxiy/dam_save_memory
7 years ago
Yu Yang aa79bccfff Merge pull request #13460 from reyoung/fix_data_transform
7 years ago
Yu Yang 2c31ea9293 Merge pull request #13424 from chengduoZH/refine_seq_concat
7 years ago
sneaxiy fd3e32ea7d Merge develop
7 years ago
Xin Pan 5ce77889f4 clean unused inference_optimize c++ implementation
7 years ago
Yu Yang 922dee3b43 Wait input when data transform
7 years ago
chengduoZH 6534f8527a Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_lod
7 years ago
chengduoZH 24459501fe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_seq_concat
7 years ago
Xin Pan a83a4fab5c Merge pull request #13441 from panyx0718/ir2
7 years ago
sneaxiy d9942cd122 Merge develop
7 years ago
chengduoZH cdb9605bad refine
7 years ago
sneaxiy 114eb17587 fix executor bug
7 years ago
Xin Pan e5b322051b clean
7 years ago
Xin Pan ec6ee0a293 simplify and hide bcast_params
7 years ago
yuyang c809fee3b0 Fix bug of Vector::Extend
7 years ago
chengduoZH def2a8b1b0 Async memcpy
7 years ago
chengduoZH e7940141ce refine seq_concat
7 years ago
sneaxiy 7116cc12fb Merge develop
7 years ago
sneaxiy abf9832c12 tiny change to save memory
7 years ago
sneaxiy 612e1a3155 modification
7 years ago
sneaxiy d0b2453ecd merge develop
7 years ago
sneaxiy 24ea39c4c6 feature/eager_delete_tensor
7 years ago
Tao Luo 9f03ce6c4a Merge pull request #13318 from Superjomn/refine/simple_fix
7 years ago
dzhwinter 85f8dd1c77 debug version
7 years ago
Yan Xu 23ec966cd3 Merge pull request #13360 from Yancey1989/parallel_bcast
7 years ago
Yancey1989 757f9683ab update comment text
7 years ago
Yancey1989 9ee1b7bc04 add some comments
7 years ago
Yancey1989 bad4ea192e update by comment
7 years ago
Yancey1989 1664899b63 update
7 years ago
Yancey1989 2888d2d7bc Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_bcast
7 years ago
Yancey1989 1e1b6622fd update by comment
7 years ago
Sylwester Fraczek dd149d469b hotfix for conv-relu pass
7 years ago
luotao1 b12322ce95 fix fusion_lstm unique_name bug
7 years ago
tensor-tang 62a9821043 Merge pull request #13346 from tensor-tang/refine/infershape
7 years ago
Qiyang Min 6abe03be0c Merge pull request #13317 from velconia/fix_python35_CI_random_fail
7 years ago
Sylwester Fraczek 41de582bb0 create conv relu pass for MKLDNN (#13258)
7 years ago
dzhwinter c3e1fb5a3e add demo
7 years ago
Jiabin Yang d46b575e0e Merge pull request #13349 from JiabinYang/mac/unittest
7 years ago
tensor-tang b0b5f515a9 Merge remote-tracking branch 'ups/develop' into refine/infershape
7 years ago
tensor-tang d61c11764a follow comment add enforce
7 years ago
Yancey1989 5ce1a960a5 move bcast op into pass
7 years ago
tensor-tang 8bb824bb93 refine infershape hasinput and hasoutput
7 years ago
JiabinYang 7dd54afd0c fix program desc unit test error
7 years ago
Yancey1989 23b12c6f58 fix invalide bcast in reduce strategy
7 years ago
tensor-tang e0436ad8bb refine fusion lstm infershape
7 years ago
guochaorong 76e9227467 Merge pull request #13199 from JiayiFeng/fix_CudnnHolder_bug
7 years ago
Xin Pan 17bf8713a5 Merge pull request #12988 from panyx0718/ir2
7 years ago
Yan Chunwei 2fd1bf2ea6 fea/add color log (#13305)
7 years ago
superjomn 6b2f680d88 simple fix
7 years ago
minqiyang dc863aac7e Add kids exists detection in Scope
7 years ago
Yan Chunwei 478a4e850e refactor ir pattern (#13304)
7 years ago
minqiyang 681514e15f Make all scope pointer to shared
7 years ago
Xin Pan e762d85de4 clean
7 years ago
Xin Pan 9b7c3f9615 refine
7 years ago
Xin Pan ff47eaf45f clean
7 years ago
Xin Pan 4313d870a2 refine
7 years ago
Xin Pan c69cf6dde8 fix
7 years ago
Xin Pan 56a977d436 add test
7 years ago
Xin Pan 926e1077ca version
7 years ago
Yang Yu 8331e835a8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_CudnnHolder_bug
7 years ago
tensor-tang 3ea19b7596 fix bug and fc pass ut
7 years ago
tensor-tang df0c695618 fix fusion gru pass and enable it
7 years ago
tensor-tang c9bd2d50f1 refine fc and gru pattern
7 years ago
tensor-tang 353d7d1be6 Merge branch 'develop' into refine/ut/lac
7 years ago
Yan Chunwei e2d325ac08 refactor pass_library (#13261)
7 years ago
tensor-tang 0618077971 Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
tensor-tang 40dbd97f8e Merge remote-tracking branch 'ups/develop' into refine/op/peephole
7 years ago
tensor-tang 4d774953c6 enable fc gru fuse pass
7 years ago
tensor-tang 74f95b8da0 fix redefine macro
7 years ago
Tao Luo 7dd8adb5fd Merge pull request #13211 from luotao1/multi-thread2
7 years ago
tensor-tang f057077c3a add fuse fc gru pass
7 years ago
Yan Chunwei f143c9a916 port VisualDL's graphviz theme to IR (#13246)
7 years ago
tensor-tang 3eb55f0643 Merge remote-tracking branch 'ups/develop' into refine/op/peephole
7 years ago