Commit Graph

1374 Commits (057a682ee9468cb02688f5185eb8f89ed0528f61)

Author SHA1 Message Date
Tomasz Patejko 4e72ab411e MKLDNN conv + elementwise_add fusion: fix for crash when bias is not present
7 years ago
Tomasz Patejko 415b261555 MKLDNN conv + elementwise_add fusion: fusion options added
7 years ago
Tomasz Patejko 0fe3079c46 MKLDNN conv + elementwise_add fusion: fix for order of parameters in elementwise_add in resnet50
7 years ago
Tomasz Patejko b73b868366 MKLDNN conv + elementwise_add fusion: bias in tests made persistent.
7 years ago
Tomasz Patejko a1fa203287 MKLDNN conv + elementwise_add fusion: name of the pass reused with name_scope_
7 years ago
Tomasz Patejko 2c43419db1 MKLDNN conv + elementwise_add fusion: comment explaining CorrectGraphEdges added
7 years ago
Tomasz Patejko 8fb29b2ca9 MKLDNN conv + elementwise_add fusion: new nodes marked as input or output
7 years ago
Tomasz Patejko cc1c8e37c1 MKLDNN conv + elementwise_add fusion: attributes in new conv op copied from old op
7 years ago
Tomasz Patejko a27a8c5da8 MKLDNN conv + elementwise_add fusion: bias in test marked as persistable
7 years ago
Tomasz Patejko af8c71317c MKLDNN conv + elementwise_add fusion: CorrectGraphEdges refactored
7 years ago
Tomasz Patejko 3e033087f1 MKLDNN conv + elementwise_add fusion: LinkNodes function removed and
7 years ago
Tomasz Patejko 4be45af1cc MKLDNN conv + elementwise_add fusion: skip connection attribute renamed. Comments about patterns added.
7 years ago
Tomasz Patejko 9a335e0277 MKLDNN conv + elementwise_add fusion: changed a name of a formal argument in ElementwiseAdd pattern
7 years ago
Tomasz Patejko fb7a50b230 MKLDNN conv + elementwise_add fusion: removed commented code. Internal functions marked as static.
7 years ago
Tomasz Patejko efd76614fb MKLDNN conv + elementwise_add fusion: implementation changed to conform with Paddle API
7 years ago
Tomasz Patejko 347bf90412 MKLDNN conv + elementwise_add fusion: bias is also handled
7 years ago
Tomasz Patejko bf95ac36a7 MKLDNN conv + elementwise_add fusion: further reformatting
7 years ago
Tomasz Patejko cbe122ae2e MKLDNN conv + elementwise_add fusion: correcting formatting
7 years ago
Tomasz Patejko 2a251bbf27 MKLDNN conv + elementwise_add fusion: some refactoring: consts, function calls instead of constant values
7 years ago
Tomasz Patejko b8e54ab5cc MKLDNN conv + elementwise_add fusion: parameter name changed to ResidualData
7 years ago
Tomasz Patejko 27573ece03 MKLDNN conv + elementwise_add fusion: trailing spaces removed
7 years ago
Tomasz Patejko 7f5c8a95e8 MKLDNN conv + elementwise_add fusion: arguments are replaced for many parameters in operator
7 years ago
Tomasz Patejko 5996bd39e8 MKLDNN conv + elementwise_add fusion: graph is corrected based on actual argument name, not formal argument name
7 years ago
Tomasz Patejko 41f3d78fdf MKLDNN conv + elementwise_add fusion: output and elemwise param share data in conv primitive. Output is properly allocated
7 years ago
Tomasz Patejko 07a62ddc08 MKLDNN conv + elementwise_add fusion: inputs in pass modified. Support for new conv parameter. UTs corrected
7 years ago
Tomasz Patejko 42f569fdfd MKLDNN conv + elementwise_add fusion: use_mkldnn attribute added
7 years ago
Tomasz Patejko 441d3a4726 MKLDNN conv + elementwise_add: added some refactoring in the pass
7 years ago
Tomasz Patejko 38b7b34b1c MKLDNN conv + elementwise_add fusion: added reachability tests, inputs and outputs in graph nodes are transformed
7 years ago
Tomasz Patejko 16eaaf3fbe MKLDNN conv + elementwise_add fusion: added one more UT, found and corrected bugs in pass
7 years ago
Tomasz Patejko 604bad08bc MKLDNN conv + elementwise_add fusion: implementation of patterns refarctored, applied to graph. UTs added
7 years ago
Tomasz Patejko 9ce343f868 MKLDNN conv + elementwise_add fusion: initial implementation of patterns
7 years ago
tensor-tang 40f8456a4f refine fuse pattern and attr
7 years ago
tensor-tang cbbacb2534 Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
7 years ago
tensor-tang 603ba5e01d add seqconv eltadd relu pass
7 years ago
sneaxiy 2002e71da8 fix pinned allocator
7 years ago
Michal Gallus c504a5a1b7 Adjust Conv+bias to placement pass
7 years ago
Michal Gallus d7509d63f1 Conv+Bias: Support non-null bias
7 years ago
Michal Gallus 582f59c190 Conv+Bias fuse
7 years ago
Wojciech Uss c3b70aece9 Add MKL-DNN placement pass (#13958)
7 years ago
Xin Pan 909e1341bd Merge pull request #13966 from panyx0718/fix4
7 years ago
Tao Luo 7b11162ab5 Merge pull request #13949 from PaddlePaddle/wojtuss/unique-patterns-request-comment
7 years ago
tangwei12 48982e9dc7 fix lookuptable in reduce strategy
7 years ago
Xin Pan 9a819265eb fix
7 years ago
tangwei12 0e722c5ea2 fix lookuptable in reduce strategy
7 years ago
Wojciech Uss 55fd136ab0 Added comment with request for enhancement
7 years ago
Xin Pan 6de08b5eef set default timeout to avoiding blocking CI
7 years ago
Tao Luo 6a4e9230ed Merge branch 'develop' into mkldnn_test
7 years ago
Xin Pan 6a54c3de1f Merge pull request #13928 from panyx0718/doc
7 years ago
Tao Luo e47f4186ae fix some compiler warning
7 years ago
sneaxiy 3419d04c3f test=develop
7 years ago
Xin Pan 7fb5b66ac2 Merge pull request #13916 from panyx0718/fix2
7 years ago
Yan Chunwei 6809238d97 fix analysis predictor profile (#13896)
7 years ago
Xin Pan abbfb60ca9 remove unused codes
7 years ago
Xin Pan 342e436158 Make Var::GetMutable robust
7 years ago
Qiyang Min e3a64fca44 Merge pull request #13835 from velconia/fix_reshape_op
7 years ago
Zeng Jinle af91d41ab8 Merge pull request #13852 from sneaxiy/feature/eager_delete_tensor
7 years ago
Xin Pan 288a112ffd Revert "Revert "Revert "Make variable::GetMutable robust"""
7 years ago
Tao Luo c26f2b21eb Merge pull request #13813 from sfraczek/sfraczek/conv-bn-fuse-pass-full-eigen
7 years ago
Sylwester Fraczek 50c5e9b0c6 reshape_2d used from ddim.h
7 years ago
minqiyang aeec82acd5 Add unittest for reshape op
7 years ago
Xin Pan ddb76d0d09 Make GetMutable more robust
7 years ago
minqiyang 24c9fbdba3 Polish code
7 years ago
sneaxiy d3ed070e10 test=develop
7 years ago
minqiyang d9b202e717 Move tensor copy src_ptr and dst_ptr check to TensorCopy function
7 years ago
sneaxiy fb6201e93e test=develop
7 years ago
Xin Pan d852be7c48 Revert "Make variable::GetMutable robust"
7 years ago
Xin Pan ae8b1c32bf Merge pull request #13821 from panyx0718/fix
7 years ago
Sylwester Fraczek 55d6950a1a rewrite conv_bn fuse pass to eigen
7 years ago
Xin Pan 228506618b Avoid GetMutable implicitly reset Var Type.
7 years ago
sneaxiy d87569134c test=develop
7 years ago
Tao Luo 9b11a17502 Revert "[MKLDNN] Pass: Fuse Conv + Bias"
7 years ago
Tao Luo ce248a15d9 Merge pull request #13368 from Sand3r-/mgallus/conv-bias-pass
7 years ago
Tao Luo 16b1beb244 Merge pull request #13486 from sfraczek/sfraczek/conv-bn-fuse-pass
7 years ago
Michal Gallus 40b17be4b0 Pass: Fuse Conv + Bias
7 years ago
Sylwester Fraczek 3fcca40909 eigen sqrt fix and change 1e-5 to epsilon
7 years ago
sneaxiy ea61e4ef15 merge develop
7 years ago
chengduo e1761709f8 Set the right shape of selected_rows (#13723)
7 years ago
sneaxiy 4c672ab1a2 Merge reyoung:rewrite_allocation
7 years ago
dzhwinter a46e30aa6d enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688)
7 years ago
Xin Pan 943e4deb23 Merge pull request #13750 from panyx0718/fix
7 years ago
Sylwester Fraczek 78f98294c2 conv bn fuse pass
7 years ago
Xin Pan d2079b1ddb clean unused code and small optimize
7 years ago
Xin Pan ab798a2832 clarify the fraction_of_gpu_memory flag
7 years ago
Tao Luo d770b9bda3 Merge pull request #13663 from luotao1/resnet50_ut
7 years ago
dzhwinter 32c260cd1f "fix operators cmake" (#13581)
7 years ago
Tao Luo 6ef6a9180a Merge pull request #13727 from Sand3r-/mgallus/enable-mkldnn-naive-exe
7 years ago
Yu Yang 15076c325e Add comments and polish code style
7 years ago
Michal Gallus 09d9d77a8f Enable MKLDNN in Naive Executor
7 years ago
sneaxiy 9606b37ce4 test=develop
7 years ago
Yu Yang 186b2b138d Revert "Merge pull request #13201 from reyoung/revert_callstack" (#13697)
7 years ago
Tao Luo a89afd4c22 Merge pull request #13685 from luotao1/naive_cmake
7 years ago
Yu Yang a1a01899c8 Refine
7 years ago
Yu Yang 31270e58d0 Add communication attr
7 years ago
chengduo d6747a9ac2 make check_graph choosable (#13674)
7 years ago
Xin Pan 425a882165 Merge pull request #13643 from panyx0718/ir2
7 years ago
luotao1 a989a4e7c2 refine paddle_inference_helper.h
7 years ago
Xin Pan 642905958a fix compile error
7 years ago
tensor-tang ede4b230be Merge pull request #13553 from jczaja/prv-fused_embedding_fc_lstm_op
7 years ago
Yu Yang 5cf395beaf Fix bug in uts
7 years ago
Xin Pan 6746b1fdf3 add missing header
7 years ago
Xin Pan ddd60581b7 clean up channel
7 years ago
Yu Yang a6fbf7ec28 Merge branch 'feature/refine_generate_proposals_op' into rewrite_allocation
7 years ago
Tao Luo 83ca657f96 Merge branch 'develop' into resnet50_ut
7 years ago
Jacek Czaja e202f33aa9 - Yet another clarification to comment
7 years ago
Tao Luo 21ee30595b clean some CMakeLists
7 years ago
Jacek Czaja 1df69f7c9d - Fix to comment
7 years ago
Xin Pan d24f1f0aa4 Current scope needs to be thread-safe for training
7 years ago
Yan Chunwei c8744d118d fea/infer executor and concurrency performance issue bug fix (#13451)
7 years ago
Yu Yang 58ed412f68 refactor(memory): rewrite memory allocation and make it extentable
7 years ago
chengduo 5175b3cb2b Add GraphChecker (#13580)
7 years ago
Xin Pan 7cd2761736 Merge pull request #13416 from panyx0718/ir
7 years ago
Yu Yang 0be1582df0 Merge pull request #13525 from reyoung/fix_mixed_vector
7 years ago
Jacek Czaja d5114c60b0 - Reviewers suggesstions to fused_embedding_fc_lstm_op
7 years ago
Jacek Czaja 7ab5626dee - Added initial pass for embedding-fc-lstm
7 years ago
Yan Chunwei 9e8d372ff4 hide attention lstm fuse (#13615)
7 years ago
Wu Yi 10dffc68cd Merge pull request #13618 from typhoonzero/revert_13530
7 years ago
tangwei12 97cf1eb6d7 Add distributed unit tests about text_classification/simnet-bow/ctr (#12812)
7 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization (#13530)"
7 years ago
tensor-tang 6938e6cf06 Merge pull request #13603 from tensor-tang/refine/peephole
7 years ago
Wu Yi 16e73e0d28 hide operator API (#12543)
7 years ago
tensor-tang 209e9c3db1 refine peephole
7 years ago
chengduo 1d91a49d2f Some trivial optimization (#13530)
7 years ago
Tao Luo d0000082c4 Merge pull request #13552 from sfraczek/sfraczek/conv-relu-update
7 years ago
Sylwester Fraczek e5d1bd1e93 remove unused variable nodes2delete
7 years ago
Sylwester Fraczek a49aa4dac9 make bias unnecessary for ConvRelu fuse
7 years ago
Sylwester Fraczek 493ef0c8df do not remove conv node just rewire the output
7 years ago
Sylwester Fraczek 667b661786 updated the test
7 years ago
Xin Pan f117feab0c modify comments
7 years ago
Yu Yang 7119d6c3cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_callstack
7 years ago
Xin Pan c2b3838cf5 add some comments
7 years ago
Xin Pan 0d9ee0dced fix resolve conflicts
7 years ago
Xin Pan b43e49fa31 fix
7 years ago
Xin Pan 36c2a9af27 pass builder allow cutomize pass in python.
7 years ago
dzhwinter 97636a9fcf "fix link error" (#13545)
7 years ago
chengduo cdf3a4c244 Fix concat_op InferShape (#13513)
7 years ago
Zeng Jinle f855c05fff Merge pull request #13520 from sneaxiy/enhance_eager_delete
7 years ago
Yu Yang 3043f51b3a Merge pull request #13511 from reyoung/fix_ce
7 years ago
Wu Yi 29c63d180f [Feature] dist op role and lr op role, to support memory optimize with dist training (#13220)
7 years ago
Yu Yang e1913bc5d8 Fix MixedVector
7 years ago
sneaxiy 0a36ef3c18 enhance eager deletion
7 years ago
Yu Yang 6d2c6f96f1 Revert "Revert "Merge pull request #13431 from chengduoZH/refine_lod""
7 years ago
Yu Yang a6c8d6b9a2 Revert "Merge pull request #13431 from chengduoZH/refine_lod"
7 years ago
chengduo d402234ba8 Feature/op_fuse_pass (#12440)
7 years ago
Xin Pan 23ba766252 Merge pull request #13475 from panyx0718/ir5
7 years ago
Xin Pan 759dae0493 Merge pull request #13461 from panyx0718/ir3
7 years ago
Xin Pan 0bd7a67eaf avoid creating dangling ir::Node.
7 years ago
Zeng Jinle 642cf6ca2f Merge pull request #13418 from sneaxiy/dam_save_memory
7 years ago
Yu Yang aa79bccfff Merge pull request #13460 from reyoung/fix_data_transform
7 years ago
Yu Yang 2c31ea9293 Merge pull request #13424 from chengduoZH/refine_seq_concat
7 years ago
sneaxiy fd3e32ea7d Merge develop
7 years ago