Commit Graph

11382 Commits (ba8bbe159b99162ae28e36aff1bc2f81fcec5713)

Author SHA1 Message Date
jerrywgz e0708e62ba refine code
6 years ago
jerrywgz 1c591c3909
Merge branch 'develop' into fix_rpn_target_assign_op
6 years ago
sneaxiy a9d7a9d720 test=develop
6 years ago
Tao Luo 316bc9bfc9 fix typo and warning in analyzer_resnet50_test
6 years ago
guosheng 6447b69aec Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
6 years ago
jerrywgz f06c6193d7 fix rpn target assign test=develop
6 years ago
Xin Pan 4625f83f92 better handle var type inference
6 years ago
Xin Pan 8f2116d8fa clean up after the changes have been stopped for so long.
6 years ago
tensor-tang 83dc689877 Merge remote-tracking branch 'ups/develop' into refine/jit/gru
6 years ago
tensor-tang 640e789d3d add fusion gru jit kernel
6 years ago
qingqing01 0e24138494
Merge pull request #13991 from qingqing01/refine_generate_proposals_op
6 years ago
gongweibao 58c027cc38
Add rpc profiler flags. (#13989)
6 years ago
Xin Pan d10e54c460
Merge pull request #14003 from chengduoZH/fix_fast_parallel_exe_bug
6 years ago
Tao Luo 42aa1d409d
Merge pull request #13485 from tpatejko/tpatejko/capi-resnet-conv-elementwise-fusion
6 years ago
tensor-tang 664159ad42
Merge pull request #13998 from tensor-tang/fea/fusion_seqconv_add
6 years ago
guosheng 6d3b030bb5 Refine the api of reshape to be compatible.
6 years ago
chengduozh 82d2903b63 Fix fast ParallelExe bug
6 years ago
Tomasz Patejko aa35aaa1ab MKLDNN conv + elementwise_add fusion: fixing formatting
6 years ago
jerrywgz 765085d297
Merge pull request #13904 from jerrywgz/roialign
6 years ago
Dang Qingqing 56936b9e25 Refine doc for generate_proposals_op.
6 years ago
Tomasz Patejko ce2464fd98 MKLDNN conv + elementwise_add fusion: UT for missing bias added. UTs refactored. Some minor changes in the pass
6 years ago
Tomasz Patejko 4e72ab411e MKLDNN conv + elementwise_add fusion: fix for crash when bias is not present
6 years ago
Tomasz Patejko 415b261555 MKLDNN conv + elementwise_add fusion: fusion options added
6 years ago
Tomasz Patejko 1676094697 MKLDNN conv + elementwise_add fusion: turn on residual connection pass when CAPI is used.
6 years ago
Tomasz Patejko 0fe3079c46 MKLDNN conv + elementwise_add fusion: fix for order of parameters in elementwise_add in resnet50
6 years ago
Tomasz Patejko b73b868366 MKLDNN conv + elementwise_add fusion: bias in tests made persistent.
6 years ago
Tomasz Patejko a1fa203287 MKLDNN conv + elementwise_add fusion: name of the pass reused with name_scope_
6 years ago
Tomasz Patejko 2c43419db1 MKLDNN conv + elementwise_add fusion: comment explaining CorrectGraphEdges added
6 years ago
Tomasz Patejko 8fb29b2ca9 MKLDNN conv + elementwise_add fusion: new nodes marked as input or output
6 years ago
Tomasz Patejko cc1c8e37c1 MKLDNN conv + elementwise_add fusion: attributes in new conv op copied from old op
6 years ago
Tomasz Patejko a27a8c5da8 MKLDNN conv + elementwise_add fusion: bias in test marked as persistable
6 years ago
Tomasz Patejko af8c71317c MKLDNN conv + elementwise_add fusion: CorrectGraphEdges refactored
6 years ago
Tomasz Patejko 3e033087f1 MKLDNN conv + elementwise_add fusion: LinkNodes function removed and
6 years ago
Tomasz Patejko 4be45af1cc MKLDNN conv + elementwise_add fusion: skip connection attribute renamed. Comments about patterns added.
6 years ago
Tomasz Patejko 9a335e0277 MKLDNN conv + elementwise_add fusion: changed a name of a formal argument in ElementwiseAdd pattern
6 years ago
Tomasz Patejko fb7a50b230 MKLDNN conv + elementwise_add fusion: removed commented code. Internal functions marked as static.
6 years ago
Michal Gallus f688197182 MKLDNN conv + elementwise_add fusion: Fix output_data to point to the right tensor, also fix transpiler integration
6 years ago
Tomasz Patejko efd76614fb MKLDNN conv + elementwise_add fusion: implementation changed to conform with Paddle API
6 years ago
Tomasz Patejko 347bf90412 MKLDNN conv + elementwise_add fusion: bias is also handled
6 years ago
Tomasz Patejko bf95ac36a7 MKLDNN conv + elementwise_add fusion: further reformatting
6 years ago
Tomasz Patejko cbe122ae2e MKLDNN conv + elementwise_add fusion: correcting formatting
6 years ago
Tomasz Patejko 2a251bbf27 MKLDNN conv + elementwise_add fusion: some refactoring: consts, function calls instead of constant values
6 years ago
Tomasz Patejko b8e54ab5cc MKLDNN conv + elementwise_add fusion: parameter name changed to ResidualData
6 years ago
Tomasz Patejko 27573ece03 MKLDNN conv + elementwise_add fusion: trailing spaces removed
6 years ago
Tomasz Patejko 7f5c8a95e8 MKLDNN conv + elementwise_add fusion: arguments are replaced for many parameters in operator
6 years ago
Tomasz Patejko 5996bd39e8 MKLDNN conv + elementwise_add fusion: graph is corrected based on actual argument name, not formal argument name
6 years ago
Tomasz Patejko 41f3d78fdf MKLDNN conv + elementwise_add fusion: output and elemwise param share data in conv primitive. Output is properly allocated
6 years ago
Tomasz Patejko 07a62ddc08 MKLDNN conv + elementwise_add fusion: inputs in pass modified. Support for new conv parameter. UTs corrected
6 years ago
Tomasz Patejko 56528531ea MKLDNN conv + elementwis_add fusion: initial work on passing eltwise data to conv primitive
6 years ago
Tomasz Patejko 42f569fdfd MKLDNN conv + elementwise_add fusion: use_mkldnn attribute added
6 years ago
Tomasz Patejko 441d3a4726 MKLDNN conv + elementwise_add: added some refactoring in the pass
6 years ago
Tomasz Patejko 38b7b34b1c MKLDNN conv + elementwise_add fusion: added reachability tests, inputs and outputs in graph nodes are transformed
6 years ago
Tomasz Patejko 16eaaf3fbe MKLDNN conv + elementwise_add fusion: added one more UT, found and corrected bugs in pass
6 years ago
Tomasz Patejko 604bad08bc MKLDNN conv + elementwise_add fusion: implementation of patterns refarctored, applied to graph. UTs added
6 years ago
Tomasz Patejko 9ce343f868 MKLDNN conv + elementwise_add fusion: initial implementation of patterns
6 years ago
tensor-tang 40f8456a4f refine fuse pattern and attr
6 years ago
tensor-tang cbbacb2534 Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
6 years ago
tensor-tang 603ba5e01d add seqconv eltadd relu pass
6 years ago
Dang Qingqing 4801ee8f97 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_generate_proposals_op
6 years ago
Tao Luo da722d6d9b
Merge pull request #13858 from Sand3r-/mgallus/conv-bias-pass
6 years ago
Tao Luo a4b48f70c1
Merge pull request #13997 from wojtuss/wojtuss/do-not-enable-mkldnn-twice
6 years ago
Tao Luo 252401c5d3
Merge pull request #13992 from wojtuss/wojtuss/add-ifdef-mkldnn
6 years ago
Michał Gallus f9ca31811d
Remove use mkldnn from config in resnet50 test
6 years ago
tensor-tang 23fc896bc2 Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
6 years ago
tensor-tang 339e655aec refine and add seqconv elementwiseadd relu op test
6 years ago
Michal Gallus c504a5a1b7 Adjust Conv+bias to placement pass
6 years ago
Michal Gallus d7509d63f1 Conv+Bias: Support non-null bias
6 years ago
Michal Gallus 91e8fbac2f Enable MKLDNN in Resnet50Tester
6 years ago
Michal Gallus 582f59c190 Conv+Bias fuse
6 years ago
jerrywgz a1d3db031b
Merge pull request #13844 from jerrywgz/fix_roi_pool
6 years ago
guosheng dfb841ad5a Make reshape_op reuse input.
6 years ago
Dang Qingqing 8e0b9496de Fix unit test
6 years ago
Wojciech Uss e6f480ec44 add comment on the default first pass
6 years ago
Wojciech Uss 2cf258e381 remove redundant pass list
6 years ago
Wojciech Uss 5632019f0f add MKL-DNN placement pass
6 years ago
tensor-tang 0a9f5f1790
Merge pull request #13968 from tensor-tang/fix/jit/exp
6 years ago
Wojciech Uss 5083ec3a1b do not enable MKL-DNN twice
6 years ago
Yipeng fcb2e8103e Ocr end2end dev (#13889)
6 years ago
tensor-tang e5ce965952 refine and add eltadd_relu unit test
6 years ago
sneaxiy 5a38930660 test=develop
6 years ago
Wojciech Uss c3b70aece9 Add MKL-DNN placement pass (#13958)
6 years ago
Wojciech Uss 4a368a4901 add ifdef guard for MKL-DNN placement pass
6 years ago
Xin Pan 909e1341bd
Merge pull request #13966 from panyx0718/fix4
6 years ago
chengduo 9775e50ca2
Fix add doc for bias_attr (#13937)
6 years ago
Tao Luo 7b11162ab5
Merge pull request #13949 from PaddlePaddle/wojtuss/unique-patterns-request-comment
6 years ago
tensor-tang 7cb19a5976 fuse elementwise_add and relu
6 years ago
tensor-tang 3c249283af init seqconv eltadd relu op
6 years ago
tangwei12 48982e9dc7 fix lookuptable in reduce strategy
6 years ago
Xin Pan 9a819265eb fix
6 years ago
sneaxiy ac2eba4457 test=develop
6 years ago
Tao Luo 305034f5b3
Merge pull request #13909 from luotao1/mkldnn_test
6 years ago
superjomn b77e4f4978 update
6 years ago
jerrywgz 553342624e test=develop
6 years ago
jerrywgz 9a14ca91b8 test=develop
6 years ago
tensor-tang 60ff05e312 Merge branch 'luotao1-fix_rnn2_test' into fix/jit/exp
6 years ago
Tao Luo ef09862450 fix analyzer_rnn2_test
6 years ago
tangwei12 0e722c5ea2 fix lookuptable in reduce strategy
6 years ago
Tao Luo e5b4643ad8 add profile_mkldnn test
6 years ago
Tao Luo 7d680be5a3 Merge branch 'develop' into mkldnn_test
6 years ago
buxingyuan 0bb3b099c2 generate_proposal_labels doc
6 years ago
Wojciech Uss 55fd136ab0 Added comment with request for enhancement
6 years ago
gongweibao a831ecc75d
Add grpc error context. (#13957)
6 years ago
tensor-tang b139b687de Merge remote-tracking branch 'ups/develop' into fix/jit/exp
6 years ago
qingqing01 67a2b5215d
Add affine channel op to speed and save memory for faster-rcnn model. (#13919)
6 years ago
tensor-tang 748435586a clean code exp avx
6 years ago
tensor-tang b4751a34a5 fix illegal instruction of rnn2
6 years ago
Xin Pan 6de08b5eef set default timeout to avoiding blocking CI
6 years ago
tensor-tang 30dfbdee7f
Merge pull request #13951 from tensor-tang/fix/warning
6 years ago
Tao Luo 34ed7d1379
Merge pull request #13924 from luotao1/clean_inference_lib
6 years ago
tensor-tang 36588b3365 fix illegal instruction of rnn1 and text
6 years ago
Tao Luo 6a4e9230ed Merge branch 'develop' into mkldnn_test
6 years ago
gongweibao 078223b3e3
Add rpc timeline. (#13900)
6 years ago
dzhwinter 29382db625
Merge pull request #13874 from dzhwinter/fix/momentum
6 years ago
Xin Pan 6a54c3de1f
Merge pull request #13928 from panyx0718/doc
6 years ago
qingqing01 5dbb2e9986
Small changes for sum_op to avoid zero setting. (#13923)
6 years ago
Tao Luo b819684370 add compare_mkldnn test
6 years ago
Tao Luo e47f4186ae fix some compiler warning
6 years ago
nhzlx b970c6d5d0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
6 years ago
nhzlx 32072d31b5 fix demo ci error on manylinux
6 years ago
Jiabin Yang 6553956bd6
Merge pull request #13931 from JiabinYang/fix_dist_on_mac
6 years ago
dzhwinter 00e8791f66 fix compile in cpu error. test=develop
6 years ago
tensor-tang e69328c3bc fix warning and mac compile
6 years ago
Tao Luo 6ea9d1b595 add analysis_predictor in vis_demo
6 years ago
Tao Luo f444a7226e Merge branch 'develop' into clean_inference_lib
6 years ago
Tao Luo 3598500773
Merge pull request #13867 from Superjomn/clean/CreatePaddlePredictor
6 years ago
Tao Luo 41eeb771e8 Merge branch 'develop' into clean_inference_lib
6 years ago
sneaxiy 3419d04c3f test=develop
6 years ago
dzhwinter d239cf2e15 use binary search. test=develop
6 years ago
dzhwinter a9f5f822e6 use binary search. test=develop
6 years ago
Tao Luo b854d959a5 update with comments
6 years ago
nhzlx 2b5edfbc37 Add ceil model pooling for trt (ocr attention)
6 years ago
Tao Luo 75bb0babef Merge branch 'develop' into mkldnn_test
6 years ago
tensor-tang 6447155dac
Merge pull request #13851 from tensor-tang/fea/jitkernel_peephole
6 years ago
sneaxiy 4b4af84e67 test=develop
6 years ago
jerrywgz 4c9884e713 refine unittest test=develop
6 years ago
JiabinYang 02f863400e test=develop
6 years ago
Qiao Longfei 0225957515 change elementwise_add to elementwise_add_to test=develop
6 years ago
Qiao Longfei bd2b6d7f8f sum_op support inplace
6 years ago
Xin Pan 7fb5b66ac2
Merge pull request #13916 from panyx0718/fix2
6 years ago
Yan Chunwei 6809238d97
fix analysis predictor profile (#13896)
6 years ago
Xin Pan abbfb60ca9 remove unused codes
6 years ago
Yibing Liu 6b795d424c
Merge pull request #13901 from kuke/seq_slice_py
6 years ago
dzhwinter 3861269594 merge develop branch
6 years ago
jerrywgz 98c3294b85 Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
6 years ago
Tao Luo a35e7f4bae adjust demo_ci with fluid_inference_install_dir
6 years ago
tangwei12 fa2ab3346c
fill constant add infervarshape, lookuptable clone lr var (#13830)
6 years ago
jerrywgz 8c79071d6a roi_align for gpu
6 years ago
Xin Pan 342e436158 Make Var::GetMutable robust
6 years ago
Yan Chunwei 7a751b83ac fix isfinite_op sprintf (#13850)
6 years ago
Qiyang Min e3a64fca44
Merge pull request #13835 from velconia/fix_reshape_op
6 years ago
Yibing Liu 46b0b7903c
Merge pull request #13856 from kuke/seq_unpad_op
6 years ago
tensor-tang dcfb687584
Merge pull request #13846 from tensor-tang/fix/mkldnn
6 years ago
Qiao Longfei b4a32eafdf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
6 years ago
Tao Luo bd77460182 refine mkldnn test in analyzer_tests
6 years ago
jerrywgz c9d2046f76 roi_align for gpu
6 years ago
jerrywgz 2f5a80174e add roi_align api
6 years ago
Zeng Jinle af91d41ab8
Merge pull request #13852 from sneaxiy/feature/eager_delete_tensor
6 years ago
Zeng Jinle 93606c2c2c
Merge pull request #13689 from sneaxiy/sparse_rmsprop
6 years ago
Qiao Longfei 681226e97c
Merge pull request #13864 from jacquesqiao/py-reader-add-test-mode
6 years ago
jerrywgz 90f39b1123 Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
6 years ago
Tao Luo f8874b3cb2
Merge pull request #13884 from luotao1/rename_inference_lib_dist
6 years ago
Xin Pan 288a112ffd
Revert "Revert "Revert "Make variable::GetMutable robust"""
6 years ago
sneaxiy 5cedfb60c8 test=develop
6 years ago
Yibing Liu b785798585 Expose layer's name for sequence pad & unpad
6 years ago
Yibing Liu 18e1c1e07d Update API spec for seq slice
6 years ago
jerrywgz 5e52dafda5 add roi align
6 years ago
jerrywgz c0e34eebec add roi align
6 years ago
Tao Luo c26f2b21eb
Merge pull request #13813 from sfraczek/sfraczek/conv-bn-fuse-pass-full-eigen
6 years ago
Yibing Liu 16b2c6dc78 Add py api for sequence_slice_op
6 years ago
superjomn 1cfd2b51a7 update
6 years ago
Xin Pan fededdda20
Merge pull request #13872 from panyx0718/fix2
6 years ago
Qiao Longfei b16e9cd105
a small fix for compile WITH_INFERENCE=OFF (#13869)
6 years ago
Qiao Longfei ec25a09bd5 revert unused change test=develop
6 years ago
Qiao Longfei 60030e8678 change the use of FLAGS_reader_queue_speed_test_mode
6 years ago
Tao Luo 323d67cfc1
Merge pull request #13879 from panyx0718/doc
6 years ago
Qiao Longfei 936926aadd code optimize
6 years ago
Sylwester Fraczek 50c5e9b0c6 reshape_2d used from ddim.h
6 years ago
Qiyang Min cab29828a5
Merge pull request #13829 from velconia/accelerate_sequence_pool_op
6 years ago
minqiyang aeec82acd5 Add unittest for reshape op
6 years ago
Qiao Longfei 9fd78df71c revert unused change
6 years ago
Xin Pan ddb76d0d09 Make GetMutable more robust
6 years ago
Qiao Longfei c52ccbc109 clean code
6 years ago
Qiao Longfei 6056d04361 optimize blas call
6 years ago
Qiyang Min c2842377ce
Merge pull request #13837 from velconia/add_pyramid_dnn_support
6 years ago
Qiao Longfei 5db7551317 optimize code
6 years ago
minqiyang 24c9fbdba3 Polish code
6 years ago
chengduo 2c9839c847
add cuda version display (#13885)
6 years ago
sneaxiy d3ed070e10 test=develop
6 years ago
minqiyang d9b202e717 Move tensor copy src_ptr and dst_ptr check to TensorCopy function
6 years ago
sneaxiy fb6201e93e test=develop
6 years ago
chengduo 8e2fdc54b1
Add check for opt op (#13840)
6 years ago
Qiao Longfei eb6d9e3bbe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
6 years ago
Yibing Liu 46e61d81a7 Wrapper py api for sequence_unpad
6 years ago
Qiao Longfei 0170d36c42 fix a bug
6 years ago
superjomn 28459592cc update
6 years ago
Qiyang Min e37c9e6732
Merge pull request #13828 from velconia/accelerate_selected_rows_functor
6 years ago
Qiao Longfei 86e2e686ee fix bug
6 years ago
Qiao Longfei 333fd15204 add gpu test for mrege add
6 years ago
Tao Luo 3d976f3f18 rename inference_lib_dist to fluid_lib_dist
6 years ago
Qiao Longfei ab3e36da80 update MergeAdd for selected_rows_functor.cu
6 years ago