Commit Graph

507 Commits (13b03e7ad6e9d555b972cca8e5e84ee0d4b61c99)

Author SHA1 Message Date
Yan Chunwei e84234b551
make clone thread safe (#15363)
6 years ago
Zhaolong Xing 236201c222
Merge pull request #15350 from NHZlX/fix_bug_for_precditor
6 years ago
Yan Chunwei e07900d317
cache tensor ptr in ZeroCopyTensor (#15352)
6 years ago
Yan Chunwei b7916440ff
hot fix the Native clone (#15344)
6 years ago
nhzlx b95f2ff8fe fix win build bug
6 years ago
nhzlx b938324381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
nhzlx 312fe0ece1 add trt int8 calibration support
6 years ago
Yiqun Liu 568cc2ffa8
Optimize while_op for test (#14764)
6 years ago
nhzlx b2ba3471fd fix analysis config bug.
6 years ago
tensor-tang a5d2a6d1ad add fuse pass of sequared mat sub fusion
6 years ago
tensor-tang a89296ac1f add repeated fc relu pass
6 years ago
Zhaolong Xing 98e85f3735 add_transpose_flatten_concat_fuse (#15121)
6 years ago
wopeizl 5d9edb4124
Merge pull request #15156 from wopeizl/windows/fixgpuissue
6 years ago
tensor-tang 146e942c65
Merge pull request #15250 from tensor-tang/refine/seqpool/feed
6 years ago
peizhilin 439691f5bd adjust the shlwapi on windows
6 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
6 years ago
peizhilin e239558e56 remove the dismatch enclosure to avoid warning message test=develop
6 years ago
Tao Luo 2b11c710b3
Merge pull request #15249 from NHZlX/fix_trt_demo_ci
6 years ago
tensor-tang 137060135e fix zerocopy size
6 years ago
nhzlx e7d83389e6 fix demo ci bug
6 years ago
nhzlx 4e3522e5b4 add trt int8 support
6 years ago
tensor-tang 72d2a1801e add seqpool concat fuse pass
6 years ago
Yan Chunwei d09d6eadc0
make inference api work with Doxygen (#15195)
6 years ago
Yan Chunwei 875a07c32d
refactor inference analysis api (#14634)
6 years ago
tensor-tang 516fe301ee add comment in case of empty name
6 years ago
tensor-tang dca68cdf97 throw error when name not find
6 years ago
tensor-tang cd94df8679 fix load and refine
6 years ago
Zhaolong Xing 4048cfa9da
Merge pull request #15048 from NHZlX/add_affine_channel_fuse
6 years ago
Zeng Jinle c0bcff00dc
Merge pull request #14962 from sneaxiy/rewrite_variable_type
6 years ago
Tao Luo 05f1b65da3 simplify prepere_input in analyzer_test
6 years ago
nhzlx 02e17396c2 fix comments
6 years ago
nhzlx 71636e677d add min_subgraph_size attr to tensorrt config
6 years ago
sneaxiy dde3afe7b7 Merge develop
6 years ago
nhzlx 73b47df1f4 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse
6 years ago
nhzlx ce3782c193 add affine_channel fuse.
6 years ago
qingqing01 51a9fca323
Async memory copy (#15013)
6 years ago
sneaxiy ae6f46a1a9 rewrite variable type
6 years ago
peizhilin 07c7eaabb4 Merge remote-tracking branch 'upstream/develop' into windows/mkl
6 years ago
Zhaolong Xing a9fb34fad8
Merge pull request #14903 from NHZlX/add_conv_elementwise_pass
6 years ago
peizhilin 5a6d7fe2ff add mkl,ctc support for windows
6 years ago
wopeizl 0f085f0a5a
Merge pull request #14892 from wopeizl/windows/port3
6 years ago
nhzlx fcc93d96d5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_conv_elementwise_pass
6 years ago
Yu Yang bacf1d2399 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
nhzlx 4e4a777243 add conv+elementwiseadd pass
6 years ago
Yan Chunwei a985949be9
Fea/fuse conv elementwise add fuse (#14669)
6 years ago
Yu Yang 04a570b463 Fix ut
6 years ago
peizhilin 23dec78772 fix script issue
6 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
6 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
6 years ago
Yihua Xu 3821fc3950 Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
6 years ago
Tao Luo 743cb840f1 update with comments
6 years ago
Tao Luo 42359e88a4 clean code
6 years ago
Tao Luo 405b2486db support loading from memory
6 years ago
Xin Pan 7e0801d4ed
Merge pull request #14441 from baojun-nervana/intel/ngraph_op
6 years ago
Yihua Xu 64e261c6cd Implement the fusion of convolution and bias for mkldnn
6 years ago
Yan Chunwei 4b7617740e
fix container not cleared (#14231)
6 years ago
nhzlx 49c28b8c52 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
6 years ago
Sang Ik Lee 24e70920db Refactor some build settings.
6 years ago
Sang Ik Lee d6125a5eec Include ngraph in inference demo build.
6 years ago
Tao Luo b4de023ee1
Merge pull request #14636 from Superjomn/fix/word2vec
6 years ago
nhzlx d3e140a572 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
6 years ago
nhzlx 900fbb83f9 add params sync pass
6 years ago
superjomn 9c665c81ae update
6 years ago
minqiyang a02ce58f2c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
6 years ago
Yiqun Liu 726f2cefe3
Fix bug of referencing a temporary variable. (#14614)
6 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
6 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
6 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
qingqing01 39ec80def4
Remove the memory copy of feeding data in C++ inference API (#14577)
6 years ago
peizhilin 1afa9492af Recover the profiler
6 years ago
Yiqun Liu bf222f197d
Use sub scope in tensor_array_to_tensor op. (#14524)
6 years ago
dzhwinter 840c1b29ad
test=develop (#14562)
6 years ago
luotao1 116979a40a refine api name
6 years ago
luotao1 a5c4b463c9 add SetMKLDNNThreadId api
6 years ago
luotao1 e21edb26f6 add Set/GetCPUNumThreads api
6 years ago
peizhilin 7c8c9dc9bf fix unit test cases
6 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
6 years ago
peizhilin 6e66fadb95 clean up the pre-definitions on windows
6 years ago
nhzlx a4dc1d4292 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
6 years ago
nhzlx faeb9b8aa9 fix compile rely problem
6 years ago
nhzlx b742d46520 fix demo ci bug on trt
6 years ago
hjchen2 a8c077df7c Implement leaky relu tensorRT converter
6 years ago
Superjomn e878a8e885 update
6 years ago
superjomn 4bf6817cbc fix gpu load model
6 years ago
Zhaolong Xing 2f27c048cc
Merge pull request #14440 from hjchen2/develop
6 years ago
hjchen2 21f33b4274 Complete PRelu plugin and Conv2d transpose op converter
6 years ago
Sylwester Fraczek 8a1eeec579 add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278)
6 years ago
dzhwinter d3aed98d86
Merge pull request #14320 from wopeizl/windows/online
6 years ago
Yiqun Liu 9e6b1c5f97
Refine tester of TensorRT engine (#14390)
6 years ago
peizhilin 1a9008c420 code style fix
6 years ago
nhzlx ddb120357c Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_trt_plugin
6 years ago
peizhilin 30ddc07a7e Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yan Chunwei 9f252e0032
Combine Inference Analysis with IR (#13914)
6 years ago
nhzlx d38fd6a0fc add plugin support and offer an simple split sample
6 years ago
peizhilin ca60e1d34d Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin 52f7644f53 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Qiyang Min 698698f2fa
Merge branch 'develop' into fix_vlog
6 years ago
qingqing01 abe209234f
Exhaustive search for cuDNN conv. (#14286)
6 years ago
minqiyang 87450b9ad4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago
peizhilin 4ffa92d4f0 Merge branch 'develop' into windows/build
6 years ago
Tao Luo 813e54efbd
Merge pull request #14328 from PaddlePaddle/revert-14046-windows/debug
6 years ago
minqiyang 3db9fad764 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago
minqiyang 3da43dcae2 Because anakin do NOT use glog, so we revert anakin related change
6 years ago
Tao Luo 387610aae1
Merge pull request #14325 from luotao1/fix_test_analysis_predictor
6 years ago
peizhilin 45125ba538 fix share library issue
6 years ago
Zhaolong Xing ba8b5619a3
Revert "cherry picked windows patches."
6 years ago
minqiyang fcc0452c8b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago
Tao Luo 381bea0a16 fix test_analysis_predictor
6 years ago
minqiyang 0c3227a523 Change the origin VLOG level to 10 times
6 years ago
peizhilin 869487a2b7 Merge remote-tracking branch 'origin/develop' into windows/build
6 years ago
dzhwinter 2835e04409 merge develop branch. test=develop
6 years ago
qingqing01 db8c52da5e Revert " Exhaustive search for cuDNN conv. (#14043)"
6 years ago
qingqing01 ce7d9b0799
Exhaustive search for cuDNN conv. (#14043)
6 years ago
peizhilin 9d67c1fb69 cpu build support
6 years ago
dzhwinter 60f70b174d test=develop
6 years ago
dzhwinter cc02353d10 test=develop
6 years ago
dzhwinter eb2f7ed21b refine tests. test=develop
6 years ago
Tao Luo fe8f178582 fix word2vec related inference unit-tests (#14203)
6 years ago
dzhwinter 1ace55c8ee merge develop branch
6 years ago
Yan Chunwei 06e508ab58
fix simple_on_word2vec random fail (#14171)
6 years ago
dzhwinter 316765839d add back jit simd instructions. stage.
6 years ago
dzhwinter bf2e4cb188 cleard. staged
6 years ago
Yan Chunwei 70ce6dcd67
fix api_impl ci error (#14140)
6 years ago
dzhwinter ebfe5a02b3 merge develop branch
6 years ago
JiabinYang 7c45e77c41 test=develop
6 years ago
Xin Pan aa87a989ec
Merge pull request #14119 from Superjomn/fix/api-impl-tester
6 years ago
superjomn 5f7fda0b07 disable some tests
6 years ago
Tao Luo d3534d2b14 refine warning message
6 years ago
Tao Luo 79da263b11
Merge pull request #14032 from sfraczek/sfraczek/fix-test-multithreading-mkldnn
6 years ago
Tao Luo 4928ff32a9 fix cmake warning when ON_INFER=false
6 years ago
Yan Chunwei ee74be3a49
[1.1] Bugfix/tensorarray (#14044)
6 years ago
Qiyang Min 33b4920d2d
Merge pull request #14057 from velconia/continue_hash_op
6 years ago
minqiyang 7f7af5d412 Add xxhash deps to inference demo and trainer demo
6 years ago
minqiyang fe18adfbaa Add fluid inference support
6 years ago
dzhwinter 7141debe38 add cudnn back. staged.
6 years ago
dzhwinter 09409bad4d staged. test speed=49ms in 1080.
6 years ago
Tao Luo ea2bdd192d Merge branch 'develop' into remove_unused_code
6 years ago
Sylwester Fraczek 741cb33bd9 test multithreading
6 years ago
dzhwinter 468467f391 update real incnet tester
6 years ago
dzhwinter abe8e207c4 clean demo_ci
6 years ago
dzhwinter 597d92179b clean demo_ci
6 years ago
Tao Luo f7bbcfa913 remove unused code in paddle_inference_api.h
6 years ago
dzhwinter c6dcffc61a lb. add debug output
6 years ago
nhzlx ae8f26072d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
6 years ago
dzhwinter 607080e888 windows static library
6 years ago
Wojciech Uss 5083ec3a1b do not enable MKL-DNN twice
6 years ago
Wojciech Uss c3b70aece9 Add MKL-DNN placement pass (#13958)
6 years ago
superjomn b77e4f4978 update
6 years ago
nhzlx b970c6d5d0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
6 years ago
nhzlx 32072d31b5 fix demo ci error on manylinux
6 years ago
Tao Luo 6ea9d1b595 add analysis_predictor in vis_demo
6 years ago
Tao Luo f444a7226e Merge branch 'develop' into clean_inference_lib
6 years ago
Tao Luo 3598500773
Merge pull request #13867 from Superjomn/clean/CreatePaddlePredictor
6 years ago
Tao Luo 41eeb771e8 Merge branch 'develop' into clean_inference_lib
6 years ago
Yan Chunwei 6809238d97
fix analysis predictor profile (#13896)
6 years ago
Tao Luo a35e7f4bae adjust demo_ci with fluid_inference_install_dir
6 years ago
tensor-tang dcfb687584
Merge pull request #13846 from tensor-tang/fix/mkldnn
6 years ago
dzhwinter e41a3fcd68 fix update to develop hang problem.
6 years ago
dzhwinter 962061f0a3 windows fix
6 years ago
superjomn 28459592cc update
6 years ago
Zhaolong Xing 7413fa458f
Merge pull request #13838 from NHZlX/add_trt_pad_op
6 years ago
superjomn 049fcbe125 update
6 years ago
superjomn f5c0221c17 clean CreatePaddlePredictor
6 years ago
nhzlx 320c78e16f fix commets
6 years ago
nhzlx efa5bac7ad fix demo_ci bug in vis_demo.cc
6 years ago
tensor-tang dc5a7b906d fix default number of threads when inference with or without MKLDNN
6 years ago
nhzlx 0cb88c34be add op converter
6 years ago
nhzlx d347ea689a fix comments
6 years ago
nhzlx f569095084 add tensorrt api lib to paddle_fluid
6 years ago
dzhwinter a46e30aa6d
enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688)
6 years ago
Tao Luo d770b9bda3
Merge pull request #13663 from luotao1/resnet50_ut
6 years ago
Michal Gallus 09d9d77a8f Enable MKLDNN in Naive Executor
6 years ago
Tao Luo a89afd4c22
Merge pull request #13685 from luotao1/naive_cmake
6 years ago
luotao1 9cbf2023ab rollback paddle_inference_helper.h to helper.h
6 years ago
Tao Luo 824a82d728
Merge pull request #13672 from luotao1/gen_fluid_library
6 years ago
luotao1 d55d7e04fd update libpaddle_fluid.so with zeroCopy
6 years ago
luotao1 a989a4e7c2 refine paddle_inference_helper.h
6 years ago
tensor-tang ede4b230be
Merge pull request #13553 from jczaja/prv-fused_embedding_fc_lstm_op
6 years ago
Tao Luo cfbd71c223 reduce inference ci time
6 years ago
JiabinYang 358b386953 test=develop
6 years ago
Tao Luo 21ee30595b clean some CMakeLists
6 years ago
Yan Chunwei c8744d118d
fea/infer executor and concurrency performance issue bug fix (#13451)
6 years ago
Jacek Czaja 910cd415f2 - Disabled embedding_fc_lstm_fuse by defult and
6 years ago
Yan Chunwei 9e8d372ff4
hide attention lstm fuse (#13615)
6 years ago
nhzlx 5c57e15044 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
6 years ago
Tao Luo c07b2a97a9
Merge pull request #13521 from Sand3r-/mgallus/fix-pooling-ceiled-size
6 years ago
Michal Gallus f465b03ef9 Enable MKLDNN in Analysis Predictor
6 years ago
Yan Chunwei e426cdae32
fix inference output with lod (#13557)
6 years ago
Yan Chunwei 5de14c6b96
refine inference api (#13518)
6 years ago
dzhwinter c66a8d2cd8
add guide (#13332)
6 years ago
dzhwinter 24447ec517
flags (#13541)
6 years ago
dzhwinter 4fd5eb2255
"refine cmake" (#13546)
6 years ago
dzhwinter 97636a9fcf
"fix link error" (#13545)
6 years ago
nhzlx baae7e4f63 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
6 years ago
nhzlx 2763321684 fix comments
6 years ago
Yan Chunwei 90bc14da24
simple fix on inference tester helper (#13507)
6 years ago
nhzlx 1f6c9dbad4 fix typo
6 years ago
nhzlx 4801beb101 add arguments for trt config
7 years ago
nhzlx 94a57f1d83 add trt config to arguments
7 years ago
Tao Luo 2d89849125 add WITH_INFERENCE_API_TEST option (#13425)
7 years ago
nhzlx cc4a7661c6 merge develop
7 years ago
nhzlx d40402f9b7 add dropout and sigmoid op converter
7 years ago
dzhwinter 85f8dd1c77 debug version
7 years ago
Yan Chunwei 3725f22442 Hotfix/api predictor (#13383)
7 years ago
dzhwinter e1999538eb debug the device context
7 years ago
dzhwinter 372caf4000 windows staff
7 years ago
tensor-tang 89d09e6594
Merge branch 'develop' into fea/ut/vis
7 years ago
luotao1 20b40cb06a add multi-thread for nlp unit-tests
7 years ago
tensor-tang dd0b2036c6 add note for use mkldnn
7 years ago
tensor-tang 01f0f16884 enable mkldnn in infer api
7 years ago
dzhwinter c3e1fb5a3e add demo
7 years ago
tensor-tang 8cbb3c0720 refine lac ut and fix fetch
7 years ago
tensor-tang ca973139fe
Merge pull request #13285 from tensor-tang/refine/ut/lac
7 years ago
tensor-tang acfdbf0293 enable ner analysis test and refine lac
7 years ago
luotao1 d4c3fe9a44 clean api_anakin_engine_rnn_tester
7 years ago
tensor-tang 7eebb90523 fix conflicts
7 years ago
tensor-tang ca30127e0a fix compile error undef registrar pass
7 years ago
tensor-tang 0618077971 Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
tensor-tang 6b104c90d3 fix profile
7 years ago
Tao Luo 11b22883be
Merge pull request #12738 from luotao1/anakin_cpu
7 years ago
luotao1 61cae53e79 support anakin for only-cpu environment
7 years ago
Yan Chunwei 225ecee5ea
refine/text classification tester (#13244)
7 years ago
tensor-tang 4d774953c6 enable fc gru fuse pass
7 years ago
tensor-tang 12b483c0db Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
luotao1 35cff5e00d Merge branch 'develop' into multi-thread2
7 years ago
Yan Chunwei 9df2d8b5ba
test/add text-classification test (#13081)
7 years ago
luotao1 1a373fbb0d add result check for multi-thread UT
7 years ago
luotao1 2dc23ffaa8 Merge branch 'develop' into multi-thread2
7 years ago
luotao1 8cb92fb18e speedup the download of inference_demo
7 years ago
luotao1 459d4cc811 Merge branch 'develop' into multi-thread2
7 years ago
Tao Luo 907696709f
Merge pull request #13133 from luotao1/library
7 years ago
Jiabin Yang d091dd02a0 fix mac compile error 0903 (#13184)
7 years ago
tensor-tang d83187dba8 enable lac analysis test
7 years ago
luotao1 d7b4965785 auto generate paddle_inference_pass.h
7 years ago
luotao1 0639a32477 Merge branch 'develop' into library
7 years ago
luotao1 37d1a6685c Merge branch 'develop' into multi-thread2
7 years ago
Tao Luo 737a033ed0
Merge pull request #13140 from dzhwinter/windows/inference_api
7 years ago
dzhwinter b4d43030ff
windows inference fix (#13141)
7 years ago
tensor-tang 1e7ccf9f45
Merge pull request #13126 from tensor-tang/fea/infer/ut/lac-new
7 years ago
dzhwinter a0aa2ec8b5 build compile
7 years ago
dzhwinter 75681c0a79 switch to 9.2
7 years ago
dzhwinter bfa9b268de fix elementwise
7 years ago
dzhwinter dbe90cc0f6 merge develop branch
7 years ago
Jiabin Yang 6ba2b22279
Merge pull request #13096 from JiabinYang/fix_mac
7 years ago
luotao1 fb077c17e6 add shared library for analysis
7 years ago
tensor-tang 663a11ac7c bugfix and follow comment
7 years ago
luotao1 0fbe0a7a28 add multi-thread ut for ditu-rnn
7 years ago
luotao1 b3cd2ae88b Merge branch 'develop' into ner_ut2
7 years ago
Yan Chunwei af15f6f038
fea/refine fuse (#13076)
7 years ago
luotao1 07cb64adc0 add unit-test for chinese_ner
7 years ago