Commit Graph

1093 Commits (2246f7c133e3dc3cfd9f2779fd2f4cc2778c7ea7)

Author SHA1 Message Date
Zhaolong Xing 97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Jacek Czaja 4aa7ef3c13 - Compensation fix to LRN MKL-DNN op
6 years ago
nhzlx b43ea40c51 delete the usage of the const_cast
6 years ago
Yan Chunwei e2818c8608
add dynamic memory optim (#15457)
6 years ago
nhzlx 92cf4a4c6b fix comments
6 years ago
JiabinYang 1bf2facecb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/imperative
6 years ago
JiabinYang e3a8929cf8 little change
6 years ago
Zhaolong Xing a7ba07d7ef
Merge pull request #15504 from NHZlX/fix_conv2d_fusion
6 years ago
nhzlx 0779e35544 fix two bug:
6 years ago
nhzlx 027d24c831 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
nhzlx 9641324995 fix comments
6 years ago
nhzlx 484b3bc801 When cudnn version < 7100, there is problem with conv_fusion.
6 years ago
tensor-tang 5c68dee798 fix debug compile of analysis pass fail
6 years ago
luotao1 353b5f06a7 refine analyzer_bert_test to pass the ci
6 years ago
nhzlx e6218c1d7b change the input to a smaller value
6 years ago
fuchang01 4a33a44f45 analyzer bert tester
6 years ago
nhzlx 5b92ddabe2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_stream_bug
6 years ago
nhzlx 2f4aee361a fix comments
6 years ago
nhzlx ec213730bc fix trt stream bug.
6 years ago
flame d60751fb71
add python inference api (#15248)
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
Tao Luo 8f522c15ed
Merge pull request #15408 from luotao1/mm_dnn
6 years ago
Tao Luo 001827c270 test_analyzer_mm_dnn runs in serial
6 years ago
Tao Luo 140fc1e92c
Merge pull request #15392 from luotao1/pyramid_dnn
6 years ago
Yan Chunwei c9e5aa19c1
get tensor API add more comments (#15345)
6 years ago
Yan Chunwei e84234b551
make clone thread safe (#15363)
6 years ago
Tao Luo 668563088e add pyramid_dnn c++ inference test
6 years ago
Zhaolong Xing 236201c222
Merge pull request #15350 from NHZlX/fix_bug_for_precditor
6 years ago
nhzlx 8817841c73 fix unit test bug
6 years ago
Yan Chunwei e07900d317
cache tensor ptr in ZeroCopyTensor (#15352)
6 years ago
Yan Chunwei b7916440ff
hot fix the Native clone (#15344)
6 years ago
Xin Pan 3ecf6bb338
Merge pull request #15028 from yihuaxu/develop_641313ea7_elementwise_mul_mkldnn_bug_fix
6 years ago
nhzlx b95f2ff8fe fix win build bug
6 years ago
nhzlx b938324381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
nhzlx 312fe0ece1 add trt int8 calibration support
6 years ago
Yiqun Liu 568cc2ffa8
Optimize while_op for test (#14764)
6 years ago
nhzlx b2ba3471fd fix analysis config bug.
6 years ago
tensor-tang a7fc3d42a0
Merge pull request #15304 from tensor-tang/fuse/second_order_mul_sub
6 years ago
bingyanghuang a152a5c731 Disable conv3d mkldnn in dam (#15335)
6 years ago
tensor-tang 1a95cd227d disable seqpool test on mac or without mkl
6 years ago
Tao Luo 2411ed4286 fix multi-threads in ZeroCopyProfile
6 years ago
tensor-tang 84b0ecdcce Merge remote-tracking branch 'ups/develop' into fuse/second_order_mul_sub
6 years ago
tensor-tang 7035f051a8 adjust acc on mac
6 years ago
Tao Luo e33427da0d
Merge pull request #15280 from luotao1/random_test
6 years ago
tensor-tang d618e48309 fix fuse square mat order and refine test
6 years ago
tensor-tang a5d2a6d1ad add fuse pass of sequared mat sub fusion
6 years ago
tensor-tang 84e023eae5 adjust the acc since the refer result is too large
6 years ago
tensor-tang 4461a458a5 adjust diff since abs is too large
6 years ago
tensor-tang ca6fdc6e33 refine and fix test
6 years ago
tensor-tang a89296ac1f add repeated fc relu pass
6 years ago
tensor-tang 781cd0cf51 add multi threads test of seqpool test (#15293)
6 years ago
Tao Luo cbd1c7c01f fix CompareDeterministic error when test_all_data
6 years ago
Zhaolong Xing 98e85f3735 add_transpose_flatten_concat_fuse (#15121)
6 years ago
wopeizl 5d9edb4124
Merge pull request #15156 from wopeizl/windows/fixgpuissue
6 years ago
tensor-tang 146e942c65
Merge pull request #15250 from tensor-tang/refine/seqpool/feed
6 years ago
peizhilin 439691f5bd adjust the shlwapi on windows
6 years ago
tensor-tang 96786d3716 add compare_determine of seqpool1 test
6 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
6 years ago
peizhilin e239558e56 remove the dismatch enclosure to avoid warning message test=develop
6 years ago
Tao Luo 7d13d20769
Merge pull request #15245 from luotao1/rnn1_multi_thread
6 years ago
Tao Luo 2b11c710b3
Merge pull request #15249 from NHZlX/fix_trt_demo_ci
6 years ago
tensor-tang 54afcb7ec6 add compare zerocopy test with native result
6 years ago
tensor-tang 137060135e fix zerocopy size
6 years ago
tensor-tang 7461356723 add zerocopy for seqpool test
6 years ago
tensor-tang 48410b9bfe
Merge pull request #15237 from tensor-tang/fuse/seqpool_concat_2
6 years ago
nhzlx e7d83389e6 fix demo ci bug
6 years ago
Tao Luo 9b41e45584
Merge pull request #15222 from luotao1/native_config
6 years ago
Tao Luo d43983b61d reduce threads number to avoid hang in CI
6 years ago
nhzlx c1264e99f3 fix win error
6 years ago
nhzlx 4e3522e5b4 add trt int8 support
6 years ago
tensor-tang f8c305b243 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat_2
6 years ago
Tao Luo 197d0f2431 fix trt_model_tester to pass the ci
6 years ago
Tao Luo feee78d9f0
Merge pull request #15214 from tensor-tang/fix/debug
6 years ago
Tao Luo 71d9097a89 fix analyzer_test runs error in native_config
6 years ago
Tao Luo 9c02765158
Merge pull request #15210 from Superjomn/fix/analysis_tester_bug
6 years ago
tensor-tang 72d2a1801e add seqpool concat fuse pass
6 years ago
tensor-tang 69fd3fdb52 fix debug build error
6 years ago
superjomn 23bdd0a223 fix analysis_tester bug
6 years ago
Yan Chunwei d09d6eadc0
make inference api work with Doxygen (#15195)
6 years ago
Tao Luo 6ca9a4810b
Merge pull request #15196 from luotao1/serial
6 years ago
Yan Chunwei 6ccf8685f7
refactor tensorrt node teller (#15181)
6 years ago
Tao Luo 7dc0181c46 run analyzer_tester serial in multi-thread
6 years ago
Yan Chunwei 875a07c32d
refactor inference analysis api (#14634)
6 years ago
tensor-tang 516fe301ee add comment in case of empty name
6 years ago
tensor-tang b9c645639b workaround with third party cache
6 years ago
tensor-tang dca68cdf97 throw error when name not find
6 years ago
tensor-tang 484085693e update url and num_ops
6 years ago
tensor-tang cd94df8679 fix load and refine
6 years ago
tensor-tang 8e271896ae add test data for seqpool1
6 years ago
Zhaolong Xing 4048cfa9da
Merge pull request #15048 from NHZlX/add_affine_channel_fuse
6 years ago
Zeng Jinle c0bcff00dc
Merge pull request #14962 from sneaxiy/rewrite_variable_type
6 years ago
Tao Luo 85471533e0
Merge pull request #15079 from luotao1/analysis_test
6 years ago
wopeizl 719ebe3786
Merge pull request #15070 from wopeizl/windows/testcasefix
6 years ago
Qiyang Min 0238a3bb4f
Merge pull request #14972 from velconia/accelerate_lstm
6 years ago
sneaxiy c4ce2e7b21 merge develop, solve conflict
6 years ago
Tao Luo ecae157edf simplify some data record in analyzer_tester
6 years ago
Tao Luo 05f1b65da3 simplify prepere_input in analyzer_test
6 years ago
nhzlx 02e17396c2 fix comments
6 years ago
nhzlx 71636e677d add min_subgraph_size attr to tensorrt config
6 years ago
peizhilin 01c00b07dd fix test issues on windows
6 years ago
nhzlx a6aa8ea771 faster rcnn input is presistable. (fix it in paddle-trt)
6 years ago
sneaxiy dde3afe7b7 Merge develop
6 years ago
Yihua Xu 0b0acfaa88 Add mkldnn item for porfile and compare usage.
6 years ago
tensor-tang d46a140dd9 add seq pool inference test
6 years ago
tensor-tang d4931a2abc support more input fake data
6 years ago
nhzlx 73b47df1f4 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse
6 years ago
nhzlx ce3782c193 add affine_channel fuse.
6 years ago
Tao Luo 91408e3122 fix analyzer_mm_dnn_tester fails when bs > 1
6 years ago
Tao Luo f01c966800 Merge branch 'develop' into mm_dnn
6 years ago
qingqing01 51a9fca323
Async memory copy (#15013)
6 years ago
minqiyang b1d0a14c14 Change the ut back
6 years ago
minqiyang 7d1533216d Fix syntax error in unit test
6 years ago
Tao Luo 22c71398e3 add MM_DNN inference test
6 years ago
peizhilin 9e60c58666 Merge remote-tracking branch 'upstream/develop' into windows/mkl
6 years ago
luotao1 13367866cd add deterministic result unit-test
6 years ago
sneaxiy ae6f46a1a9 rewrite variable type
6 years ago
peizhilin 07c7eaabb4 Merge remote-tracking branch 'upstream/develop' into windows/mkl
6 years ago
Tao Luo 6aa6b8cfa0
Merge pull request #14918 from luotao1/mobilenet_test
6 years ago
Zhaolong Xing a9fb34fad8
Merge pull request #14903 from NHZlX/add_conv_elementwise_pass
6 years ago
Tao Luo 2f55a04ec6 add refer result comparasion
6 years ago
peizhilin 5a6d7fe2ff add mkl,ctc support for windows
6 years ago
wopeizl 0f085f0a5a
Merge pull request #14892 from wopeizl/windows/port3
6 years ago
nhzlx 050a68dde3 fix comments
6 years ago
Tao Luo 1a6d2cfe39 add test_analyzer_mobilenet
6 years ago
nhzlx fcc93d96d5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_conv_elementwise_pass
6 years ago
Yu Yang bacf1d2399 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
nhzlx 4e4a777243 add conv+elementwiseadd pass
6 years ago
nhzlx 050e118f3c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_thread_bug
6 years ago
nhzlx 96216052d5 1. fix trt multi thread bug
6 years ago
Yan Chunwei a985949be9
Fea/fuse conv elementwise add fuse (#14669)
6 years ago
Yu Yang 04a570b463 Fix ut
6 years ago
peizhilin 23dec78772 fix script issue
6 years ago
Yu Yang aa38fc4ce5 Fix compile
6 years ago
Yu Yang 194e66f785 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
Tao Luo 322bb8d5c5
Merge pull request #14825 from NHZlX/add_benchmark_for_trt
6 years ago
flame dd3aca3b96
Merge pull request #14824 from Superjomn/fix/visualizer
6 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
6 years ago
nhzlx 644c13a387 fix compile error
6 years ago
nhzlx a5bfed3776 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_benchmark_for_trt
6 years ago
nhzlx afc51e6f82 add benchmark for trt
6 years ago
Zhaolong Xing bc6d0a3427
Merge pull request #14762 from NHZlX/fix_bug_of_trt_pool
6 years ago
superjomn edd1f5a92b fix visualizer
6 years ago
nhzlx 019e8bbed2 fix comments test=develop
6 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
6 years ago
Yihua Xu 3821fc3950 Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
6 years ago
Tao Luo cf66133857
Merge pull request #14734 from luotao1/memory_load
6 years ago
Tao Luo 743cb840f1 update with comments
6 years ago
flame f6a877bc57 add tool to visualize inference model (#14621)
6 years ago
Tao Luo 42359e88a4 clean code
6 years ago
Tao Luo 923b18877e Merge branch 'develop' into memory_load
6 years ago
Tao Luo 405b2486db support loading from memory
6 years ago
Houjiang Chen c6b39a0099
Merge pull request #14714 from NHZlX/add_prelu_gpu
6 years ago
nhzlx 722b0a805f fix bug of trt pool
6 years ago
Xin Pan 0591ba96ec fix hack
6 years ago
nhzlx e7abe6b654 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_prelu_gpu
6 years ago
nhzlx f75815b78c add prelu gpu inference
6 years ago
Xin Pan 7e0801d4ed
Merge pull request #14441 from baojun-nervana/intel/ngraph_op
6 years ago
Yihua Xu 82eefceabe Add the profile_mkldnn flag for profile function(test=develop)
6 years ago
Yihua Xu 64e261c6cd Implement the fusion of convolution and bias for mkldnn
6 years ago
Tao Luo 2af5762cf8
Merge pull request #14668 from wzzju/use_small_dam
6 years ago
ZhenWang 6e48e47406 test=develop
6 years ago
ZhenWang e1da6cd754 add the normal dam and the small dam
6 years ago
ZhenWang d5947b0ed7 test=develop
6 years ago
ZhenWang 33b4963505 unify the normal and small dam model.
6 years ago
Yan Chunwei 4b7617740e
fix container not cleared (#14231)
6 years ago
ZhenWang 8f2e556e65 support the small dam model. test=develop
6 years ago
nhzlx 49c28b8c52 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
6 years ago
nhzlx 3c83a2f720 fix comments
6 years ago
Sang Ik Lee 24e70920db Refactor some build settings.
6 years ago
Sang Ik Lee d6125a5eec Include ngraph in inference demo build.
6 years ago
Tao Luo b4de023ee1
Merge pull request #14636 from Superjomn/fix/word2vec
6 years ago
nhzlx d3e140a572 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
6 years ago
nhzlx d666c8eb1d fix benchmark
6 years ago
nhzlx 900fbb83f9 add params sync pass
6 years ago
superjomn 9c665c81ae update
6 years ago
minqiyang a02ce58f2c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
6 years ago
Tao Luo e8ef14d2a7
Merge pull request #14610 from Superjomn/revert/cache_fix
6 years ago
Yiqun Liu 726f2cefe3
Fix bug of referencing a temporary variable. (#14614)
6 years ago
peizhilin 38715e6fd0 minor fix
6 years ago
superjomn 4babc6b06c update
6 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
6 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
6 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
qingqing01 39ec80def4
Remove the memory copy of feeding data in C++ inference API (#14577)
6 years ago
peizhilin 1afa9492af Recover the profiler
6 years ago
Yiqun Liu bf222f197d
Use sub scope in tensor_array_to_tensor op. (#14524)
6 years ago
dzhwinter 840c1b29ad
test=develop (#14562)
6 years ago
Yan Chunwei 923c8e3332
add benchmark for inference (#14571)
6 years ago
Tao Luo e90afec47b
Merge pull request #14543 from luotao1/threads
6 years ago
Zhaolong Xing e52d90a35e
Merge pull request #14527 from hjchen2/develop
6 years ago
luotao1 116979a40a refine api name
6 years ago
luotao1 e66b4c6bff adjust tester_helper to make multi-instance multi-thread work
6 years ago
luotao1 a5c4b463c9 add SetMKLDNNThreadId api
6 years ago
luotao1 e21edb26f6 add Set/GetCPUNumThreads api
6 years ago
peizhilin 7c8c9dc9bf fix unit test cases
6 years ago
hjchen2 1adda8e06c Add more unit tests for split plugin
6 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
6 years ago
peizhilin bef475c92b Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
hjchen2 6eba5bd276 Fix direct copy and refine split ut
6 years ago
Qiao Longfei fd290c2580 fix mac compile of analysis
6 years ago
hjchen2 5857fb3014 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
6 years ago
hjchen2 3e3599f3d9 Refine split tensorrt plugin
6 years ago
peizhilin f10e196fc8 fix build issue
6 years ago
Zhaolong Xing ad349e770f
Merge pull request #14452 from NHZlX/fix_avg_pool_trt_bug
6 years ago
peizhilin 6e66fadb95 clean up the pre-definitions on windows
6 years ago
Tao Luo 1d9b2a453c
Merge pull request #14508 from luotao1/warm_up_multi_thread
6 years ago
nhzlx e62872df8b fix conflicts
6 years ago
nhzlx a4dc1d4292 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
6 years ago
nhzlx faeb9b8aa9 fix compile rely problem
6 years ago
Tao Luo eb9b9becdc add warm up in TestMultiThreadPrediction
6 years ago
Tao Luo 5cc7946313
Merge pull request #14499 from luotao1/disable_openblas_test
6 years ago
nhzlx 2a84054372 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
6 years ago
nhzlx b742d46520 fix demo ci bug on trt
6 years ago
Houjiang Chen 33c65517fd Update CMakeLists.txt test=develop
6 years ago
Houjiang Chen 01bda73116
Update CMakeLists.txt
6 years ago
Tao Luo 09ee266f8e disable two openblas test temporary
6 years ago
hjchen2 2c2a192eb1 Resolve merge conflicts
6 years ago
Yiqun Liu 8bc1c5d2ab
Implement the Tensorrt plugin for elementwise op (#14487)
6 years ago
hjchen2 1622cb9937 Fix alpha tensor key
6 years ago
hjchen2 a8c077df7c Implement leaky relu tensorRT converter
6 years ago
hjchen2 2825685f2a Fix tensorrt plugin cmake dependency, test=develop
6 years ago
Superjomn e878a8e885 update
6 years ago
superjomn 4bf6817cbc fix gpu load model
6 years ago
Wu Yi a2d9b34417
Refine operator cmake (#14413)
6 years ago
nhzlx 8f9a8c455a delete unused test code.
6 years ago
nhzlx 83f8c403a7 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into fix_avg_pool_trt_bug
6 years ago
nhzlx b969116988 fxi avg pool trt bug and fix cpplint
6 years ago
Zhaolong Xing 2f27c048cc
Merge pull request #14440 from hjchen2/develop
6 years ago
hjchen2 6a7b995737 Refine commit message to enable ci, test=develop
6 years ago
hjchen2 413f5948b2 Fix code style
6 years ago
hjchen2 21f33b4274 Complete PRelu plugin and Conv2d transpose op converter
6 years ago
Sylwester Fraczek 8a1eeec579 add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278)
6 years ago
Tao Luo 9d29ebc010
Merge pull request #14306 from sfraczek/sfraczek/test-analyzer-mobilenet
6 years ago
Sylwester Fraczek d318583eb5 rename mobilenet dir to mobilenet_depthwise_conv
6 years ago
Tao Luo 1d867805b0 rollback analyzer_seq_conv1_tester
6 years ago
Tao Luo 5ef123c778 Merge branch 'develop' into dam_fc
6 years ago
dzhwinter d3aed98d86
Merge pull request #14320 from wopeizl/windows/online
6 years ago
Yiqun Liu 9e6b1c5f97
Refine tester of TensorRT engine (#14390)
6 years ago
peizhilin 0ef2a37c0e merge from develop
6 years ago
nhzlx 15bdb7ef14 delete error uploaded files
6 years ago
Sylwester Fraczek 2412c27c2b
Merge branch 'develop' into sfraczek/test-analyzer-mobilenet
6 years ago
peizhilin 1a9008c420 code style fix
6 years ago
Tao Luo e0d4e04bdd fix some compiler warning
6 years ago
Tao Luo 8ea13e336a add in_num_col_dims for fc
6 years ago
nhzlx ddb120357c Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_trt_plugin
6 years ago
peizhilin 447bf7c80b test=develop
6 years ago
peizhilin 30ddc07a7e Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yan Chunwei 9f252e0032
Combine Inference Analysis with IR (#13914)
6 years ago
nhzlx 0b96268057 fix comments
6 years ago
nhzlx e5bf8616f0 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_trt_plugin
6 years ago