Commit Graph

937 Commits (d1901f27bcbe7e974d1e9c0d1eae59f51a79b174)

Author SHA1 Message Date
JiabinYang e3a8929cf8 little change
6 years ago
Zhaolong Xing a7ba07d7ef
Merge pull request #15504 from NHZlX/fix_conv2d_fusion
6 years ago
nhzlx 0779e35544 fix two bug:
6 years ago
nhzlx 027d24c831 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
nhzlx 9641324995 fix comments
6 years ago
nhzlx 484b3bc801 When cudnn version < 7100, there is problem with conv_fusion.
6 years ago
tensor-tang 5c68dee798 fix debug compile of analysis pass fail
6 years ago
luotao1 353b5f06a7 refine analyzer_bert_test to pass the ci
6 years ago
nhzlx e6218c1d7b change the input to a smaller value
6 years ago
fuchang01 4a33a44f45 analyzer bert tester
6 years ago
nhzlx 5b92ddabe2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_stream_bug
6 years ago
nhzlx 2f4aee361a fix comments
6 years ago
nhzlx ec213730bc fix trt stream bug.
6 years ago
flame d60751fb71
add python inference api (#15248)
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
Tao Luo 8f522c15ed
Merge pull request #15408 from luotao1/mm_dnn
6 years ago
Tao Luo 001827c270 test_analyzer_mm_dnn runs in serial
6 years ago
Tao Luo 140fc1e92c
Merge pull request #15392 from luotao1/pyramid_dnn
6 years ago
Yan Chunwei c9e5aa19c1
get tensor API add more comments (#15345)
6 years ago
Yan Chunwei e84234b551
make clone thread safe (#15363)
6 years ago
Tao Luo 668563088e add pyramid_dnn c++ inference test
6 years ago
Zhaolong Xing 236201c222
Merge pull request #15350 from NHZlX/fix_bug_for_precditor
6 years ago
nhzlx 8817841c73 fix unit test bug
6 years ago
Yan Chunwei e07900d317
cache tensor ptr in ZeroCopyTensor (#15352)
6 years ago
Yan Chunwei b7916440ff
hot fix the Native clone (#15344)
6 years ago
Xin Pan 3ecf6bb338
Merge pull request #15028 from yihuaxu/develop_641313ea7_elementwise_mul_mkldnn_bug_fix
6 years ago
nhzlx b95f2ff8fe fix win build bug
6 years ago
nhzlx b938324381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
nhzlx 312fe0ece1 add trt int8 calibration support
6 years ago
Yiqun Liu 568cc2ffa8
Optimize while_op for test (#14764)
6 years ago
nhzlx b2ba3471fd fix analysis config bug.
6 years ago
tensor-tang a7fc3d42a0
Merge pull request #15304 from tensor-tang/fuse/second_order_mul_sub
6 years ago
bingyanghuang a152a5c731 Disable conv3d mkldnn in dam (#15335)
6 years ago
tensor-tang 1a95cd227d disable seqpool test on mac or without mkl
6 years ago
Tao Luo 2411ed4286 fix multi-threads in ZeroCopyProfile
6 years ago
tensor-tang 84b0ecdcce Merge remote-tracking branch 'ups/develop' into fuse/second_order_mul_sub
6 years ago
tensor-tang 7035f051a8 adjust acc on mac
6 years ago
Tao Luo e33427da0d
Merge pull request #15280 from luotao1/random_test
6 years ago
tensor-tang d618e48309 fix fuse square mat order and refine test
6 years ago
tensor-tang a5d2a6d1ad add fuse pass of sequared mat sub fusion
6 years ago
tensor-tang 84e023eae5 adjust the acc since the refer result is too large
6 years ago
tensor-tang 4461a458a5 adjust diff since abs is too large
6 years ago
tensor-tang ca6fdc6e33 refine and fix test
6 years ago
tensor-tang a89296ac1f add repeated fc relu pass
6 years ago
tensor-tang 781cd0cf51 add multi threads test of seqpool test (#15293)
6 years ago
Tao Luo cbd1c7c01f fix CompareDeterministic error when test_all_data
6 years ago
Zhaolong Xing 98e85f3735 add_transpose_flatten_concat_fuse (#15121)
6 years ago
wopeizl 5d9edb4124
Merge pull request #15156 from wopeizl/windows/fixgpuissue
6 years ago
tensor-tang 146e942c65
Merge pull request #15250 from tensor-tang/refine/seqpool/feed
6 years ago
peizhilin 439691f5bd adjust the shlwapi on windows
6 years ago
tensor-tang 96786d3716 add compare_determine of seqpool1 test
6 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
6 years ago
peizhilin e239558e56 remove the dismatch enclosure to avoid warning message test=develop
6 years ago
Tao Luo 7d13d20769
Merge pull request #15245 from luotao1/rnn1_multi_thread
6 years ago
Tao Luo 2b11c710b3
Merge pull request #15249 from NHZlX/fix_trt_demo_ci
6 years ago
tensor-tang 54afcb7ec6 add compare zerocopy test with native result
6 years ago
tensor-tang 137060135e fix zerocopy size
6 years ago
tensor-tang 7461356723 add zerocopy for seqpool test
6 years ago
tensor-tang 48410b9bfe
Merge pull request #15237 from tensor-tang/fuse/seqpool_concat_2
6 years ago
nhzlx e7d83389e6 fix demo ci bug
6 years ago
Tao Luo 9b41e45584
Merge pull request #15222 from luotao1/native_config
6 years ago
Tao Luo d43983b61d reduce threads number to avoid hang in CI
6 years ago
nhzlx c1264e99f3 fix win error
6 years ago
nhzlx 4e3522e5b4 add trt int8 support
6 years ago
tensor-tang f8c305b243 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat_2
6 years ago
Tao Luo 197d0f2431 fix trt_model_tester to pass the ci
6 years ago
Tao Luo feee78d9f0
Merge pull request #15214 from tensor-tang/fix/debug
6 years ago
Tao Luo 71d9097a89 fix analyzer_test runs error in native_config
6 years ago
Tao Luo 9c02765158
Merge pull request #15210 from Superjomn/fix/analysis_tester_bug
6 years ago
tensor-tang 72d2a1801e add seqpool concat fuse pass
6 years ago
tensor-tang 69fd3fdb52 fix debug build error
6 years ago
superjomn 23bdd0a223 fix analysis_tester bug
6 years ago
Yan Chunwei d09d6eadc0
make inference api work with Doxygen (#15195)
6 years ago
Tao Luo 6ca9a4810b
Merge pull request #15196 from luotao1/serial
6 years ago
Yan Chunwei 6ccf8685f7
refactor tensorrt node teller (#15181)
6 years ago
Tao Luo 7dc0181c46 run analyzer_tester serial in multi-thread
6 years ago
Yan Chunwei 875a07c32d
refactor inference analysis api (#14634)
6 years ago
tensor-tang 516fe301ee add comment in case of empty name
6 years ago
tensor-tang b9c645639b workaround with third party cache
6 years ago
tensor-tang dca68cdf97 throw error when name not find
6 years ago
tensor-tang 484085693e update url and num_ops
6 years ago
tensor-tang cd94df8679 fix load and refine
6 years ago
tensor-tang 8e271896ae add test data for seqpool1
6 years ago
Zhaolong Xing 4048cfa9da
Merge pull request #15048 from NHZlX/add_affine_channel_fuse
6 years ago
Zeng Jinle c0bcff00dc
Merge pull request #14962 from sneaxiy/rewrite_variable_type
6 years ago
Tao Luo 85471533e0
Merge pull request #15079 from luotao1/analysis_test
6 years ago
wopeizl 719ebe3786
Merge pull request #15070 from wopeizl/windows/testcasefix
6 years ago
Qiyang Min 0238a3bb4f
Merge pull request #14972 from velconia/accelerate_lstm
6 years ago
sneaxiy c4ce2e7b21 merge develop, solve conflict
6 years ago
Tao Luo ecae157edf simplify some data record in analyzer_tester
6 years ago
Tao Luo 05f1b65da3 simplify prepere_input in analyzer_test
6 years ago
nhzlx 02e17396c2 fix comments
6 years ago
nhzlx 71636e677d add min_subgraph_size attr to tensorrt config
6 years ago
peizhilin 01c00b07dd fix test issues on windows
6 years ago
nhzlx a6aa8ea771 faster rcnn input is presistable. (fix it in paddle-trt)
6 years ago
sneaxiy dde3afe7b7 Merge develop
6 years ago
Yihua Xu 0b0acfaa88 Add mkldnn item for porfile and compare usage.
6 years ago
tensor-tang d46a140dd9 add seq pool inference test
6 years ago
tensor-tang d4931a2abc support more input fake data
6 years ago
nhzlx 73b47df1f4 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse
6 years ago