Commit Graph

75 Commits (b593d588aa418d47831a6e19653c32b4be9d3c5f)

Author SHA1 Message Date
Wilber b593d588aa
[Inference] EnableUseGpu has higher priority than flags (#29697)
5 years ago
Wilber d68af02c04
fix analysis_config bug. (#29304)
5 years ago
Wilber a22ea652cf
fix trt delete_pass bug. (#28763)
5 years ago
Shang Zhizhou 8699f38d08
裁剪transformer模型trt支持;修复tensorRT不支持DeletePass的bug (#28517)
5 years ago
Shang Zhizhou ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
5 years ago
Wilber f935ca8a50
[lite-xpu-subgraph] Fix xpu compile and test xpu ci. (#27932)
5 years ago
joanna.wozna.intel a21b57109c
Add AVX512 instruction check for C-API (#28087)
5 years ago
joanna.wozna.intel ddcd1b5381
Add bfloat16 resnet50 test (#27755)
5 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
5 years ago
Wilber dae62556cb
Enhance infer error info message (#26731)
5 years ago
joanna.wozna.intel 0627a319b0
Restore "Add mkldnn bfloat16 option to C-API " (#26882)
5 years ago
石晓伟 ced6e87eee
Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854)
5 years ago
Wilber 68e0560c2f
refine paddle inference api (#26774)
5 years ago
joanna.wozna.intel 02083bda40
Add mkldnn bfloat16 option to C-API (#26676)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Pei Yang b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097)
5 years ago
石晓伟 17ac6e2580
update the analysis predictor for multi-stream support, test=develop (#24046)
5 years ago
Pei Yang 695a53c874
remove conv_bn_fuse_pass and fc_fuse_pass in trt int8 calibration. test=develop (#23805)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
Michał Gallus 253e664275 Disable memory opt pass when DNNL is on (#21826)
6 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
6 years ago
Zhaolong Xing e89b12884a
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)
6 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
6 years ago
Yiqun Liu c5548178b0
A a pass to enable the use of cudnn (#19346)
6 years ago
Tao Luo 076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
石晓伟 d008260fa8
update the initialization of anakin subgraph (#17880)
6 years ago
Zhaolong Xing ae576f3c68
fix: when use the load model from memory mode, the RAM occupy is high (#17788)
6 years ago
翟飞跃 993c703bcc INT8 MKL-DNN v2 integrate to slim (#17634)
6 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
Tao Luo 32da5e9c3d
remove unused expected_kernel_cache_pass (#17486)
6 years ago
石晓伟 a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 (#17156)
6 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
Wojciech Uss ec2750b3c2 fix repeating passes (#16606)
6 years ago
石晓伟 5dea0bdd1b
Merge pull request #16498 from Shixiaowei02/feature/anakin-engine
6 years ago
Shixiaowei02 bddb2cd315 resolve conflicts with the develop branch test=develop
6 years ago
Wojciech Uss 09dfc7a2aa
C-API quantization core 2 (#16396)
6 years ago
luotao1 056599a738 add expected_kernel_cache_pass
6 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx c79f06d3d8 cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178
6 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
luotao1 a275fd6e0c Merge branch 'develop' into runtime_context
6 years ago