Commit Graph

62 Commits (ae0f88a988a1b1e53168a6108484d0cfdcb58003)

Author SHA1 Message Date
Shang Zhizhou ae0f88a988
add DLA support:C++&&Python api (#30165)
4 years ago
Shang Zhizhou 05b27695f1
add inference api: DisableTensorRtOps (#30109)
4 years ago
Shang Zhizhou ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
5 years ago
joanna.wozna.intel ddcd1b5381
Add bfloat16 resnet50 test (#27755)
5 years ago
Wilber dae62556cb
Enhance infer error info message (#26731)
5 years ago
Wilber 1b84c0bf43
Lite subgraph refine predictor (#27167)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
6 years ago
Zhaolong Xing e89b12884a
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
Tao Luo 076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
Zhaolong Xing ae576f3c68
fix: when use the load model from memory mode, the RAM occupy is high (#17788)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
石晓伟 a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 (#17156)
6 years ago
Shixiaowei02 bddb2cd315 resolve conflicts with the develop branch test=develop
6 years ago
nhzlx 953bdde058 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
Wojciech Uss 46677fb080 Move cpu_quantize_* passes into mkldnn subfolder
6 years ago
nhzlx 3df7b98a0f Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
Wojciech Uss cbe2dbf0db Add enabling quantization (#16326)
6 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx c79f06d3d8 cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178
6 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
Wojciech Uss 2579ade45f Add cpu_quantize_pass for C-API quantization (#16127)
6 years ago
nhzlx a9ed427749 cant not pass ci
6 years ago
nhzlx 0ed63b2108 6. delete useless predictor id
6 years ago
nhzlx 2070fb246d 4. do the trt_engine optim during init.
6 years ago
Yan Chunwei 655179089f
AnalysisConfig remove contrib namespace (#15540)
6 years ago
Zhaolong Xing 97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Yan Chunwei e2818c8608
add dynamic memory optim (#15457)
6 years ago
nhzlx 92cf4a4c6b fix comments
6 years ago
nhzlx 027d24c831 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
nhzlx 312fe0ece1 add trt int8 calibration support
6 years ago
nhzlx 4e3522e5b4 add trt int8 support
7 years ago
Yan Chunwei 6ccf8685f7
refactor tensorrt node teller (#15181)
7 years ago
nhzlx 71636e677d add min_subgraph_size attr to tensorrt config
7 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
7 years ago
Tao Luo 743cb840f1 update with comments
7 years ago
Tao Luo 405b2486db support loading from memory
7 years ago
superjomn 4bf6817cbc fix gpu load model
7 years ago
Yan Chunwei 9f252e0032
Combine Inference Analysis with IR (#13914)
7 years ago
minqiyang 87450b9ad4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
7 years ago
Zhaolong Xing ba8b5619a3
Revert "cherry picked windows patches."
7 years ago
minqiyang fcc0452c8b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
7 years ago