Commit Graph

50 Commits (13b03e7ad6e9d555b972cca8e5e84ee0d4b61c99)

Author SHA1 Message Date
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Zhaolong Xing 6a23850a3f
add init value to varis in analysis config. (#23442)
5 years ago
Pei Yang 7e439780d9
add full paddle_analysis_config.h APIs. (#23215)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Pei Yang 46b8d282dc
Add some inference API comments for AnalysisConfig (#23117)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
5 years ago
Zhaolong Xing e89b12884a
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)
5 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
5 years ago
Yiqun Liu c5548178b0
A a pass to enable the use of cudnn (#19346)
6 years ago
Zhaolong Xing 61238d31f7
Trt fp16 support (#18860)
6 years ago
Tao Luo 076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
Zhaolong Xing ae576f3c68
fix: when use the load model from memory mode, the RAM occupy is high (#17788)
6 years ago
翟飞跃 993c703bcc INT8 MKL-DNN v2 integrate to slim (#17634)
6 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
石晓伟 a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 (#17156)
6 years ago
石晓伟 5dea0bdd1b
Merge pull request #16498 from Shixiaowei02/feature/anakin-engine
6 years ago
Shixiaowei02 bddb2cd315 resolve conflicts with the develop branch test=develop
6 years ago
Wojciech Uss 09dfc7a2aa
C-API quantization core 2 (#16396)
6 years ago
nhzlx 07dcf2856c git cherry-pick from feature/anakin-engine: update anakin subgraph #16278
6 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx c79f06d3d8 cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178
6 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
luotao1 5ecdc49c6b set enable_runtime_context_cache_ default false
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
nhzlx a9ed427749 cant not pass ci
6 years ago
Sylwester Fraczek 543e53db05 fix typo releated->related
6 years ago
Yan Chunwei e887d71958
fix ir debug config (#15571)
6 years ago
Yan Chunwei 655179089f
AnalysisConfig remove contrib namespace (#15540)
6 years ago
Zhaolong Xing 97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Yan Chunwei e2818c8608
add dynamic memory optim (#15457)
6 years ago
nhzlx 027d24c831 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
nhzlx 312fe0ece1 add trt int8 calibration support
6 years ago
nhzlx 4e3522e5b4 add trt int8 support
6 years ago
Yan Chunwei d09d6eadc0
make inference api work with Doxygen (#15195)
6 years ago
Yan Chunwei 875a07c32d
refactor inference analysis api (#14634)
6 years ago
nhzlx 71636e677d add min_subgraph_size attr to tensorrt config
6 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
6 years ago
Tao Luo 743cb840f1 update with comments
6 years ago
Tao Luo 405b2486db support loading from memory
6 years ago
luotao1 a5c4b463c9 add SetMKLDNNThreadId api
6 years ago
Yiqun Liu 9e6b1c5f97
Refine tester of TensorRT engine (#14390)
6 years ago
Yan Chunwei 9f252e0032
Combine Inference Analysis with IR (#13914)
6 years ago