Commit Graph

33 Commits (5efe8c728706056f847a7449d712a460131794b1)

Author SHA1 Message Date
翟飞跃 993c703bcc INT8 MKL-DNN v2 integrate to slim (#17634)
6 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
石晓伟 a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 (#17156)
6 years ago
石晓伟 5dea0bdd1b
Merge pull request #16498 from Shixiaowei02/feature/anakin-engine
6 years ago
Shixiaowei02 bddb2cd315 resolve conflicts with the develop branch test=develop
6 years ago
Wojciech Uss 09dfc7a2aa
C-API quantization core 2 (#16396)
6 years ago
nhzlx 07dcf2856c git cherry-pick from feature/anakin-engine: update anakin subgraph #16278
6 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx c79f06d3d8 cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178
6 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
luotao1 5ecdc49c6b set enable_runtime_context_cache_ default false
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
nhzlx a9ed427749 cant not pass ci
6 years ago
Sylwester Fraczek 543e53db05 fix typo releated->related
6 years ago
Yan Chunwei e887d71958
fix ir debug config (#15571)
6 years ago
Yan Chunwei 655179089f
AnalysisConfig remove contrib namespace (#15540)
6 years ago
Zhaolong Xing 97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Yan Chunwei e2818c8608
add dynamic memory optim (#15457)
6 years ago
nhzlx 027d24c831 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
nhzlx 312fe0ece1 add trt int8 calibration support
6 years ago
nhzlx 4e3522e5b4 add trt int8 support
6 years ago
Yan Chunwei d09d6eadc0
make inference api work with Doxygen (#15195)
6 years ago
Yan Chunwei 875a07c32d
refactor inference analysis api (#14634)
6 years ago
nhzlx 71636e677d add min_subgraph_size attr to tensorrt config
6 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
6 years ago
Tao Luo 743cb840f1 update with comments
6 years ago
Tao Luo 405b2486db support loading from memory
6 years ago
luotao1 a5c4b463c9 add SetMKLDNNThreadId api
6 years ago
Yiqun Liu 9e6b1c5f97
Refine tester of TensorRT engine (#14390)
6 years ago
Yan Chunwei 9f252e0032
Combine Inference Analysis with IR (#13914)
6 years ago