44 Commits (2c0a4a347015d2201fde5acdd2c0bb411a43f8f0)

Author SHA1 Message Date
Wilber 1bf4836580
[Inference] Add TryShrinkMemory interface. (#28409)
5 years ago
石晓伟 0d27591642
save operator version infomation to program desc, test=develop (#27668)
5 years ago
Wilber df7fabeedc
Fix memory leak for mkldnn. (#27493)
5 years ago
Pei Yang 43f9f180e5
Add api to clear intermediate tensors in AnalysisPredictor (#25069)
5 years ago
Wilber 0129f4b568
Add some inference API comments for AnalysisPredictor (#23242)
5 years ago
石晓伟 01b9d07963
update operator compatible info, test=develop (#19978)
6 years ago
Zhaolong Xing e89b12884a
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)
6 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
Tao Luo 076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580)
6 years ago
Tao Luo 3123d18787
remove unused AnalysisPredictor::SetMkldnnThreadID() (#18444)
6 years ago
lijianshe02 de26df440b
add SaveOptimModel interface in analysis_predictor.h and test it in a… (#16441)
6 years ago
Wojciech Uss 09dfc7a2aa
C-API quantization core 2 (#16396)
6 years ago
nhzlx 69d37f81d7
cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
nhzlx 3c40cb767b
7 refine zero copy
6 years ago
nhzlx 2eff3e26b6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_static_model_load_for_trt
6 years ago
nhzlx 06a088a199
fix comments and fix cpplint
6 years ago
nhzlx 0ed63b2108
6. delete useless predictor id
6 years ago
Sylwester Fraczek 8bc604571f
fix typo seriazlized->serialized
6 years ago
nhzlx 2070fb246d
4. do the trt_engine optim during init.
6 years ago
Yan Chunwei 655179089f
AnalysisConfig remove contrib namespace (#15540)
6 years ago
Yan Chunwei 526790e652
infer get program (#15511)
6 years ago
Zhaolong Xing 97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Yan Chunwei e2818c8608
add dynamic memory optim (#15457)
6 years ago
nhzlx 027d24c831
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
flame d60751fb71
add python inference api (#15248)
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
Yan Chunwei e84234b551
make clone thread safe (#15363)
6 years ago
nhzlx 312fe0ece1
add trt int8 calibration support
6 years ago
nhzlx 4e3522e5b4
add trt int8 support
7 years ago
Yan Chunwei d09d6eadc0
make inference api work with Doxygen (#15195)
7 years ago
Yiqun Liu 726f2cefe3
Fix bug of referencing a temporary variable. (#14614)
7 years ago
luotao1 116979a40a
refine api name
7 years ago
luotao1 a5c4b463c9
add SetMKLDNNThreadId api
7 years ago
Yan Chunwei 9f252e0032
Combine Inference Analysis with IR (#13914)
7 years ago
qingqing01 abe209234f
Exhaustive search for cuDNN conv. (#14286)
7 years ago
qingqing01 db8c52da5e
Revert " Exhaustive search for cuDNN conv. (#14043)"
7 years ago
qingqing01 ce7d9b0799
Exhaustive search for cuDNN conv. (#14043)
7 years ago
Yan Chunwei ee74be3a49
[1.1] Bugfix/tensorarray (#14044)
7 years ago
Yan Chunwei 6809238d97
fix analysis predictor profile (#13896)
7 years ago
Yan Chunwei c8744d118d
fea/infer executor and concurrency performance issue bug fix (#13451)
7 years ago
Yan Chunwei 5de14c6b96
refine inference api (#13518)
7 years ago
Yan Chunwei 9df2d8b5ba
test/add text-classification test (#13081)
7 years ago
Yan Chunwei af15f6f038
fea/refine fuse (#13076)
7 years ago