Commit Graph

149 Commits (ea851796e5bf5018ee4535758e0509c148285473)

Author SHA1 Message Date
Shang Zhizhou ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
5 years ago
joanna.wozna.intel ddcd1b5381
Add bfloat16 resnet50 test (#27755)
5 years ago
石晓伟 0d27591642
save operator version infomation to program desc, test=develop (#27668)
5 years ago
Wilber df7fabeedc
Fix memory leak for mkldnn. (#27493)
5 years ago
Wilber 5034d181f3
update for 2.0 inference api. (#27473)
5 years ago
Wilber dae62556cb
Enhance infer error info message (#26731)
5 years ago
Wilber 1b84c0bf43
Lite subgraph refine predictor (#27167)
5 years ago
zlsh80826 ad6e3dd69c
[Paddle-TRT] Stack op plugin (#25605)
5 years ago
Wilber 68e0560c2f
refine paddle inference api (#26774)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Pei Yang 43f9f180e5
Add api to clear intermediate tensors in AnalysisPredictor (#25069)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Tao Luo c1df7048c7
add UT for mkldnn_cache_capacity (#24336)
5 years ago
Tao Luo 9eedf05d2f
solve mklml memory leak on windows (#24015)
5 years ago
石晓伟 17ac6e2580
update the analysis predictor for multi-stream support, test=develop (#24046)
5 years ago
Zhaolong Xing 133f1fc123
[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723)
5 years ago
guofei 2b896c1f6b
Support LoDTensorArray in fetch (#23645)
5 years ago
Pei Yang c528f1d4f3
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23672)
5 years ago
Zhaolong Xing 3acb047a20
[Paddle-TRT]: add eltwise,pool2d, prelu, scale, concat, gelu dynamic shape support (#23396)
5 years ago
Tao Luo e4f1b1c5e1
solve mklml memory leak (#23557)
5 years ago
Pei Yang 3d5d217030
Revert "[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)", test=develop (#23642)
5 years ago
Pei Yang cdc6d4e292
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
石晓伟 1861ca88f1
serialize the PaddleTensor, test=develop (#22810)
5 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
Pei Yang 50bee83f71
add TRT support for instance_norm op (#21928)
5 years ago
Pei Yang 0a51098a71
Add TRT support for BERT (#21135)
5 years ago
石晓伟 2bb135825e
fix analysis_predictor when func is called multiple times, test=release/1.6 (#21665)
6 years ago
Zhaolong Xing fbbd94a6ce
there is bug for inference using auto grwoth allocator (#21621)
6 years ago
Pei Yang 20d61414b4
fix glog warning, test=develop (#21573)
6 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
6 years ago
Zhaolong Xing b39c011637
specify the auto growth allocator for inference. (#21448)
6 years ago
Lv Mengsi 37f3e56dea
Fix transpose conv (#21406)
6 years ago
Zhaolong Xing d1a6e112e6 fix C++ multicard inference bug. (#20955)
6 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
6 years ago
石晓伟 e742760f8e
optimize version error, test=develop (#20715)
6 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
6 years ago
石晓伟 01b9d07963
update operator compatible info, test=develop (#19978)
6 years ago
Zhaolong Xing e89b12884a
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)
6 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
Zhaolong Xing 26ae6d49e4
Update trt5 for paddle-trt (#18645)
6 years ago
Tao Luo 076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
Tao Luo 3123d18787
remove unused AnalysisPredictor::SetMkldnnThreadID() (#18444)
6 years ago