Commit Graph

212 Commits (d5e40d1ba911a35f1094e9d04260e6c8d85fa68b)

Author SHA1 Message Date
Pei Yang b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097)
5 years ago
Zhaolong Xing 843581154f
fix emb eltwise layernorm (#24873)
5 years ago
Leo Chen fa657b3dbb
fix bug of prelu when rank not equal 4, test=develop (#25067)
5 years ago
Jeng Bai-Cheng bef4afa6de
bugfix for unique_ptr of IOptimizationProfile (#23917)
5 years ago
zlsh80826 49e4ee27e1
[Paddle-TRT] slice kernel optimization (#24783)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Pei Yang 181b1f5a30
adjust minimum trt version for hard_sigmoid converter to 5130. test=develop (#24746)
5 years ago
zlsh80826 fdbe114b12
[Paddle-TRT] use float constant instead of double test=develop (#24544)
5 years ago
Zhaolong Xing f68d4fb3f1
fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All (#24517)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Zhaolong Xing 35148d17f7
[BUG]: Head number can only be > 1 on multihead op (#23974)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Zhaolong Xing 133f1fc123
[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723)
5 years ago
chenhaoze b7d185d6ca
OP clip, merge_lod_tensor, convert/elementwise error message enhancement (#23742)
5 years ago
Pei Yang c528f1d4f3
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23672)
5 years ago
Pei Yang 015acdbfb1
Refine error message of leaky_relu, tensorrt_engine, split, prelu op converter (#23661)
5 years ago
Zhaolong Xing 3acb047a20
[Paddle-TRT]: add eltwise,pool2d, prelu, scale, concat, gelu dynamic shape support (#23396)
5 years ago
Zhaolong Xing ed5766ffbc
refine act conv2d pool2d trt converter log (#23605)
5 years ago
Pei Yang 28f04c6a5e
refine shuffle channel errmsg, test=develop (#23520)
5 years ago
Pei Yang 3d5d217030
Revert "[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)", test=develop (#23642)
5 years ago
Pei Yang eb11633611
batch_norm trt converter error message, test=develop (#23620)
5 years ago
Pei Yang cdc6d4e292
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)
5 years ago
Pei Yang 42655ef721
Add full_like op. (#23364)
5 years ago
Zhaolong Xing 1a6ce8b910
add swish split gelu plugin dynamic support (#23305)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Pei Yang 24db750386
fix trt int8 calib precision bug. test=develop (#23036)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
Zhaolong Xing 54a325a52f
[Refine Paddle-TRT INT8]: Support PaddleSlim's Resnet50, Mobilenetv1, Yolov3 models for Inference. (#22483)
5 years ago
Zhaolong Xing ceda0b9b1a
[Fix BUG]: Core when multi thread + clone + paddle-trt (#22442)
5 years ago
zhouwei25 549e6de7ac faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
5 years ago
Pei Yang d8a9b134e3
fix trt instance_norm serialize bug. test=develop (#22152)
5 years ago
Pei Yang 50bee83f71
add TRT support for instance_norm op (#21928)
5 years ago
Pei Yang 0a51098a71
Add TRT support for BERT (#21135)
5 years ago
Zhaolong Xing da7748c53d
add conv, depthwise_conv, pooling (#20966)
6 years ago
Pei Yang 2e2f92a5b1
fix trt weight bug (#21231)
6 years ago
Zhaolong Xing 65f7052554
TRT int8: refine trt int8 for dynamic range set (#21112)
6 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
6 years ago
Pei Yang 74812d1c90
Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825)
6 years ago
石晓伟 d004a0f50e
fix multi-thread exec of trt, test=develop (#19338)
6 years ago
Tao Luo f05d2c519d paddle::framework::vectorize() templatization [PART3] (#19643)
6 years ago
Tao Luo 3ae939e48a
unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
wopeizl 80b7ef6fc8
add tensorrt support for windows (#19084)
6 years ago
Zhaolong Xing 3816d221ff
Fix the CE error which caused by paddle-trt version (#18941)
6 years ago
Zhaolong Xing 61238d31f7
Trt fp16 support (#18860)
6 years ago
Huihuang Zheng cfce4994cf
Merge cuda 9/10 dockerfile with root dockerfile (#18693)
6 years ago
Zhaolong Xing 26ae6d49e4
Update trt5 for paddle-trt (#18645)
6 years ago
Zhaolong Xing ae576f3c68
fix: when use the load model from memory mode, the RAM occupy is high (#17788)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago