Commit Graph

253 Commits (cced930b61ba246dffec68bbe09bd9e22a142d64)

Author SHA1 Message Date
Shang Zhizhou a5c56d83a1
update trt int8 calibrator to IEntropyCalibratorV2 (#31060)
5 years ago
Wilber 01ccfbcde9
update trt error message when input height or width is -1 (#31019)
5 years ago
Pei Yang 9b54fe4154
add trt transpose and flatten converter (#31022)
5 years ago
Shang Zhizhou e6095bc2ce
fix split trt plugin initialize (#30875)
5 years ago
wanghuancoder 35c5b23f68
use iwyu clean include second time, test=develop (#30829)
5 years ago
Shang Zhizhou b909450994
fix trt plugin clone and initialize bugs in TRT7.1+ (#30709)
5 years ago
Shang Zhizhou ae0f88a988
add DLA support:C++&&Python api (#30165)
5 years ago
Leo Chen 81217a94d8
unify calling cudaSetDevice (#30470)
5 years ago
alncat 7bbf3ac5ab
Added support for inference using quantization aware trained dygraph (#30288)
6 years ago
WeiXin 66dc4ac77b
modify error message based on comments (#30189)
6 years ago
Wilber 01a287bf0a
fix windows compile when WITH_PYTHON=ON and WITH_TENSORRT=ON (#30194)
6 years ago
Pei Yang 2480bdef6c
change hard_swish from plugin to layer (#29177)
6 years ago
Pei Yang f860de4af7
support clip op trt converter (#29411)
6 years ago
Shang Zhizhou b9e76a0103
detect tensorRT plugin fp16 in runtime (#27933)
6 years ago
Pei Yang 994673bf4f
change avg pooling and global pooling to trt layer in dynamic shape mode (#28702)
6 years ago
Shang Zhizhou 8699f38d08
TRT support for pruned transformer models; fix the bug where tensorRT does not support DeletePass (#28517)
6 years ago
Shang Zhizhou ea851796e5
Optimize ernie model inference performance in TensorRT and support variable-length input (#28367)
6 years ago
Pei Yang 602d2ce5c9
change avg pooling from trt plugin to trt layer (#28032)
6 years ago
Shang Zhizhou bbc837ee72
add info log for trt input dynamic shape check (#27796)
6 years ago
Pei Yang ae6e40a7fd
Add unittests and OP version registry for tensorrt_subgraph_pass (#27544)
6 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
6 years ago
Chen Weihang 765064476b
Polish some lost invalid error message (#27445)
6 years ago
Pei Yang fda54c0212
errmsg refine of trt plugin (#27309)
6 years ago
Pei Yang a5ef246cac
Optimize emb_eltwise_layernorm_plugin and support fp16 (#27128)
6 years ago
Shang Zhizhou 47fdc60ecc
Optimize slice trt plugin (#26970)
6 years ago
Shang Zhizhou e6e2e53782
Optimize error report (#27254)
6 years ago
Pei Yang aae41c6fca
refine error message related to paddle-TRT (#27256)
6 years ago
Zhaolong Xing 932bbe955b
fix pool trt plugin bug (#26463)
6 years ago
zlsh80826 ad6e3dd69c
[Paddle-TRT] Stack op plugin (#25605)
6 years ago
Pei Yang 78a530c219
[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536)
6 years ago
zlsh80826 ac63c7cdef
fix a skip_layernorm bug, test=develop (#26800)
6 years ago
Pei Yang e3f8e5cf5c
trt int8 support conv2d_transpose (#26636)
6 years ago
Pei Yang 379222c3f1
add output scale and trt op teller support for hard_swish and hard_sigmoid (#26499)
6 years ago
Zhaolong Xing b7a86e92a8
fix dy shape bug in trt7.1 (#26273)
6 years ago
Pei Yang beb0ca5fab
Fix TRT plugin registry without TRT lib (#25982)
6 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
6 years ago
Pei Yang 9e9a569dae
add trt int8 support for elementwise_mul and scale (#25676)
6 years ago
Pei Yang eef98b7f86
add macro check for using TRT api dynamicRangeIsSet() (#25694)
6 years ago
Pei Yang f82baed866
fix trt instance norm plugin on gcc8. test=develop (#25730)
6 years ago
Jeng Bai-Cheng fc93266b0a
Improve qkv transpose performance (#23919)
6 years ago
Zhaolong Xing 7b7e605189
[Fix BUGs]: fix multhead matmul pass's instable bug (#25123)
6 years ago
Pei Yang b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097)
6 years ago
Zhaolong Xing 843581154f
fix emb eltwise layernorm (#24873)
6 years ago
Leo Chen fa657b3dbb
fix bug of prelu when rank not equal 4, test=develop (#25067)
6 years ago
Jeng Bai-Cheng bef4afa6de
bugfix for unique_ptr of IOptimizationProfile (#23917)
6 years ago
zlsh80826 49e4ee27e1
[Paddle-TRT] slice kernel optimization (#24783)
6 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
6 years ago
Pei Yang 181b1f5a30
adjust minimum trt version for hard_sigmoid converter to 5130. test=develop (#24746)
6 years ago
zlsh80826 fdbe114b12
[Paddle-TRT] use float constant instead of double test=develop (#24544)
6 years ago
Zhaolong Xing f68d4fb3f1
fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All (#24517)
6 years ago