Commit Graph

60 Commits (d84eb9b33f8751e50c73c07ed0d88379d9a406e9)

Author SHA1 Message Date
Pei Yang ae6e40a7fd
Add unittests and OP version registry for tensorrt_subgraph_pass (#27544)
4 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
4 years ago
Chen Weihang 765064476b
Polish some lost invalid error message (#27445)
4 years ago
Pei Yang fda54c0212
errmsg refine of trt plugin (#27309)
4 years ago
Pei Yang a5ef246cac
Optimize emb_eltwise_layernorm_plugin and support fp16 (#27128)
4 years ago
Shang Zhizhou 47fdc60ecc
Optimize slice trt plugin (#26970)
4 years ago
Pei Yang aae41c6fca
refine error message related to paddle-TRT (#27256)
4 years ago
Zhaolong Xing 932bbe955b
fix pool trt plugin bug (#26463)
5 years ago
zlsh80826 ad6e3dd69c
[Paddle-TRT] Stack op plugin (#25605)
5 years ago
zlsh80826 ac63c7cdef
fix a skip_layernorm bug, test=develop (#26800)
5 years ago
Zhaolong Xing b7a86e92a8
fix dy shape bug in trt7.1 (#26273)
5 years ago
Pei Yang beb0ca5fab
Fix TRT plugin registry without TRT lib (#25982)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
Pei Yang f82baed866
fix trt instance norm plugin on gcc8. test=develop (#25730)
5 years ago
Jeng Bai-Cheng fc93266b0a
Improve qkv transpose performance (#23919)
5 years ago
Zhaolong Xing 7b7e605189
[Fix BUGs]: fix multhead matmul pass's instable bug (#25123)
5 years ago
Zhaolong Xing 843581154f
fix emb eltwise layernorm (#24873)
5 years ago
Leo Chen fa657b3dbb
fix bug of prelu when rank not equal 4, test=develop (#25067)
5 years ago
zlsh80826 49e4ee27e1
[Paddle-TRT] slice kernel optimization (#24783)
5 years ago
zlsh80826 fdbe114b12
[Paddle-TRT] use float constant instead of double test=develop (#24544)
5 years ago
Zhaolong Xing 35148d17f7
[BUG]: Head number can only be > 1 on multihead op (#23974)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Zhaolong Xing 133f1fc123
[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723)
5 years ago
Pei Yang c528f1d4f3
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23672)
5 years ago
Pei Yang 015acdbfb1
Refine error message of leaky_relu, tensorrt_engine, split, prelu op converter (#23661)
5 years ago
Zhaolong Xing 3acb047a20
[Paddle-TRT]: add eltwise,pool2d, prelu, scale, concat, gelu dynamic shape support (#23396)
5 years ago
Pei Yang 3d5d217030
Revert "[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)", test=develop (#23642)
5 years ago
Pei Yang cdc6d4e292
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)
5 years ago
Zhaolong Xing 1a6ce8b910
add swish split gelu plugin dynamic support (#23305)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Pei Yang d8a9b134e3
fix trt instance_norm serialize bug. test=develop (#22152)
5 years ago
Pei Yang 50bee83f71
add TRT support for instance_norm op (#21928)
5 years ago
Pei Yang 0a51098a71
Add TRT support for BERT (#21135)
5 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
5 years ago
Tao Luo 3ae939e48a
unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631)
6 years ago
wopeizl 80b7ef6fc8
add tensorrt support for windows (#19084)
6 years ago
Zhaolong Xing 26ae6d49e4
Update trt5 for paddle-trt (#18645)
6 years ago
nhzlx 06a088a199 fix comments and fix cpplint
6 years ago
nhzlx 0ed63b2108 6. delete useless predictor id
6 years ago
nhzlx 1d5ef7c9ee 5. add static trt load model
6 years ago
nhzlx f75815b78c add prelu gpu inference
6 years ago
hjchen2 6eba5bd276 Fix direct copy and refine split ut
6 years ago
hjchen2 5857fb3014 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
6 years ago
hjchen2 3e3599f3d9 Refine split tensorrt plugin
6 years ago
nhzlx e62872df8b fix conflicts
6 years ago
Houjiang Chen 33c65517fd Update CMakeLists.txt test=develop
6 years ago
Houjiang Chen 01bda73116
Update CMakeLists.txt
6 years ago
hjchen2 2c2a192eb1 Resolve merge conflicts
6 years ago
Yiqun Liu 8bc1c5d2ab
Implement the Tensorrt plugin for elementwise op (#14487)
6 years ago
hjchen2 a8c077df7c Implement leaky relu tensorRT converter
6 years ago