Commit Graph

1493 Commits (a501a7b0caadcfbbcb2f637ed58b52aa07f7d2dc)

Author SHA1 Message Date
Chen Weihang 87852616aa
[CustomOp] Support complex dtype in custom op (#31657)
4 years ago
zlsh80826 fe241fd02f
[Paddle-TRT] gather converter (#31640)
4 years ago
zlsh80826 4ea3427865
[Paddle-TRT] support batch axis concatenation when using dynamic shape (#31627)
4 years ago
Pei Yang cac9635a67
[Paddle-TRT] Fix engine key in trt int8 calibration (#31513)
4 years ago
Shang Zhizhou 50ac7dbfd0
Trt elementwise plugin serialize (#31587)
4 years ago
lidanqing 0f1e7e3d52
[Bug fix] Different machine generate different binary file, remove md5 check (#31482)
4 years ago
Shang Zhizhou f57739be35
fix ernie_varlen when cutting head (#31497)
4 years ago
石晓伟 5f6213217b
update zero_copy_tensor_test.cc for build of gcc485, test=develop (#31470)
4 years ago
石晓伟 bc7632be73
upgrade inference tensor apis, test=develop (#31402)
4 years ago
Pei Yang 30717a6cbc
fix trt serialization on windows (#31438)
4 years ago
Pei Yang 1321c47950
add more info in trt engine serialization (#31434)
4 years ago
Pei Yang 32211fe9c4
TRT conv2d converter support SAME padding (#31379)
4 years ago
Shang Zhizhou 77c44e2f1b
change prelu plugin to tensorRT layer (#30210)
4 years ago
Pei Yang 2e9e3fad15
add n-d input support for trt scale converter (#31316)
4 years ago
Shang Zhizhou 6404c43814
support trt serialize when load model from memory (#31342)
4 years ago
Qi Li 72d99c5dcd
[ROCM] update fluid operators for rocm (part4), test=develop (#31225)
4 years ago
Wilber e20234094c
Fix xpu compile and cipher symbol problem. (#31271)
4 years ago
石晓伟 625482f752
inference modification for custom operator, test=develop (#31312)
4 years ago
石晓伟 1da3280660
inference modification for custom operator, test=develop (#31283)
4 years ago
石晓伟 8c94d8cb4c
[Custom OP] change the user header file format, test=develop (#31274)
4 years ago
Wilber 7d91974c91
enable lite ut. (#30890)
4 years ago
joanna.wozna.intel d11602481c
Add bf16 gru model test (#31158)
4 years ago
jakpiase 2f1165342b
OneDNN hardswish integration (#30211)
4 years ago
Pei Yang 00b09e86ac
[Paddle-TRT] support group_norm (#31040)
4 years ago
Zhou Wei be61c2d06b
support build whl and inference library nightly,test=windows3 (#30616)
4 years ago
joanna.wozna.intel 781df300d0
Unification of BF16 enablement process (#31034)
4 years ago
Shang Zhizhou a5c56d83a1
update trt int8 calibrator to IEntropyCalibratorV2 (#31060)
4 years ago
Wilber 463eae0383
update paddle_fluid.so to paddle_inference.so (#30850)
4 years ago
Qi Li eb3050fa9a
[ROCM] update fluid inference for rocm (part1), test=develop (#31018)
4 years ago
Wilber 39aeaa160e
fix jetson problem (#30939)
4 years ago
Wilber 01ccfbcde9
update trt error message when input height or width is -1 (#31019)
4 years ago
Wilber cf8b8f9c5e
resolve memory leak in cudnn8.0 (#31029)
4 years ago
Pei Yang 9b54fe4154
add trt transpose and flatten converter (#31022)
4 years ago
joanna.wozna.intel caf9d39839
Add Conv Transpose BF16 (#30877)
4 years ago
石晓伟 99bd16eb4e
bug fix of xpu lite engine, test=develop (#30918)
4 years ago
joanna.wozna.intel 73cdea01d4
Add bf16 fast performance verification (#30551)
4 years ago
Shang Zhizhou e6095bc2ce
fix split trt plugin initialize (#30875)
4 years ago
wanghuancoder 35c5b23f68
use iwyu clean include second time, test=develop (#30829)
4 years ago
石晓伟 2ac4143b6c
support xpu with analysis predictor, test=develop (#30832)
4 years ago
Adam Osewski 4f066e316e
Layer normalization fuse pass. (#30721)
4 years ago
Shang Zhizhou b909450994
fix trt plugin clone and initialize bugs in TRT7.1+ (#30709)
4 years ago
Wilber b08ae368bb
ci compilation depends on a stable release (#30755)
4 years ago
lidanqing 46989e889b
Fix python3 incompatibility issues (#30698)
4 years ago
Shang Zhizhou ae0f88a988
add DLA support:C++&&Python api (#30165)
4 years ago
Pei Yang cf9bdb9404
extend trt ut timeout threshold (#30537)
4 years ago
石晓伟 33bf6eb753
revert external gflags, test=develop (#30623)
4 years ago
Jacek Czaja dfdb0359ea
- Disabling oneDNN inplace pass (#30588)
4 years ago
Tao Luo 9dd71c74df
disable test_analyzer_detect (#30541)
4 years ago
Leo Chen 81217a94d8
unify calling cudaSetDevice (#30470)
4 years ago
石晓伟 715d862868
export global google flags to users, test=develop (#30448)
4 years ago