Commit Graph

602 Commits (cced930b61ba246dffec68bbe09bd9e22a142d64)

Author SHA1 Message Date
joanna.wozna.intel 781df300d0
Unification of BF16 enablement process (#31034)
5 years ago
Wilber 463eae0383
update paddle_fluid.so to paddle_inference.so (#30850)
5 years ago
Qi Li eb3050fa9a
[ROCM] update fluid inference for rocm (part1), test=develop (#31018)
5 years ago
Wilber 39aeaa160e
fix jetson problem (#30939)
5 years ago
Wilber cf8b8f9c5e
resolve memory leak in cudnn8.0 (#31029)
5 years ago
Pei Yang 9b54fe4154
add trt transpose and flatten converter (#31022)
5 years ago
石晓伟 99bd16eb4e
bug fix of xpu lite engine, test=develop (#30918)
5 years ago
joanna.wozna.intel 73cdea01d4
Add bf16 fast performance verification (#30551)
5 years ago
wanghuancoder 35c5b23f68
use iwyu clean include second time, test=develop (#30829)
5 years ago
石晓伟 2ac4143b6c
support xpu with analysis predictor, test=develop (#30832)
5 years ago
Adam Osewski 4f066e316e
Layer normalization fuse pass. (#30721)
5 years ago
Shang Zhizhou ae0f88a988
add DLA support:C++&&Python api (#30165)
5 years ago
Jacek Czaja dfdb0359ea
- Disabling oneDNN inplace pass (#30588)
5 years ago
石晓伟 715d862868
export global google flags to users, test=develop (#30448)
6 years ago
alncat 7bbf3ac5ab
Added support for inference using quantization aware trained dygraph (#30288)
6 years ago
Zhang Jun 10a8f3e5c3
fix bug on compiling inference shared lib with crypto;test=develop (#30269)
6 years ago
石晓伟 8c4500ff6d
fix header file paths of gflags, commit 2, test=develop (#30272)
6 years ago
石晓伟 8ce2482b80
fix header file paths of gflags, commit 1, test=develop (#30271)
6 years ago
Wilber ade244948c
disable mkldnn inplace pass on windows (#30164)
6 years ago
joanna.wozna.intel 907262ee15
Fix analysis predictor test (#30191)
6 years ago
Shang Zhizhou 05b27695f1
add inference api: DisableTensorRtOps (#30109)
6 years ago
cc 68398abce9
[Inference] zero_copy_tensor supports int8_t (#30053)
6 years ago
cc 6a0102b038
map matmul/squeeze2+matmul/reshape2+matmul to mul (#29911)
6 years ago
Wilber 2b1d796cd0
[Inference] Solve 2.0 trt performance reduce compare 1.8. (#29925)
6 years ago
石晓伟 acb5e86363
fix a bug in reset_tensor_array, test=develop (#29620)
6 years ago
jakpiase edc06c6a1b
Added fc + activation fuse pass (currently only gelu, sigmoid and tanh are supported) (#29772)
6 years ago
Wilber 2c0a4a3470
call_statck is turned on default when ON_INFER=ON (#29798)
6 years ago
Zhang Jun 0c23ba95d8
enable MakeCiper api for inference;test=develop (#29692)
6 years ago
Wilber b593d588aa
[Inference] EnableUseGpu has higher priority than flags (#29697)
6 years ago
Pei Yang f860de4af7
support clip op trt converter (#29411)
6 years ago
Wilber d68af02c04
fix analysis_config bug. (#29304)
6 years ago
Wilber a22ea652cf
fix trt delete_pass bug. (#28763)
6 years ago
gongweibao 1dad8ceaab
Fix gpu memory allocation bug. (#28703)
6 years ago
Wilber 04cefeacc5
Disable windows gpu static lib. (#28741)
6 years ago
Wilber 8b97bb2e1f
Update cmake for arm ft and fix a bug for Predictor dtor. (#28586)
6 years ago
Shang Zhizhou 8699f38d08
裁剪transformer模型trt支持;修复tensorRT不支持DeletePass的bug (#28517)
6 years ago
Wilber 1bf4836580
[Inference] Add TryShrinkMemory interface. (#28409)
6 years ago
Wilber 645e999afc
fix api_impl test. (#28483)
6 years ago
YUNSHEN XIE ba0756325a
exec ut no more than 15s 1 (#28439)
6 years ago
iducn ba0fe0a812
revert the modified shell script (#28453)
6 years ago
Shang Zhizhou ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
6 years ago
iducn f763cb81a6
Modify the shell script according to the specification (#28302)
6 years ago
Adam Osewski 7db747d9e8
oneDNN BatchNorm + Act fusion pass. (#27912)
6 years ago
Wilber f935ca8a50
[lite-xpu-subgraph] Fix xpu compile and test xpu ci. (#27932)
6 years ago
joanna.wozna.intel a21b57109c
Add AVX512 instruction check for C-API (#28087)
6 years ago
joanna.wozna.intel ddcd1b5381
Add bfloat16 resnet50 test (#27755)
6 years ago
Wilber 345574a6ed
Demo CMakeLists add openmp flag. (#27848)
6 years ago
石晓伟 0d27591642
save operator version infomation to program desc, test=develop (#27668)
6 years ago
Wilber df7fabeedc
Fix memory leak for mkldnn. (#27493)
6 years ago
Wilber ec4155d7d0
windows lib size crop from 5.4G to 3.9G (#27477)
6 years ago