Commit Graph

496 Commits (6df1ca54c869efe34723be8bee2ce15db7320a2a)

Author SHA1 Message Date
石晓伟 ced6e87eee
Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854)
5 years ago
Pei Yang 78a530c219
[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536)
5 years ago
Wilber 68e0560c2f
refine paddle inference api (#26774)
5 years ago
joanna.wozna.intel 02083bda40
Add mkldnn bfloat16 option to C-API (#26676)
5 years ago
chalsliu dc56c89822
Add the option to execute unit tests only at night (#26669)
5 years ago
LoveAn 02fc1fef8b
Fix the cmake-function named inference_download_and_uncompress on Windows (#26512)
5 years ago
YUNSHEN XIE a8b5741fb4
add a few unittests for setting timeout properity (#26630)
5 years ago
Pei Yang b757466b0d
fix trt dynamic ernie serialization unit test (#26228)
5 years ago
Wilber c84aa9c61f
update diff val. (#26242)
5 years ago
Wilber fb72b192e7
[DOC] Fix dead link (#26154)
5 years ago
lidanqing 65b97d6215
GRU model xnli dataset C++ tester (#25534)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
Zhaolong Xing bcddefef39
[Fix Ut]: fix inference ut which exist bug on windows. (#25814)
5 years ago
Pei Yang f82baed866
fix trt instance norm plugin on gcc8. test=develop (#25730)
5 years ago
Adam 98899b73d2
Fix FC + GRU fuse pass (#25687)
5 years ago
Pei Yang 43f9f180e5
Add api to clear intermediate tensors in AnalysisPredictor (#25069)
5 years ago
Wilber 4474fc1033
fix compile on windows. test=develop (#25310)
5 years ago
Wojciech Uss 23a4f54b73
rename qat into quant (#24948)
5 years ago
Wojciech Uss 56fa3880e3
rename qat into quant in filenames only (#25194)
5 years ago
Pei Yang b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097)
5 years ago
iducn f282599229
disable unitest for gcc8(#25134)
5 years ago
Wilber f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries (#24743)
5 years ago
silingtong123 126d3d693b
support C++ inference shared library on windows (#24672)
5 years ago
Zhaolong Xing f68d4fb3f1
fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All (#24517)
5 years ago
Tao Luo 72c370c8d2
remove unused test_multi_thread_helper.h (#24399)
5 years ago
Tao Luo 63da846de0
remove old inference C++ tests (#24368)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Tao Luo c1df7048c7
add UT for mkldnn_cache_capacity (#24336)
5 years ago
石晓伟 17ac6e2580
update the analysis predictor for multi-stream support, test=develop (#24046)
5 years ago
lidanqing 61ec30f030
Update QAT INT8 2.0 doc (#24127)
5 years ago
Pei Yang 695a53c874
remove conv_bn_fuse_pass and fc_fuse_pass in trt int8 calibration. test=develop (#23805)
5 years ago
Zhaolong Xing 133f1fc123
[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723)
5 years ago
guofei 2b896c1f6b
Support LoDTensorArray in fetch (#23645)
5 years ago
lidanqing 2291634c5c
Add user local data preprocess support (#23692)
5 years ago
Zhaolong Xing 3acb047a20
[Paddle-TRT]: add eltwise,pool2d, prelu, scale, concat, gelu dynamic shape support (#23396)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Zhaolong Xing 1a6ce8b910
add swish split gelu plugin dynamic support (#23305)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
Michał Gallus 0038bfbd1d
Prevent loading of warmup data in analyzer_int8 if enable_int8 is set to false (#22857)
5 years ago
liu zhengxi 324f2b3922
Fix inference c api PD_GetZeroCopyOutput lod (#22768)
5 years ago
liu zhengxi 71ab0458e1
Fix pointer and c-api encapsulation (#22663)
5 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
5 years ago
flame 1d503e6a9e
Golang inference API (#22503)
5 years ago
Wojciech Uss 4cddb43c5c
Add support for Ernie NLP model to the Slim QAT (#22506)
5 years ago
Zhaolong Xing ceda0b9b1a
[Fix BUG]: Core when multi thread + clone + paddle-trt (#22442)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
joanna.wozna.intel 3099d9d47c Restore requantize squash (#22399)
5 years ago
zhouwei25 549e6de7ac faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
5 years ago
Wilber 1230c110cb
[fluid-lite] adjust to relative error (#22232)
5 years ago
Wojciech Uss 2e90c4eb0a improve mkldnn_quantizer_config test code coverage (#22216)
5 years ago
Wilber 5750152e80
support fluid-lite subgraph run resnet test=develop (#22191)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
Pei Yang 50bee83f71
add TRT support for instance_norm op (#21928)
5 years ago
Pei Yang 0a51098a71
Add TRT support for BERT (#21135)
5 years ago
lidanqing 9dff56e8e2 change qat_performance with mobilenet, change batch_size of qat2_resnet50 (#21895)
5 years ago
Michał Gallus a5159d8480 Re-anble vgg and resnet101 models download (#21713)
5 years ago
joanna.wozna.intel d419b859c0 Add reshape int8 mkldnn op (#21428)
5 years ago
Adam e81f0228df MKL-DNN 1.0 Update (#20162)
5 years ago
rensilin 7f5d532a9c fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461)
5 years ago
lidanqing fbf9eca0d3 QAT Int8 document (#21360)
5 years ago
GaoWei8 250a192181 Add ernie large c++ inference test (#21365)
5 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
5 years ago
lidanqing c0aa13672e Fp32 vs int8 qat C++ performance (#21244)
5 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
5 years ago
liu zhengxi 3cb6c0a059
Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240)
5 years ago
Zhaolong Xing 65f7052554
TRT int8: refine trt int8 for dynamic range set (#21112)
5 years ago
GaoWei8 a9d4eed3a8 fix cmake fails on inference_download_and_uncompress (#21185)
5 years ago
Adam d74ea0855f Add relative error measure when (value > 1) (#21144)
5 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
5 years ago
GaoWei8 829bf871d7 Add ernie c++ inference test (#21015)
5 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
5 years ago
bingyanghuang fd49ebcbd8 update int8 benchmark with 6271 data, test=develop test=document_fix (#20736)
5 years ago
liu zhengxi d39777fefa
alter the capi of PD_PredictorRun to provide proper function, test=develop (#20697)
5 years ago
lidanqing 57b656f956 Add document for int8 object detection quantization (#19356)
5 years ago
liu zhengxi 922d432477
fix the PD_ZeroCopyPredictorRun output problem (#20612)
5 years ago
bingyanghuang 85e1f2150b Modify the helper information in full_pascalvoc_test_preprocess.py (#20475)
5 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
5 years ago
liu zhengxi 301eeb5bea
Add capi for fluid inference api (#20092)
5 years ago
Zhaolong Xing e89b12884a
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)
5 years ago
Aurelius84 99a9615a4b
Removing length dims constraints of seq_pad and seq_unpad (#19497)
5 years ago
pawelpiotrowicz 2c5c636514 Add two extra flags for test_analyzer_int8_image_classification to disable fp32/int8 (#19840)
5 years ago
Pei Yang 74812d1c90
Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825)
5 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
5 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
5 years ago
Yiqun Liu c5548178b0
A a pass to enable the use of cudnn (#19346)
6 years ago
lidanqing 9240e5325c add local user data conversion into full_pascalvoc_test_preprocess.py (#19283)
6 years ago
lidanqing 07a4d8f8d6 Fix mAP problem in unit test of int8 object detection test (#18946)
6 years ago
Leo Zhao 10eeed93d1 Revert "use static variable to do cache instead of thread local in thread frequent switching case (#18428)" (#18879)
6 years ago
Tao Luo 076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580)
6 years ago
Leo Zhao ce38bb5341 use static variable to do cache instead of thread local in thread frequent switching case (#18428)
6 years ago
Tao Luo fe32879d2a
add mkldnn shapeblob cache clear strategy (#18513)
6 years ago
bingyanghuang 3fe6bf5ee6 fix command line bug in int8v2 readme (#18507)
6 years ago
石晓伟 047bba855b
Remove the obsolete cmake options (#18481)
6 years ago
Tao Luo d234aa02cd
add transfer_scope_cache unit-test (#18467)
6 years ago
Tao Luo 3123d18787
remove unused AnalysisPredictor::SetMkldnnThreadID() (#18444)
6 years ago
lidanqing 5fd68ac154 some fixes for int8 mobilenet_ssd tester (#18112)
6 years ago
翟飞跃 de42fe8fd5 Change int8v2 CAPI unit test name and add log in the prediction stage (#18200)
6 years ago
翟飞跃 78441c5449 add mkldnn Int8v2 slim doc (#17909)
6 years ago
Wojciech Uss ca5642c850 unify FP32 vs. INT8 comparison tests output (#18111)
6 years ago