1404 Commits (e2d01eb650dba6267046c1cfd6e64cf8cfd74267)

Author SHA1 Message Date
chenhaoze b7d185d6ca
OP clip, merge_lod_tensor, convert/elementwise error message enhancement (#23742)
5 years ago
Pei Yang c528f1d4f3
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23672)
5 years ago
Pei Yang 015acdbfb1
Refine error message of leaky_relu, tensorrt_engine, split, prelu op converter (#23661)
5 years ago
joanna.wozna.intel 12ba05ce0c
Add scale-matmul fuse pass (#23734)
5 years ago
Zhaolong Xing 3acb047a20
[Paddle-TRT]: add eltwise,pool2d, prelu, scale, concat, gelu dynamic shape support (#23396)
5 years ago
chenhaoze 9b06dd8628
Add three passes and api reference of paddle_pass_builder. test=develop (#23741)
5 years ago
Zhaolong Xing ed5766ffbc
refine act conv2d pool2d trt converter log (#23605)
5 years ago
Pei Yang 28f04c6a5e
refine shuffle channel errmsg, test=develop (#23520)
5 years ago
Tao Luo e4f1b1c5e1
solve mklml memory leak (#23557)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Pei Yang 3d5d217030
Revert "[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)", test=develop (#23642)
5 years ago
Pei Yang eb11633611
batch_norm trt converter error message, test=develop (#23620)
5 years ago
joanna.wozna.intel ce08fdcf2b
Add support for INT8 matmul in C-API quantization (#23463)
5 years ago
Pei Yang cdc6d4e292
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)
5 years ago
Pei Yang 42655ef721
Add full_like op. (#23364)
5 years ago
石晓伟 36b82eae0e
refine the doc of paddle_api.h, test=develop (#23402)
5 years ago
Zhaolong Xing 6a23850a3f
add init value to varis in analysis config. (#23442)
5 years ago
Zhaolong Xing 70782e6379
[Inference doc]: refine paddle_api.h doc (#23354)
5 years ago
Pei Yang 7e439780d9
add full paddle_analysis_config.h APIs. (#23215)
5 years ago
Zhaolong Xing 1a6ce8b910
add swish split gelu plugin dynamic support (#23305)
5 years ago
Jacek Czaja 2bb1b0e89e
[DNNL] Added MKL-DNN inplace pass for C-API inference (#23315)
5 years ago
石晓伟 708ded584e
pause the io_utils_test of int64 and resume after repair, test=develop (#23234)
5 years ago
Wilber 0129f4b568
Add some inference API comments for AnalysisPredictor (#23242)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Pei Yang 46b8d282dc
Add some inference API comments for AnalysisConfig (#23117)
5 years ago
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
5 years ago
Pei Yang 24db750386
fix trt int8 calib precision bug. test=develop (#23036)
5 years ago
Wilber db40ee86db
fix unittets. test=develop (#23018)
5 years ago
Zhang Ting 137d6563fc
add check for assigned data, test=develop (#22960)
5 years ago
Zhaolong Xing 8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
Michał Gallus 0038bfbd1d
Prevent loading of warmup data in analyzer_int8 if enable_int8 is set to false (#22857)
5 years ago
石晓伟 1861ca88f1
serialize the PaddleTensor, test=develop (#22810)
5 years ago
石晓伟 ddb9b46fec
change the function in op_teller, test=develop (#22794)
5 years ago
liu zhengxi 324f2b3922
Fix inference c api PD_GetZeroCopyOutput lod (#22768)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
liu zhengxi 71ab0458e1
Fix pointer and c-api encapsulation (#22663)
5 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
flame d97475d53b
fix CPU C inference API compile bug (#22702)
5 years ago
flame 74eb82de19
fix go api bug (#22669)
5 years ago
flame f7eafca828
remove python inference warning (#22602)
5 years ago
flame 1d503e6a9e
Golang inference API (#22503)
5 years ago
Zhaolong Xing 8acd745c25
[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486)
5 years ago
Wojciech Uss 4cddb43c5c
Add support for Ernie NLP model to the Slim QAT (#22506)
5 years ago
Zhaolong Xing 54a325a52f
[Refine Paddle-TRT INT8]: Support PaddleSlim's Resnet50, Mobilenetv1, Yolov3 models for Inference. (#22483)
5 years ago
Zhaolong Xing ceda0b9b1a
[Fix BUG]: Core when multi thread + clone + paddle-trt (#22442)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
joanna.wozna.intel 3099d9d47c
Restore requantize squash (#22399)
5 years ago
石晓伟 8cb04664b9
revert paddle_fluid.map, test=develop (#22236)
5 years ago
liu zhengxi 07afc29e90
Make api.cc malloc consistent with paddle_api.h for PaddleBuf (#22255)
5 years ago
silingtong123 4f1da4adcb
remove the useless third_party library from C++ inference library (#22021)
5 years ago
zhouwei25 549e6de7ac
faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
5 years ago
Wilber 1230c110cb
[fluid-lite] adjust to relative error (#22232)
5 years ago
Wojciech Uss 2e90c4eb0a
improve mkldnn_quantizer_config test code coverage (#22216)
5 years ago
Wilber 5750152e80
support fluid-lite subgraph run resnet test=develop (#22191)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
Pei Yang d8a9b134e3
fix trt instance_norm serialize bug. test=develop (#22152)
5 years ago
Yiqun Liu b1401fb74d
Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#22094)
5 years ago
Pei Yang 50bee83f71
add TRT support for instance_norm op (#21928)
5 years ago
Pei Yang 0a51098a71
Add TRT support for BERT (#21135)
5 years ago
Michał Gallus 6192108408
[DNNL] 3D Fully-Connected (#21746)
5 years ago
zhouwei25 e66f92d1ae
Modify demo_ci to support Windows, prepare for PR_Windows_Inference (#21873)
5 years ago
石晓伟 03479469a7
fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841)
5 years ago
lidanqing 9dff56e8e2
change qat_performance with mobilenet, change batch_size of qat2_resnet50 (#21895)
5 years ago
Michał Gallus 253e664275
Disable memory opt pass when DNNL is on (#21826)
5 years ago
Michał Gallus a5159d8480
Re-anble vgg and resnet101 models download (#21713)
6 years ago
石晓伟 2bb135825e
fix analysis_predictor when func is called multiple times, test=release/1.6 (#21665)
6 years ago
Chen Weihang 1fd1f06f11
Rename paddle throw error macro (#21657)
6 years ago
joanna.wozna.intel d419b859c0
Add reshape int8 mkldnn op (#21428)
6 years ago
Zhaolong Xing fbbd94a6ce
there is bug for inference using auto grwoth allocator (#21621)
6 years ago
Adam e81f0228df
MKL-DNN 1.0 Update (#20162)
6 years ago
rensilin 7f5d532a9c
fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461)
6 years ago
lidanqing fbf9eca0d3
QAT Int8 document (#21360)
6 years ago
Pei Yang 20d61414b4
fix glog warning, test=develop (#21573)
6 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
6 years ago
Zhaolong Xing da7748c53d
add conv, depthwise_conv, pooling (#20966)
6 years ago
GaoWei8 250a192181
Add ernie large c++ inference test (#21365)
6 years ago
Zhaolong Xing b39c011637
specify the auto growth allocator for inference. (#21448)
6 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
6 years ago
Lv Mengsi 37f3e56dea
Fix transpose conv (#21406)
6 years ago
lidanqing c0aa13672e
Fp32 vs int8 qat C++ performance (#21244)
6 years ago
Zhaolong Xing d1a6e112e6
fix C++ multicard inference bug. (#20955)
6 years ago
Michał Gallus 5d7d548275
INT8 Fully-connected (#17641)
6 years ago
GaoWei8 234060f88f
Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
6 years ago
silingtong123 45c1e7bb7b
add prediction demo and script on windows (#21248)
6 years ago
zhouwei25 345b67b5e2
remove warning LNK4006 and warning LNK4221 (#21226)
6 years ago
liu zhengxi 3cb6c0a059
Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240)
6 years ago
Pei Yang 2e2f92a5b1
fix trt weight bug (#21231)
6 years ago
zhouwei25 c0dcb090a3
Determine whether to copy and link inference lib by ON_INFER (#20931)
6 years ago
Zhaolong Xing 65f7052554
TRT int8: refine trt int8 for dynamic range set (#21112)
6 years ago
GaoWei8 a9d4eed3a8
fix cmake fails on inference_download_and_uncompress (#21185)
6 years ago
Adam d74ea0855f
Add relative error measure when (value > 1) (#21144)
6 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
6 years ago
joanna.wozna.intel 77c2083586
Add transpose2 INT8 for mkl-dnn (#19424)
6 years ago
GaoWei8 829bf871d7
Add ernie c++ inference test (#21015)
6 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
6 years ago
石晓伟 e742760f8e
optimize version error, test=develop (#20715)
6 years ago
bingyanghuang fd49ebcbd8
update int8 benchmark with 6271 data, test=develop test=document_fix (#20736)
6 years ago
石晓伟 d8f4f4239d
Ensure backward compatibility with the anakin interface, test=develop (#20691)
6 years ago
liu zhengxi d39777fefa
alter the capi of PD_PredictorRun to provide proper function, test=develop (#20697)
6 years ago
liu zhengxi dbc2bb3376
improve the performance of capi in PD_PredictorRun (#20665)
6 years ago
lidanqing 57b656f956
Add document for int8 object detection quantization (#19356)
6 years ago
liu zhengxi 922d432477
fix the PD_ZeroCopyPredictorRun output problem (#20612)
6 years ago
bingyanghuang 85e1f2150b
Modify the helper information in full_pascalvoc_test_preprocess.py (#20475)
6 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
6 years ago
zhaoyuchen2018 b8333edef6
Add Multihead matmul fuse pass (#20167)
6 years ago
Adam 7faa3e9555
Add ConvTranspose + BatchNorm fuse pass (#20161)
6 years ago
liu zhengxi 53d8799bee
remove incorrect new in c style, test=develop (#20370)
6 years ago
石晓伟 2c28e3283a
fix analysis_predictor ci, test=release/1.6 (#20141)
6 years ago
liu zhengxi acb02fd69e
add dll to inference capi (#20180)
6 years ago
liu zhengxi 301eeb5bea
Add capi for fluid inference api (#20092)
6 years ago
Wilber 276b5e3440
fix compile paddle with anakin bug
6 years ago
石晓伟 01b9d07963
update operator compatible info, test=develop (#19978)
6 years ago
Zhaolong Xing e89b12884a
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)
6 years ago
Aurelius84 99a9615a4b
Removing length dims constraints of seq_pad and seq_unpad (#19497)
6 years ago
pawelpiotrowicz 2c5c636514
Add two extra flags for test_analyzer_int8_image_classification to disable fp32/int8 (#19840)
6 years ago
Pei Yang baccd7e2ca
Add TRT input shape check between model and runtime (#19864)
6 years ago
Pei Yang 74812d1c90
Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825)
6 years ago
石晓伟 d004a0f50e
fix multi-thread exec of trt, test=develop (#19338)
6 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
石晓伟 71b2ed61bc
support MLU nums, test=develop (#19372)
6 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
6 years ago
Zhaolong Xing 110be57c1b
fix memory optimization type (#19781)
6 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
Tao Luo f05d2c519d
paddle::framework::vectorize() templatization [PART3] (#19643)
6 years ago
Tao Luo 3ae939e48a
unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631)
6 years ago
baojun a3a4b6e570
Enable ngraph through build_strategy (#19266)
6 years ago
Yiqun Liu c5548178b0
A a pass to enable the use of cudnn (#19346)
6 years ago
liuwei1031 d6cb1a4122
add dynamic C runtime support on windows, test=develop (#19502)
6 years ago
Yiqun Liu fcec365d29
Add a pass to replace dropout_op with scale_op when is_test is true (#19297)
6 years ago
lidanqing 9240e5325c
add local user data conversion into full_pascalvoc_test_preprocess.py (#19283)
6 years ago
Adam 97d1db1874
Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
Zeng Jinle 5b6673c44d
merge develop to solve conflict, also fix API doc, test=develop (#18823)
6 years ago
lidanqing 07a4d8f8d6
Fix mAP problem in unit test of int8 object detection test (#18946)
6 years ago
Adam b837689e97
Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
wopeizl 80b7ef6fc8
add tensorrt support for windows (#19084)
6 years ago
Tao Luo 741ce8bb1a
inference_shared_library support profile (#16275)
6 years ago
mapingshuo 4ad7c9d5a7
[WIP] Add Imdb train demo (#18895)
6 years ago
silingtong123 fd3b666d8c
test=develop,Synchronize the contents of develop with release1.5 (#18937)
6 years ago
Zhaolong Xing 3816d221ff
Fix the CE error which caused by paddle-trt version (#18941)
6 years ago
石晓伟 ee2f296ef8
Fusion: seqpool_cvm_concat (#18471)
6 years ago
liuwei1031 0d99690809
fix several security bugs reported by security team (#18831)
6 years ago
Zhaolong Xing 61238d31f7
Trt fp16 support (#18860)
6 years ago
Leo Zhao 10eeed93d1
Revert "use static variable to do cache instead of thread local in thread frequent switching case (#18428)" (#18879)
6 years ago
Huihuang Zheng cfce4994cf
Merge cuda 9/10 dockerfile with root dockerfile (#18693)
6 years ago
Zhaolong Xing 26ae6d49e4
Update trt5 for paddle-trt (#18645)
6 years ago
guru4elephant d714bf037c
remove async executor and add data_feed.proto to the deps of train demo (#18659)
6 years ago
石晓伟 25d8079140
Fix Bitmain Predictor::Clone() (#18599)
6 years ago
Tao Luo 076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580)
6 years ago
Jiabin Yang 667f88f9a6
Fix/gcc 4.8 ubt link error (#18558)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
石晓伟 1529154821
Support Bitmain Anakin (#18542)
6 years ago
Leo Zhao ce38bb5341
use static variable to do cache instead of thread local in thread frequent switching case (#18428)
6 years ago
Tao Luo fe32879d2a
add mkldnn shapeblob cache clear strategy (#18513)
6 years ago
bingyanghuang 3fe6bf5ee6
fix command line bug in int8v2 readme (#18507)
6 years ago
石晓伟 047bba855b
Remove the obsolete cmake options (#18481)
6 years ago
Tao Luo d234aa02cd
add transfer_scope_cache unit-test (#18467)
6 years ago
Tao Luo 3123d18787
remove unused AnalysisPredictor::SetMkldnnThreadID() (#18444)
6 years ago
Michał Gallus 7023a86c3a
Fix Pooling output scale (#18186)
6 years ago
Michał Gallus 8409693272
Reset DeviceContext after quantization warmup (#18182)
6 years ago
Sylwester Fraczek 9252e8fa08
add int8 mkldnn prior_box (#17242)
6 years ago
lidanqing 5fd68ac154
some fixes for int8 mobilenet_ssd tester (#18112)
6 years ago
wopeizl daa32d5383
fix package generation for inference test=develop (#18220)
6 years ago
翟飞跃 de42fe8fd5
Change int8v2 CAPI unit test name and add log in the prediction stage (#18200)
6 years ago
翟飞跃 802ea50956
fix spelling errors (#17941)
6 years ago
翟飞跃 78441c5449
add mkldnn Int8v2 slim doc (#17909)
6 years ago
Wojciech Uss ca5642c850
unify FP32 vs. INT8 comparison tests output (#18111)
6 years ago
Wojciech Uss c26130f3a9
reuse C-API INT8 unit test application (#18077)
6 years ago
lidanqing 466254151a
add Mobilienet ssd int8 analyzer tester (#18075)
6 years ago
石晓伟 42f12a4aca
fix ci test cmake test=develop (#18060)
6 years ago
Michał Gallus 8462e2b805
Disable MKLDNN FC in Resnet50 test (#18030)
6 years ago
石晓伟 04ea7cb069
modify the access level of anakin engine (#18015)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago
Zhaolong Xing 4e8d5a034f
Light mem reuse strategy for inference. (#17925)
6 years ago
mozga-intel c1379bf238
[NGraph] Bert model for a capi, ngraph's support test=develop (#17844)
6 years ago
石晓伟 d008260fa8
update the initialization of anakin subgraph (#17880)
6 years ago
Zhaolong Xing ae576f3c68
fix: when use the load model from memory mode, the RAM occupy is high (#17788)
6 years ago
翟飞跃 993c703bcc
INT8 MKL-DNN v2 integrate to slim (#17634)
6 years ago
Tao Luo e089e454a1
make omp thread num default 1 after inference run (#17801)
6 years ago
Tao Luo b4b169467b
add fc_mkldnn_pass in compare_mkldnn (#17712)
6 years ago
Zhaolong Xing 4337009b92
fix trt ci timeout error (#17701)
6 years ago
mozga-intel 5eb81fe595
Capi for a ngraph engine (#17037)
6 years ago
lidanqing 04b6c29ee0
Improve mobilenetv2 INT8 performance by using INT8 relu as post-op (#17570)
6 years ago
Jacek Czaja 6d8075ecef
[MKL-DNN] conv_transpose mkldnn bias pass (#17644)
6 years ago
Sylwester Fraczek 96845d2168
add Concat quantization (#17448)
6 years ago
Zhen Wang 8bd651b7ed
Fix the bug in the AnalysisPredictor and add more directions about io APIs. (#17639)
6 years ago
Zeng Jinle 4aa931dd85
Code clean of Allocator (#17602)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
Michał Gallus 0c39b97b4e
[MKL-DNN] Add Fully Connected Op for inference only(#15226)
6 years ago
Sylwester Fraczek 5b2a3c4b12
Conv concat relu quantization (#17466)
6 years ago
Sylwester Fraczek bccb0ba49a
fix quantize_squash_pass segfault when no tensor linked to Bias (#17292)
6 years ago
Zhaolong Xing 38da103034
fix trt ci bug temporary. (#17565)
6 years ago
lijianshe02 daf88968e2
fix bug that saved optimal model path in test_analyzer_save_model con… (#17555)
6 years ago
guomingz 2281ebf0f3
Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130)
6 years ago
Tao Luo 3d19f44a89
remove unused SERIAL compiler option (#17500)
6 years ago
lidanqing 36757ed203
Enabling resnet101, vgg16, vgg19 INT8v2 model tests (#17468)
6 years ago
liuwei1031 ba70cc499e
fix security bugs : (#17464)
6 years ago