Commit Graph

1325 Commits (1c898b66d6c668048ab77ee33b2457687b8b36be)

Author SHA1 Message Date
Pei Yang 181b1f5a30
adjust minimum trt version for hard_sigmoid converter to 5130. test=develop (#24746)
5 years ago
zlsh80826 fdbe114b12
[Paddle-TRT] use float constant instead of double test=develop (#24544)
5 years ago
Zhaolong Xing f68d4fb3f1
fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All (#24517)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
Jacek Czaja 8b88cd5167
[oneDNN] Fix to inplace pass (#24442)
5 years ago
Jacek Czaja d0307145a3
[oneDNN] disabling oneDNN inplace pass (#24406)
5 years ago
Tao Luo 72c370c8d2
remove unused test_multi_thread_helper.h (#24399)
5 years ago
Tao Luo 63da846de0
remove old inference C++ tests (#24368)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Tao Luo c1df7048c7
add UT for mkldnn_cache_capacity (#24336)
5 years ago
Tao Luo 9eedf05d2f
solve mklml memory leak on windows (#24015)
5 years ago
石晓伟 17ac6e2580
update the analysis predictor for multi-stream support, test=develop (#24046)
5 years ago
lidanqing 61ec30f030
Update QAT INT8 2.0 doc (#24127)
5 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
5 years ago
arlesniak d31a174f51
added fusing matmul-transpose-reshape pass (#23866)
5 years ago
Pei Yang 695a53c874
remove conv_bn_fuse_pass and fc_fuse_pass in trt int8 calibration. test=develop (#23805)
5 years ago
Zhaolong Xing 35148d17f7
[BUG]: Head number can only be > 1 on multihead op (#23974)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Zhaolong Xing 133f1fc123
[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723)
5 years ago
guofei 2b896c1f6b
Support LoDTensorArray in fetch (#23645)
5 years ago
lidanqing 2291634c5c
Add user local data preprocess support (#23692)
5 years ago
chenhaoze b7d185d6ca
OP clip, merge_lod_tensor, convert/elementwise error message enhancement (#23742)
5 years ago
Pei Yang c528f1d4f3
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23672)
5 years ago
Pei Yang 015acdbfb1
Refine error message of leaky_relu, tensorrt_engine, split, prelu op converter (#23661)
5 years ago
joanna.wozna.intel 12ba05ce0c
Add scale-matmul fuse pass (#23734)
5 years ago
Zhaolong Xing 3acb047a20
[Paddle-TRT]: add eltwise,pool2d, prelu, scale, concat, gelu dynamic shape support (#23396)
5 years ago
chenhaoze 9b06dd8628
Add three passes and api reference of paddle_pass_builder. test=develop (#23741)
5 years ago
Zhaolong Xing ed5766ffbc
refine act conv2d pool2d trt converter log (#23605)
5 years ago
Pei Yang 28f04c6a5e
refine shuffle channel errmsg, test=develop (#23520)
5 years ago
Tao Luo e4f1b1c5e1
solve mklml memory leak (#23557)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Pei Yang 3d5d217030
Revert "[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)", test=develop (#23642)
5 years ago
Pei Yang eb11633611
batch_norm trt converter error message, test=develop (#23620)
5 years ago
joanna.wozna.intel ce08fdcf2b
Add support for INT8 matmul in C-API quantization (#23463)
5 years ago
Pei Yang cdc6d4e292
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)
5 years ago
Pei Yang 42655ef721
Add full_like op. (#23364)
5 years ago
石晓伟 36b82eae0e
refine the doc of paddle_api.h, test=develop (#23402)
5 years ago
Zhaolong Xing 6a23850a3f
add init value to varis in analysis config. (#23442)
5 years ago
Zhaolong Xing 70782e6379
[Inference doc]: refine paddle_api.h doc (#23354)
5 years ago
Pei Yang 7e439780d9
add full paddle_analysis_config.h APIs. (#23215)
5 years ago
Zhaolong Xing 1a6ce8b910
add swish split gelu plugin dynamic support (#23305)
5 years ago
Jacek Czaja 2bb1b0e89e
[DNNL] Added MKL-DNN inplace pass for C-API inference (#23315)
5 years ago
石晓伟 708ded584e
pause the io_utils_test of int64 and resume after repair, test=develop (#23234)
5 years ago
Wilber 0129f4b568
Add some inference API comments for AnalysisPredictor (#23242)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Pei Yang 46b8d282dc
Add some inference API comments for AnalysisConfig (#23117)
5 years ago
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
5 years ago
Pei Yang 24db750386
fix trt int8 calib precision bug. test=develop (#23036)
5 years ago
Wilber db40ee86db
fix unittets. test=develop (#23018)
5 years ago
Zhang Ting 137d6563fc
add check for assigned data, test=develop (#22960)
5 years ago
Zhaolong Xing 8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
Michał Gallus 0038bfbd1d
Prevent loading of warmup data in analyzer_int8 if enable_int8 is set to false (#22857)
5 years ago
石晓伟 1861ca88f1
serialize the PaddleTensor, test=develop (#22810)
5 years ago
石晓伟 ddb9b46fec
change the function in op_teller, test=develop (#22794)
5 years ago
liu zhengxi 324f2b3922
Fix inference c api PD_GetZeroCopyOutput lod (#22768)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
liu zhengxi 71ab0458e1
Fix pointer and c-api encapsulation (#22663)
5 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
flame d97475d53b
fix CPU C inference API compile bug (#22702)
5 years ago
flame 74eb82de19
fix go api bug (#22669)
5 years ago
flame f7eafca828
remove python inference warning (#22602)
5 years ago
flame 1d503e6a9e
Golang inference API (#22503)
5 years ago
Zhaolong Xing 8acd745c25
[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486)
5 years ago
Wojciech Uss 4cddb43c5c
Add support for Ernie NLP model to the Slim QAT (#22506)
5 years ago
Zhaolong Xing 54a325a52f
[Refine Paddle-TRT INT8]: Support PaddleSlim's Resnet50, Mobilenetv1, Yolov3 models for Inference. (#22483)
5 years ago
Zhaolong Xing ceda0b9b1a
[Fix BUG]: Core when multi thread + clone + paddle-trt (#22442)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
joanna.wozna.intel 3099d9d47c Restore requantize squash (#22399)
5 years ago
石晓伟 8cb04664b9
revert paddle_fluid.map, test=develop (#22236)
5 years ago
liu zhengxi 07afc29e90
Make api.cc malloc consistent with paddle_api.h for PaddleBuf (#22255)
5 years ago
silingtong123 4f1da4adcb remove the useless third_party library from C++ inference library (#22021)
5 years ago
zhouwei25 549e6de7ac faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
5 years ago
Wilber 1230c110cb
[fluid-lite] adjust to relative error (#22232)
5 years ago
Wojciech Uss 2e90c4eb0a improve mkldnn_quantizer_config test code coverage (#22216)
5 years ago
Wilber 5750152e80
support fluid-lite subgraph run resnet test=develop (#22191)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
Pei Yang d8a9b134e3
fix trt instance_norm serialize bug. test=develop (#22152)
5 years ago
Yiqun Liu b1401fb74d Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#22094)
5 years ago
Pei Yang 50bee83f71
add TRT support for instance_norm op (#21928)
5 years ago
Pei Yang 0a51098a71
Add TRT support for BERT (#21135)
5 years ago
Michał Gallus 6192108408 [DNNL] 3D Fully-Connected (#21746)
5 years ago
zhouwei25 e66f92d1ae Modify demo_ci to support Windows, prepare for PR_Windows_Inference (#21873)
5 years ago
石晓伟 03479469a7
fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841)
5 years ago
lidanqing 9dff56e8e2 change qat_performance with mobilenet, change batch_size of qat2_resnet50 (#21895)
5 years ago
Michał Gallus 253e664275 Disable memory opt pass when DNNL is on (#21826)
5 years ago
Michał Gallus a5159d8480 Re-anble vgg and resnet101 models download (#21713)
5 years ago
石晓伟 2bb135825e
fix analysis_predictor when func is called multiple times, test=release/1.6 (#21665)
5 years ago
Chen Weihang 1fd1f06f11 Rename paddle throw error macro (#21657)
5 years ago
joanna.wozna.intel d419b859c0 Add reshape int8 mkldnn op (#21428)
5 years ago
Zhaolong Xing fbbd94a6ce
there is bug for inference using auto grwoth allocator (#21621)
5 years ago
Adam e81f0228df MKL-DNN 1.0 Update (#20162)
5 years ago
rensilin 7f5d532a9c fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461)
5 years ago
lidanqing fbf9eca0d3 QAT Int8 document (#21360)
5 years ago
Pei Yang 20d61414b4
fix glog warning, test=develop (#21573)
5 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
5 years ago
Zhaolong Xing da7748c53d
add conv, depthwise_conv, pooling (#20966)
5 years ago
GaoWei8 250a192181 Add ernie large c++ inference test (#21365)
5 years ago
Zhaolong Xing b39c011637
specify the auto growth allocator for inference. (#21448)
5 years ago