Commit Graph

1208 Commits (ceda0b9b1a180e507dc9335acbe5215bb4558d1f)

Author SHA1 Message Date
Zhaolong Xing ceda0b9b1a [Fix BUG]: Core when multi thread + clone + paddle-trt (#22442)
5 years ago
石晓伟 e1b0d7cbb1 remove anakin from code, test=develop (#22420)
5 years ago
joanna.wozna.intel 3099d9d47c Restore requantize squash (#22399)
5 years ago
石晓伟 8cb04664b9 revert paddle_fluid.map, test=develop (#22236)
5 years ago
liu zhengxi 07afc29e90 Make api.cc malloc consistent with paddle_api.h for PaddleBuf (#22255)
5 years ago
silingtong123 4f1da4adcb remove the useless third_party library from C++ inference library (#22021)
5 years ago
zhouwei25 549e6de7ac faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
5 years ago
Wilber 1230c110cb [fluid-lite] adjust to relative error (#22232)
5 years ago
Wojciech Uss 2e90c4eb0a improve mkldnn_quantizer_config test code coverage (#22216)
5 years ago
Wilber 5750152e80 support fluid-lite subgraph run resnet test=develop (#22191)
5 years ago
石晓伟 ad0dfb17c1 [Feature] Lite subgraph (#22114)
5 years ago
Pei Yang d8a9b134e3 fix trt instance_norm serialize bug. test=develop (#22152)
5 years ago
Yiqun Liu b1401fb74d Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#22094)
5 years ago
Pei Yang 50bee83f71 add TRT support for instance_norm op (#21928)
5 years ago
Pei Yang 0a51098a71 Add TRT support for BERT (#21135)
5 years ago
Michał Gallus 6192108408 [DNNL] 3D Fully-Connected (#21746)
5 years ago
zhouwei25 e66f92d1ae Modify demo_ci to support Windows, prepare for PR_Windows_Inference (#21873)
5 years ago
石晓伟 03479469a7 fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841)
5 years ago
lidanqing 9dff56e8e2 change qat_performance with mobilenet, change batch_size of qat2_resnet50 (#21895)
5 years ago
Michał Gallus 253e664275 Disable memory opt pass when DNNL is on (#21826)
5 years ago
Michał Gallus a5159d8480 Re-anble vgg and resnet101 models download (#21713)
6 years ago
石晓伟 2bb135825e fix analysis_predictor when func is called multiple times, test=release/1.6 (#21665)
6 years ago
Chen Weihang 1fd1f06f11 Rename paddle throw error macro (#21657)
6 years ago
joanna.wozna.intel d419b859c0 Add reshape int8 mkldnn op (#21428)
6 years ago
Zhaolong Xing fbbd94a6ce there is bug for inference using auto grwoth allocator (#21621)
6 years ago
Adam e81f0228df MKL-DNN 1.0 Update (#20162)
6 years ago
rensilin 7f5d532a9c fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461)
6 years ago
lidanqing fbf9eca0d3 QAT Int8 document (#21360)
6 years ago
Pei Yang 20d61414b4 fix glog warning, test=develop (#21573)
6 years ago
Pei Yang 122b37ce62 make config option DisableGlogInfo() able to mute all inference logs (#21318)
6 years ago
Zhaolong Xing da7748c53d add conv, depthwise_conv, pooling (#20966)
6 years ago
GaoWei8 250a192181 Add ernie large c++ inference test (#21365)
6 years ago
Zhaolong Xing b39c011637 specify the auto growth allocator for inference. (#21448)
6 years ago
Tao Luo 01fa4ead61 fix -Wno-error=sign-compare warning in gcc8 (#21434)
6 years ago
Lv Mengsi 37f3e56dea Fix transpose conv (#21406)
6 years ago
lidanqing c0aa13672e Fp32 vs int8 qat C++ performance (#21244)
6 years ago
Zhaolong Xing d1a6e112e6 fix C++ multicard inference bug. (#20955)
6 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
6 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
6 years ago
silingtong123 45c1e7bb7b add prediction demo and script on windows (#21248)
6 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
6 years ago
liu zhengxi 3cb6c0a059 Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240)
6 years ago
Pei Yang 2e2f92a5b1 fix trt weight bug (#21231)
6 years ago
zhouwei25 c0dcb090a3 Determine whether to copy and link inference lib by ON_INFER (#20931)
6 years ago
Zhaolong Xing 65f7052554 TRT int8: refine trt int8 for dynamic range set (#21112)
6 years ago
GaoWei8 a9d4eed3a8 fix cmake fails on inference_download_and_uncompress (#21185)
6 years ago
Adam d74ea0855f Add relative error measure when (value > 1) (#21144)
6 years ago
Chen Weihang 8414575b78 Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
6 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
6 years ago
GaoWei8 829bf871d7 Add ernie c++ inference test (#21015)
6 years ago