1180 Commits (4f81d1bd5f9be4a564cfc8bd7e2a2bc3b21c24d1)

Author SHA1 Message Date
Pei Yang 20d61414b4
fix glog warning, test=develop (#21573)
6 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
6 years ago
Zhaolong Xing da7748c53d
add conv, depthwise_conv, pooling (#20966)
6 years ago
GaoWei8 250a192181
Add ernie large c++ inference test (#21365)
6 years ago
Zhaolong Xing b39c011637
specify the auto growth allocator for inference. (#21448)
6 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
6 years ago
Lv Mengsi 37f3e56dea
Fix transpose conv (#21406)
6 years ago
lidanqing c0aa13672e
Fp32 vs int8 qat C++ performance (#21244)
6 years ago
Zhaolong Xing d1a6e112e6
fix C++ multicard inference bug. (#20955)
6 years ago
Michał Gallus 5d7d548275
INT8 Fully-connected (#17641)
6 years ago
GaoWei8 234060f88f
Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
6 years ago
silingtong123 45c1e7bb7b
add prediction demo and script on windows (#21248)
6 years ago
zhouwei25 345b67b5e2
remove warning LNK4006 and warning LNK4221 (#21226)
6 years ago
liu zhengxi 3cb6c0a059
Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240)
6 years ago
Pei Yang 2e2f92a5b1
fix trt weight bug (#21231)
6 years ago
zhouwei25 c0dcb090a3
Determine whether to copy and link inference lib by ON_INFER (#20931)
6 years ago
Zhaolong Xing 65f7052554
TRT int8: refine trt int8 for dynamic range set (#21112)
6 years ago
GaoWei8 a9d4eed3a8
fix cmake fails on inference_download_and_uncompress (#21185)
6 years ago
Adam d74ea0855f
Add relative error measure when (value > 1) (#21144)
6 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
6 years ago
joanna.wozna.intel 77c2083586
Add transpose2 INT8 for mkl-dnn (#19424)
6 years ago
GaoWei8 829bf871d7
Add ernie c++ inference test (#21015)
6 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
6 years ago
石晓伟 e742760f8e
optimize version error, test=develop (#20715)
6 years ago
bingyanghuang fd49ebcbd8
update int8 benchmark with 6271 data, test=develop test=document_fix (#20736)
6 years ago
石晓伟 d8f4f4239d
Ensure backward compatibility with the anakin interface, test=develop (#20691)
6 years ago
liu zhengxi d39777fefa
alter the capi of PD_PredictorRun to provide proper function, test=develop (#20697)
6 years ago
liu zhengxi dbc2bb3376
improve the performance of capi in PD_PredictorRun (#20665)
6 years ago
lidanqing 57b656f956
Add document for int8 object detection quantization (#19356)
6 years ago
liu zhengxi 922d432477
fix the PD_ZeroCopyPredictorRun output problem (#20612)
6 years ago
bingyanghuang 85e1f2150b
Modify the helper information in full_pascalvoc_test_preprocess.py (#20475)
6 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
6 years ago
zhaoyuchen2018 b8333edef6
Add Multihead matmul fuse pass (#20167)
6 years ago
Adam 7faa3e9555
Add ConvTranspose + BatchNorm fuse pass (#20161)
6 years ago
liu zhengxi 53d8799bee
remove incorrect new in c style, test=develop (#20370)
6 years ago
石晓伟 2c28e3283a
fix analysis_predictor ci, test=release/1.6 (#20141)
6 years ago
liu zhengxi acb02fd69e
add dll to inference capi (#20180)
6 years ago
liu zhengxi 301eeb5bea
Add capi for fluid inference api (#20092)
6 years ago
Wilber 276b5e3440
fix compile paddle with anakin bug
6 years ago
石晓伟 01b9d07963
update operator compatible info, test=develop (#19978)
6 years ago
Zhaolong Xing e89b12884a
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)
6 years ago
Aurelius84 99a9615a4b
Removing length dims constraints of seq_pad and seq_unpad (#19497)
6 years ago
pawelpiotrowicz 2c5c636514
Add two extra flags for test_analyzer_int8_image_classification to disable fp32/int8 (#19840)
6 years ago
Pei Yang baccd7e2ca
Add TRT input shape check between model and runtime (#19864)
6 years ago
Pei Yang 74812d1c90
Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825)
6 years ago
石晓伟 d004a0f50e
fix multi-thread exec of trt, test=develop (#19338)
6 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
石晓伟 71b2ed61bc
support MLU nums, test=develop (#19372)
6 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
6 years ago
Zhaolong Xing 110be57c1b
fix memory optimization type (#19781)
6 years ago