Commit Graph

291 Commits (172d4ecb6c8fa8eceb0bf64cbf97e07e684a28e8)

Author SHA1 Message Date
cc 5c8e79956e
Use the specificed scope in post quant, test=develop (#25384)
6 years ago
cc 22720a1535
Fix post quant save bug, test=develop (#25370)
6 years ago
Wojciech Uss d0a921ba98
Quant2 updates and fixes (#25313)
6 years ago
cc d8f4714bc1
[Quantization] Save output threshold by argname_index (#25272)
6 years ago
Wojciech Uss 23a4f54b73
rename qat into quant (#24948)
6 years ago
Wojciech Uss 56fa3880e3
rename qat into quant in filenames only (#25194)
6 years ago
iducn f282599229
disable unitest for gcc8(#25134)
6 years ago
cc 8fc31d501b
Support conv2d_traspose quantize, test=develop (#25084)
6 years ago
Liufang Sang b174b99764
support user defined quantization func and preprocess (#24720)
6 years ago
cc 75eec3d1f6
Post training quantization supports optimize model by fusing (#24822)
6 years ago
Wojciech Uss 78d4f0cc91
add option to exclude ops by id from quantization (#24689)
6 years ago
cc dbcd7c69e9
Update sigmoid output from Y to out, test=develop (#24765)
6 years ago
cc 88e9d74a75
Collecting concat output threshold, test=develop (#24742)
6 years ago
ShenLiang 950892044f
fix conflict, test=develop (#24238)
6 years ago
cc 6c89ca2157
Add output threshold for ops that have several output activations, test=develop (#24726)
6 years ago
lidanqing 8ef3c02e90
Update DNNL QAT document 2.0-alpha (#24494)
6 years ago
cc 4d35112255
[Fix bug] Init scale node in OutScaleForTrainingPass and enable test_quantization_scale_pass UT (#24393)
6 years ago
joanna.wozna.intel 53125c2f6f
Model converter to dot file (#23169)
6 years ago
Wojciech Uss db052009c7
Enabled quantize all and skip missing in QAT (#24281)
6 years ago
Leo Chen 381492fca3
add try finally, test=develop (#24243)
6 years ago
lidanqing 61ec30f030
Update QAT INT8 2.0 doc (#24127)
6 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
6 years ago
ShenLiang 0fb9b208ab
Add batch_fc op in contrib (#24017)
6 years ago
arlesniak d31a174f51
added fusing matmul-transpose-reshape pass (#23866)
6 years ago
Wojciech Uss 3d744162dd
QAT: support for new models (#23928)
6 years ago
zhangchunle 6bd200db66
remove high level api (#23854)
6 years ago
ShenLiang 30bd7e1c83
Add rank_attention_op attributes for GPU memory in contrib (#23915)
6 years ago
cc 40aa14ec77
Weight quantization support channel_wise_abs_max method to achieve higher accuracy (#23629)
6 years ago
mapingshuo f0e743f136
fix AMP and recompute (#23551)
6 years ago
joanna.wozna.intel 12ba05ce0c
Add scale-matmul fuse pass (#23734)
6 years ago
Wojciech Uss 2383a9f7ee
[Doc update] Update for QAT INT8 MKL-DNN document (#23361)
6 years ago
Chengmo 8c0bdde934
Add Tdm sampler op in Contrib (#23290)
6 years ago
Wojciech Uss 1753860dd0
Enable matmul and cleanup in QAT2 (#23657)
6 years ago
silingtong123 cec234b1aa
test=develop, error message of tree_conv OP enhancement (#23574)
6 years ago
cc 25628587f1
Collect output scale for quantized op and fused op (#23369)
6 years ago
Bai Yifan 9bc223c8a2
fix test_graph_wrapper failure on cudnnv7, test=develop (#23451)
6 years ago
ShenLiang c706ff20a3
fix conflict, test=develop (#23298)
6 years ago
Chengmo a2e9af5663
Add Tdm child OP in contrib (#23241)
6 years ago
cc 3ea7c59f76
Set fuse_all_reduce_ops=false for quantization test, test=develop (#23413)
6 years ago
cc 7c55a94de5
Disable test_quantization_scale_pass unittest for random error, test=develop (#23441)
6 years ago
Yiqun Liu bc2981e998
Disable test_code_generator and test_post_training_quantization_mobilenetv1 (#23440)
6 years ago
Wojciech Uss 9fd9067455
handle conv2d activations in older QAT models (#23202)
6 years ago
Wojciech Uss be2ac9cc3a
separated QAT1 and QAT2 (#23284)
6 years ago
lidanqing c524b930e7
Update QAT INT8 related code (#23104)
6 years ago
Wojciech Uss f836c8aa8f
add check for scales and a message (#23119)
6 years ago
cc bd80903333
Add activation_type in AddQuantDequantPass to be compatible with paddleslim, test=develop (#23221)
6 years ago
cc 589cd8782f
Post_training_quantizaion supports min_max methon (#23078)
6 years ago
lidanqing 432a4b2789
Changes QAT MKL-DNN documents (#22840)
6 years ago
cc b6717faf80
Added an option to use external FP32 model in QAT comparison test (#22858)
6 years ago
Sylwester Fraczek 5ff2439f51
fixed save qat2 model resnet50 and ernie (#22822)
6 years ago