Commit Graph

94 Commits (4f066e316ef9deecaf17e6d61fdbf34e303f127e)

Author SHA1 Message Date
Adam Osewski 4f066e316e
Layer normalization fuse pass. (#30721)
4 years ago
alncat 7bbf3ac5ab
Added support for inference using quantization aware trained dygraph (#30288)
4 years ago
cc 6a0102b038
map matmul/squeeze2+matmul/reshape2+matmul to mul (#29911)
5 years ago
jakpiase edc06c6a1b
Added fc + activation fuse pass (currently only gelu, sigmoid and tanh are supported) (#29772)
5 years ago
Wojciech Uss 4fd4095d1b
Add quantization of multi_gru op and tests (#28615)
5 years ago
joanna.wozna.intel fddea67445
Fix cpu_bfloat16_pass (#28730)
5 years ago
Wojciech Uss 7b5a8e46de
Add multi_gru_fuse_pass and tests (#28601)
5 years ago
Wojciech Uss 991345b368
Add multi_gru_seq_fuse_pass and tests (#28604)
5 years ago
Zhang Ting fdc06f2158
add Fuse bn add act pass (#28196)
5 years ago
Adam Osewski 7db747d9e8
oneDNN BatchNorm + Act fusion pass. (#27912)
5 years ago
Wojciech Uss 966447e338
Added support for quantization of fusion_gru (#27518)
5 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
5 years ago
joanna.wozna.intel 1483ea2304
Add bfloat16 passes (#26999)
5 years ago
joanna.wozna.intel eb097d64f6
Fix int8 performace drop cpu_quantize_placement_pass (#26715)
5 years ago
Zhaolong Xing 7b7e605189
[Fix BUGs]: fix multhead matmul pass's instable bug (#25123)
5 years ago
Pei Yang b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
joanna.wozna.intel 356f5ee220
[Refactoring] Unify op-dequant squashes (#24277)
5 years ago
joanna.wozna.intel b43b46e619
[INT8] Add requant-op squash (#24143)
5 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
5 years ago
arlesniak d31a174f51
added fusing matmul-transpose-reshape pass (#23866)
5 years ago
Jacek Czaja c6c65c65c7
[DNNL] Added elementwise_add mkl-dnn inplace (#23477)
5 years ago
joanna.wozna.intel 12ba05ce0c
Add scale-matmul fuse pass (#23734)
5 years ago
joanna.wozna.intel 5ee099ca57
Op-requant squash (#23665)
5 years ago
joanna.wozna.intel 3cb5623dad
Add matmul dequant squash (#23505)
5 years ago
joanna.wozna.intel ce08fdcf2b
Add support for INT8 matmul in C-API quantization (#23463)
5 years ago
Jacek Czaja 2bb1b0e89e
[DNNL] Added MKL-DNN inplace pass for C-API inference (#23315)
5 years ago
joanna.wozna.intel 17f2c0899f
Add dequant-scale squash (#22409)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
Zhen Wang e40cfb1010
fix the bug of assert_is_op_output. test=develop (#22262)
5 years ago
Zhen Wang 46189b166d Add bn and relu fuse pass (#22048)
5 years ago
joanna.wozna.intel 5b2e98aa17 Add multiple quantize operators fuse (#22062)
5 years ago
Pei Yang 3e5008ad01
fix trt calib not working bug, test=develop (#21934)
6 years ago
lidanqing d3a96632fa Add fc-dequantize squash in cpu_quantize_squash_pass for ernie model (#21714)
6 years ago
joanna.wozna.intel d419b859c0 Add reshape int8 mkldnn op (#21428)
6 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
6 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
6 years ago
Adam 7faa3e9555 Add ConvTranspose + BatchNorm fuse pass (#20161)
6 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
6 years ago
joanna.wozna.intel 2e3ec66be0 Add conv dequant squash for int8 (#18905)
6 years ago
Adam 97d1db1874 Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
joanna.wozna.intel 492a00f53e Add conv reqantize squash (#18754)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
Sylwester Fraczek 9252e8fa08 add int8 mkldnn prior_box (#17242)
6 years ago
Jacek Czaja 6d8075ecef [MKL-DNN] conv_transpose mkldnn bias pass (#17644)
6 years ago
Sylwester Fraczek 96845d2168 add Concat quantization (#17448)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
Michał Gallus 0c39b97b4e [MKL-DNN] Add Fully Connected Op for inference only(#15226)
6 years ago
Sylwester Fraczek 5b2a3c4b12 Conv concat relu quantization (#17466)
6 years ago