Commit Graph

62 Commits (b0edda4d99fdfa388dc6bb348c35d58ee4f819bc)

Author SHA1 Message Date
Wilber 39546aa2f3
Add pass compatible and unit test. (#27377)
5 years ago
joanna.wozna.intel 1483ea2304
Add bfloat16 passes (#26999)
5 years ago
joanna.wozna.intel 0627a319b0
Restore "Add mkldnn bfloat16 option to C-API " (#26882)
5 years ago
石晓伟 ced6e87eee
Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854)
5 years ago
Adam 8bcb1f29d9
Add conv+affine_channel fuse pass to MKLDNN pass strategy and fix it (#26779)
5 years ago
joanna.wozna.intel 02083bda40
Add mkldnn bfloat16 option to C-API (#26676)
5 years ago
Jacek Czaja 8b88cd5167
[oneDNN] Fix to inplace pass (#24442)
6 years ago
Jacek Czaja d0307145a3
[oneDNN] disabling oneDNN inplace pass (#24406)
6 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
6 years ago
arlesniak d31a174f51
added fusing matmul-transpose-reshape pass (#23866)
6 years ago
joanna.wozna.intel 12ba05ce0c
Add scale-matmul fuse pass (#23734)
6 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
6 years ago
Jacek Czaja 2bb1b0e89e
[DNNL] Added MKL-DNN inplace pass for C-API inference (#23315)
6 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
6 years ago
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
6 years ago
Zhaolong Xing 8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494)
6 years ago
Zhaolong Xing 8acd745c25
[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486)
6 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
6 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
6 years ago
Pei Yang 0a51098a71
Add TRT support for BERT (#21135)
6 years ago
Zhaolong Xing 65f7052554
TRT int8: refine trt int8 for dynamic range set (#21112)
6 years ago
zhaoyuchen2018 b8333edef6
Add Multihead matmul fuse pass (#20167)
6 years ago
Adam 7faa3e9555 Add ConvTranspose + BatchNorm fuse pass (#20161)
6 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
Yiqun Liu c5548178b0
A a pass to enable the use of cudnn (#19346)
6 years ago
Yiqun Liu fcec365d29
Add a pass to replace dropout_op with scale_op when is_test is true (#19297)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
石晓伟 ee2f296ef8
Fusion: seqpool_cvm_concat (#18471)
6 years ago
Zhaolong Xing 26ae6d49e4
Update trt5 for paddle-trt (#18645)
6 years ago
Zhaolong Xing ae576f3c68
fix: when use the load model from memory mode, the RAM occupy is high (#17788)
7 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
7 years ago
Jacek Czaja 6d8075ecef [MKL-DNN] conv_transpose mkldnn bias pass (#17644)
7 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
7 years ago
Michał Gallus 0c39b97b4e [MKL-DNN] Add Fully Connected Op for inference only(#15226)
7 years ago
Sylwester Fraczek 5b2a3c4b12 Conv concat relu quantization (#17466)
7 years ago
guomingz 2281ebf0f3 Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130)
7 years ago
Tao Luo 32da5e9c3d
remove unused expected_kernel_cache_pass (#17486)
7 years ago
石晓伟 a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 (#17156)
7 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
7 years ago
Tao Luo aa7b975bf6 disable runtime_context_cache pass by default
7 years ago
tensor-tang d6c1b5a73b disable seqpool concat pass by default saving CI time
7 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
7 years ago
luotao1 bd636a9ea6 test_analyzer_int8 tests use default pass order
7 years ago
Yan Chunwei 044ae2497d
fix identity temporarily (#15942)
7 years ago
Wojciech Uss ec2750b3c2 fix repeating passes (#16606)
7 years ago
nhzlx d065b5bf2b Anakin ssd support
7 years ago
Wojciech Uss 09dfc7a2aa
C-API quantization core 2 (#16396)
7 years ago
nhzlx 953bdde058 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
7 years ago