Wojciech Uss
2579ade45f
Add cpu_quantize_pass for C-API quantization ( #16127 )
...
* Add cpu_quantize_pass for C-API quantization
test=develop
* add cpu_quantize_pass test
* fix lint: add include memory unorderd_map and unordered_set
test=develop
* fuse_relu 1
test=develop
* tuned 2 without squash
* fixes
test=develop
* remove unused vars
test=develop
* refactored
test=develop
* fix lint c-style cast -> C++ style cast
test=develop
* remove QuantMax and c style casts
test=develop
* last usage of QuantMax removed
test=develop
* Fix Analysis Predictor UT
Check if memory_optimize_pass has already been added
to the analysis config before adding a new one, so
that it is not added multiple times.
test=develop
* change map to unordered_map
fix the forgotten part of cpu_quantize_pass_tester.cc
test=develop
* removed quantized attribute
* fixed cpu_quantize_pass_tester and op attr comments
test=develop
* removed redundant line
test=debug
* removed gmock
test=develop
* fix after merge
6 years ago
Wojciech Uss
b9252f3df8
Add cpu_quantize_squash_pass for C-API quantization ( #16128 )
...
* Add cpu_quantize_squash_pass for C-API quantization
test=develop
* add cpu_quantize_squash_pass teste
* fix lint: add include memory unorderd_map and unordered_set
test=develop
* lint fix 2
* fixes
test=develop
* refactored
test=develop
* fix windows ci
test=develop
6 years ago
tensor-tang
e1c707fe9c
fix warnings ( #15790 )
...
* fix warnings
test=develop
* fix enforce test
test=develop
6 years ago
Yan Chunwei
dc5e25fc7f
remove dot marked node ( #15606 )
6 years ago
Zhaolong Xing
98e85f3735
add_transpose_flatten_concat_fuse ( #15121 )
6 years ago
Zhaolong Xing
4048cfa9da
Merge pull request #15048 from NHZlX/add_affine_channel_fuse
...
Add conv+ affine channel fuse pass
6 years ago
nhzlx
a6aa8ea771
faster rcnn input is presistable. (fix it in paddle-trt)
...
test=develop
6 years ago
hjchen2
956cf92145
Fix conv_elementwise_add2_act pass
...
test=develop
6 years ago
nhzlx
ce3782c193
add affine_channel fuse.
...
fix conv+elemenwise fuse bug.
6 years ago
nhzlx
4e4a777243
add conv+elementwiseadd pass
...
test=develop
6 years ago
Yan Chunwei
a985949be9
Fea/fuse conv elementwise add fuse ( #14669 )
6 years ago
Yihua Xu
240d974ac5
Clean Code
...
test=develop
6 years ago
Yihua Xu
64e261c6cd
Implement the fusion of convolution and bias for mkldnn
...
(test=develop)
6 years ago
minqiyang
53433d7f2e
Revert the changes of VLOG
...
test=develop
6 years ago
Tomasz Patejko
7423748e37
MKLDNN residual connections fuse pass:
...
* implements reachability check between identity node and non-identity argument to elementwise_add
* implements handling identity node as x and as y argument to elementwise_add
6 years ago
Tao Luo
6c32945556
Merge pull request #14372 from luotao1/speedup_analysis
...
speedup DetectPatterns
6 years ago
Tao Luo
668ae523d2
speedup DetectPatterns
...
test=develop
6 years ago
Yan Chunwei
9a6e239281
fix mac graph detector sort ( #14356 )
6 years ago
minqiyang
0c3227a523
Change the origin VLOG level to 10 times
...
Fix code to support cpplint syntax check
test=develop
6 years ago
qingqing01
db8c52da5e
Revert " Exhaustive search for cuDNN conv. ( #14043 )"
...
This reverts commit ce7d9b0799
.
7 years ago
qingqing01
ce7d9b0799
Exhaustive search for cuDNN conv. ( #14043 )
...
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Clean code
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
7 years ago
Yan Chunwei
f76fee644c
fix graph pattern detector ( #14186 )
7 years ago
Tao Luo
42aa1d409d
Merge pull request #13485 from tpatejko/tpatejko/capi-resnet-conv-elementwise-fusion
...
MKLDNN conv+elementwise_add fusion for residual connections in Resnet
7 years ago
Tomasz Patejko
ce2464fd98
MKLDNN conv + elementwise_add fusion: UT for missing bias added. UTs refactored. Some minor changes in the pass
7 years ago
Tomasz Patejko
4e72ab411e
MKLDNN conv + elementwise_add fusion: fix for crash when bias is not present
7 years ago
Tomasz Patejko
0fe3079c46
MKLDNN conv + elementwise_add fusion: fix for order of parameters in elementwise_add in resnet50
...
test=develop
7 years ago
Tomasz Patejko
8fb29b2ca9
MKLDNN conv + elementwise_add fusion: new nodes marked as input or output
...
test=develop
7 years ago
Tomasz Patejko
9a335e0277
MKLDNN conv + elementwise_add fusion: changed a name of a formal argument in ElementwiseAdd pattern
7 years ago
Tomasz Patejko
efd76614fb
MKLDNN conv + elementwise_add fusion: implementation changed to conform with Paddle API
7 years ago
tensor-tang
40f8456a4f
refine fuse pattern and attr
...
test=develop
7 years ago
tensor-tang
cbbacb2534
Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
...
test=develop
7 years ago
tensor-tang
603ba5e01d
add seqconv eltadd relu pass
7 years ago
Michal Gallus
d7509d63f1
Conv+Bias: Support non-null bias
...
test=develop
7 years ago
Michal Gallus
582f59c190
Conv+Bias fuse
7 years ago
Wojciech Uss
55fd136ab0
Added comment with request for enhancement
...
This adds a `TODO` comment according to https://github.com/PaddlePaddle/Paddle/issues/13550#issuecomment-430133585
test=develop
7 years ago
Tao Luo
9b11a17502
Revert "[MKLDNN] Pass: Fuse Conv + Bias"
7 years ago
Tao Luo
ce248a15d9
Merge pull request #13368 from Sand3r-/mgallus/conv-bias-pass
...
[MKLDNN] Pass: Fuse Conv + Bias
7 years ago
Michal Gallus
40b17be4b0
Pass: Fuse Conv + Bias
...
test=develop
7 years ago
Sylwester Fraczek
78f98294c2
conv bn fuse pass
...
review fix
review from hshen14 fix
test=develop
fix error in broadcast and code cleanup
rename bias -> eltwise and added macro to shorten code
formatting
7 years ago
Jacek Czaja
7ab5626dee
- Added initial pass for embedding-fc-lstm
...
- Added draft of new operator
- Added fused embedding fc lstm files
- First time embedding_fc_lstm_fuse_pass was invoked in
test_text_classification
- Added Embedding pattern
- Not crashing
- Enabled draft of embedding_fc_lstm pass (does it job)
- First working (Seqcompute only) version
- Removed diagnostic comment
- First enabling of BatchCompute
- Disabling pass for embedding with is_sparse and is_distributed
- Cosmetics
- Style
- Style
7 years ago
Sylwester Fraczek
a49aa4dac9
make bias unnecessary for ConvRelu fuse
7 years ago
chengduo
d402234ba8
Feature/op_fuse_pass ( #12440 )
...
* Add Preface
* Add demo code
* Save file
* Refine code
* seems can work
* use elementwise strategy
* Use ElementwiseComputeEx
* Add comments
* extract functions from operator
* Refine code
* Follow comment
* code refine
* add op_fuse pass
* add backward
* code refine
* use TopologySortOperations
* follow comments
* refine IsFusible
* code enhance
* fix op_fusion_pass
* refine code
* refine fuse_elemwise_act_op
* adjust the input and output
* refine logic
* add intermediate_edge
* disable inplace
* follow comments
* refine logic
* follow comments
* Remove the removable IntermediateOut
* change strategy
* code refine
* enable fuse backward
* code refine
* code refine
* rename unit test
* follow comments
7 years ago
Sylwester Fraczek
41de582bb0
create conv relu pass for MKLDNN ( #13258 )
7 years ago
Yan Chunwei
2fd1bf2ea6
fea/add color log ( #13305 )
7 years ago
Yan Chunwei
478a4e850e
refactor ir pattern ( #13304 )
7 years ago
tensor-tang
3ea19b7596
fix bug and fc pass ut
7 years ago
tensor-tang
c9bd2d50f1
refine fc and gru pattern
7 years ago
tensor-tang
0618077971
Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
tensor-tang
74f95b8da0
fix redefine macro
7 years ago
tensor-tang
f057077c3a
add fuse fc gru pass
7 years ago