Sylwester Fraczek
5b2a3c4b12
Conv concat relu quantization ( #17466 )
...
* add conv_concat_relu fuse
test=develop
* add test code
test=develop
* added missing include with unordered_map
test=develop
* review fixes for wojtuss
test=develop
* remove 'should (not) be fused' comment statements
one of them was invalid anyway
test=develop
6 years ago
Sylwester Fraczek
bccb0ba49a
fix quantize_squash_pass segfault when no tensor linked to Bias ( #17292 )
...
* fix quantize_squash_pass segfault when there is no tensor linked do Bias input
test=develop
* add googlenet test
test=develop
* fix concat CreateKey not using input format
test=develop
6 years ago
Qiao Longfei
58f7695ab2
Async exe support communicator ( #17386 )
...
Async exe support communicator
6 years ago
guomingz
2281ebf0f3
Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. ( #17130 )
...
* Relu6 is the bottleneck op for Mobilenet-v2. As the mkldnn supports the conv/relu6 fusion, we implement it fusion via cpass way. Due to the int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR is focused on the fp32 optimization.
Below table shows the benchmark(FPS) which measured on skx-8180(28 cores)
Batch size | with fusion | without fusion
-- | -- | --
1 | 214.7 | 53.4
50 | 1219.727 | 137.280
test=develop
* Fix the format issue
test=develop
* Add the missing nolint comments.
test=develop
* Fix the typos.
test=develop
* Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.
test=develop
* Adjust the indentation.
test=develop
* Add the test_conv_brelu_mkldnn_fuse_pass case.
test=develop
* Slightly update the code per Baidu comments.
Let the parameter definition embedded into the code.
That's will make the code easy to understand.
test=develop
6 years ago
liuwei1031
c3949f5699
remove two useless flags: enable_subgraph_optimize, memory_optimize_debug, test=develop ( #17491 )
6 years ago
Tao Luo
32da5e9c3d
remove unused expected_kernel_cache_pass ( #17486 )
...
test=develop
6 years ago
Zeng Jinle
712bfb17cb
fix recurrent_op,test=develop ( #17433 )
6 years ago
Zhen Wang
4a1b7fec96
Add setting Scope function for the graph class ( #17417 )
...
* add set_not_owned function for graph
* add scope set. test=develop
* add scope_ptr enforce not null before setting.test=develop
6 years ago
chengduo
04bd413acb
Code Clean: Move all pass to paddle::framework::ir ( #17228 )
...
* move pass to ir
* polish code
test=develop
* fix dependency
test=develop
6 years ago
Zeng Jinle
4f8594088d
Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace ( #17225 )
...
* add use_cuda to inplace pass,test=develop
* add test softmax_with_xe_inplace test,test=develop
* fix potential inplace bug
test=develop
* add more skip vars in mem opt pass,test=develop
* follow comment,test=develop
* follow comments,move duplicate out arg check to program->graph,test=develop
6 years ago
石晓伟
a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 ( #17156 )
...
* cherry-pick commit from 8877054
* cherry-pick commit from 3f0b97d
* cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn
(cherry picked from commit 8643dbc233
)
* Cherry-Pick from 16662 : Anakin subgraph cpu support
(cherry picked from commit 7ad182e16c
)
* Cherry-pick from 1662, 16797.. : add anakin int8 support
(cherry picked from commit e14ab180fe
)
* Cherry-pick from 16813 : change singleton to graph RegistBlock
test=release/1.4
(cherry picked from commit 4b9fa42307
)
* Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2
Support ShuffleNet and MobileNet-v2, test=release/1.4
(cherry picked from commit a6fb066f90
)
* Cherry-pick : anakin subgraph add opt config layout argument #16846
test=release/1.4
(cherry picked from commit 8121b3eccb
)
* 1. add shuffle_channel_detect
(cherry picked from commit 6efdea8997
)
* update shuffle_channel op convert, test=release/1.4
(cherry picked from commit e4726a066f
)
* Modify symbol export rules
test=develop
6 years ago
tensor-tang
79ed1c76cd
fix bn fuse vardesc and add model saver ( #17143 )
...
* fix bn fuse vardesc and add model saver
test=develop
* unify save model in test helper
test=develop
* fix mkdir on windows
test=develop
* remove magic number use bn bias var desc
test=develop
6 years ago
Zeng Jinle
4e1bc6e805
Rewrite inplace pass and fix gc bug ( #17126 )
...
* fix op graph view
test=develop
* rewrite inplace pass and fix reference count pass bug
test=develop
* fix unittest failed
test=develop
* follow comments, test=develop
6 years ago
Yihua Xu
93cedfdb9c
Fix the order while sorting the operators ( #16756 )
...
* Fix the order when sorting operators.
test=develop
* Enable transfomer compare test item.
test=develop
* Use set to replace vector.
test=develop
6 years ago
Yiqun Liu
112f16143b
Add an option to enable the cache of expected kernel in train phase. ( #16724 )
...
* Add an option to enable the cache of expected kernel in train phase.
test=develop
* Change the default value of cache_expected_kernel to true.
6 years ago
Tao Luo
ad4a1bd13c
Merge pull request #16339 from luotao1/core_opt_choose_kernel
...
Cache the chosen kernel of operators
6 years ago
Yiqun Liu
3fe8cb0dd7
Enable the runtime_context_cache pass in train phase ( #16640 )
...
* Try to enable the runtime_context_cache pass in train phase.
* Put the append of runtime_context_cache pass ahead of multi_dev passes.
test=develop
6 years ago
luotao1
695f2db6a0
update expected_kernel_cache_pass
...
test=develop
6 years ago
luotao1
226596a296
Merge branch 'develop' into core_opt_choose_kernel
6 years ago
gongweibao
423bc515da
fix batch merge bug ( #16601 )
6 years ago
Qiao Longfei
baf02328b2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
Qiao Longfei
d8974e6da0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
nhzlx
d065b5bf2b
Anakin ssd support
...
refine trt first run
add quant dequant fuse pass
omit simplify_anakin_priorbox_detection template
omit transpose_flatten_concat_fuse template
test=develop
6 years ago
chengduo
ed61d67c73
Fix the interface of Pass::Apply ( #16484 )
...
* modify the interface of Pass::Allay
test=develop
* Polish code
test=develop
* Fix Travis CI
test=develop
* fix Pass::Apply interface
test=develop
* Fix Travis CI
test=develop
6 years ago
Qiao Longfei
392e97aae5
fix cpplint test=develop
6 years ago
Qiao Longfei
30618409db
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
nhzlx
953bdde058
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
...
test=develop
6 years ago
Wojciech Uss
46677fb080
Move cpu_quantize_* passes into mkldnn subfolder
...
test=develop
6 years ago
nhzlx
3df7b98a0f
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
luotao1
056599a738
add expected_kernel_cache_pass
...
test=develop
6 years ago
Wojciech Uss
cbe2dbf0db
Add enabling quantization ( #16326 )
...
* Add enabling quantization
test=develop
* remove unused (here) function
6 years ago
Tao Luo
9a05859179
Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass
...
fix pattern maching conv2d with(out) ResidualData
6 years ago
nhzlx
c407dfa3cb
cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276
6 years ago
nhzlx
a25331bc26
cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx
69d37f81d7
cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
...
support change input size
6 years ago
nhzlx
a1d200a5de
cherry-pick from feature/anakin-engine: Anakin support facebox #16111
6 years ago
Wojciech Uss
104a9f1e27
fix pattern maching conv2d with(out) ResidualData
...
test=develop
6 years ago
luotao1
82af8031d9
add runtime_context_cache_pass
...
test=develop
6 years ago
Tao Luo
7d2740db83
Revert "cache runtime_context"
6 years ago
Qiyang Min
c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
...
Implement imperative infer var type
6 years ago
Wojciech Uss
af03008890
Add cpu_quantize_placement_pass for C-API quantization ( #16265 )
...
* Add cpu_quantize_placement_pass for C-API quantization
test=develop
* added a comment on required pass attributes
test=develop
6 years ago
minqiyang
b40e41fbd1
Polish code style
...
test=develop
6 years ago
minqiyang
36dce65bb3
Take DataType and VarType apart
...
test=develop
6 years ago
luotao1
cc0ae1f1a1
refine with comments
...
test=develop
6 years ago
luotao1
a275fd6e0c
Merge branch 'develop' into runtime_context
6 years ago
Wojciech Uss
2579ade45f
Add cpu_quantize_pass for C-API quantization ( #16127 )
...
* Add cpu_quantize_pass for C-API quantization
test=develop
* add cpu_quantize_pass test
* fix lint: add include memory unorderd_map and unordered_set
test=develop
* fuse_relu 1
test=develop
* tuned 2 without squash
* fixes
test=develop
* remove unused vars
test=develop
* refactored
test=develop
* fix lint c-style cast -> C++ style cast
test=develop
* remove QuantMax and c style casts
test=develop
* last usage of QuantMax removed
test=develop
* Fix Analysis Predictor UT
Check if memory_optimize_pass has already been added
to the analysis config before adding a new one, so
that it is not added multiple times.
test=develop
* change map to unordered_map
fix the forgotten part of cpu_quantize_pass_tester.cc
test=develop
* removed quantized attribute
* fixed cpu_quantize_pass_tester and op attr comments
test=develop
* removed redundant line
test=debug
* removed gmock
test=develop
* fix after merge
6 years ago
qingqing01
86e912c544
Fix windows compiling ( #16230 )
...
test=develop
6 years ago
luotao1
1b59bed989
Merge branch 'develop' into runtime_context
6 years ago
luotao1
6ce25c99a0
Merge branch 'develop' into runtime_context
6 years ago
qingqing01
8ad672a287
Support sync batch norm. ( #16121 )
...
* Support Sync Batch Norm.
* Note, do not enable it in one device.
Usage:
build_strategy = fluid.BuildStrategy()
build_strategy.sync_batch_norm = True
binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
loss_name=loss_mean.name,
build_strategy=build_strategy)
6 years ago
minqiyang
ca392c7e97
Implement infer var type context
6 years ago
Wojciech Uss
b9252f3df8
Add cpu_quantize_squash_pass for C-API quantization ( #16128 )
...
* Add cpu_quantize_squash_pass for C-API quantization
test=develop
* add cpu_quantize_squash_pass teste
* fix lint: add include memory unorderd_map and unordered_set
test=develop
* lint fix 2
* fixes
test=develop
* refactored
test=develop
* fix windows ci
test=develop
6 years ago
luotao1
d94fd97230
add runtime_context_cache_pass
...
test=develop
6 years ago
Zhen Wang
41b8cf0bae
Merge pull request #16162 from wzzju/fix_nan_static_quant
...
Fix NaN bugs for static quantization strategy (mutil-cards train).
6 years ago
Zhen Wang
5685a48c23
Add some fixme. test=develop
6 years ago
Zhen Wang
ac6ef06ffa
Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang
01eddf125c
Not add graph copy construction method. test=develop
6 years ago
Zhen Wang
1b9c8d5f06
add clone function for IrGraph. test=develop
6 years ago
Yihua Xu
40f1dd818b
Fix the node's order issue when the content of graph is changed ( #16088 )
...
* Fix the node's sort issue when the graph is changed.
test=develop
* Clean code
test=develop
6 years ago
Qiao Longfei
fab1b54d99
Merge branch 'add-communicator' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
nhzlx
2eff3e26b6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_static_model_load_for_trt
6 years ago
nhzlx
06a088a199
fix comments and fix cpplint
...
test=develop
6 years ago
Krzysztof Binias
72253391b6
Add MKL-DNN placement pass tester
...
test=develop
6 years ago
Qiao Longfei
49f2f4f91d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
Michal Gallus
6a2bc9a275
Add Conv Residual Connection UT for Projection
...
test=develop
6 years ago
Xin Pan
a6e3cd5eb7
Merge pull request #15425 from panyx0718/api
...
Pass graph to parallel executor instead of program
6 years ago
Qiao Longfei
b8491bfd4e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
Xin Pan
0362ef75f4
fix
...
test=develop
6 years ago
Xin Pan
12a0e2ed9d
polish codes
...
test=develop
6 years ago
Xin Pan
19d78f6797
polish
...
test=develop
6 years ago
Xin Pan
32d5a16036
resolve conflicts
...
test=develop
6 years ago
Michał Gallus
c4faf36e7a
MKL-DNN: Add test for conv bias fuse pass ( #15824 )
...
* MKL-DNN: Add test for conv bias fuse pass
test=develop
* Remove const cast from Conv Bias Pass Test
* Add conv with bias test case for conv+bias fuse ut
test=develop
6 years ago
Xin Pan
26e32e095a
allow compiler to use graph
...
test=develop
6 years ago
Sylwester Fraczek
0b926114c0
add override to ApplyImpl
...
and #pragma once in edited headers
add #include<string> in edited headers
test=develop
6 years ago
Xin Pan
6019054cdd
Merge pull request #15716 from Yancey1989/refine_pg
...
Refine ParallelGraph Execution
6 years ago
tensor-tang
e1c707fe9c
fix warnings ( #15790 )
...
* fix warnings
test=develop
* fix enforce test
test=develop
6 years ago
Yancey1989
4b193db14c
polish code test=develop
6 years ago
Yancey1989
642fd68ce0
update by comment test=develop
6 years ago
Yan Chunwei
077d12b939
fix scale cleaner ( #15742 )
6 years ago
nhzlx
ecc12fb430
3. when runing in trt mode, do not allocate memory for parameters in fluid.
...
test=develop
6 years ago
Yancey1989
ecdd1166b8
cleanup code test=develop
6 years ago
Yancey1989
73005ee00d
cleanup code test=develop
6 years ago
Yancey1989
88d3dc949e
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into refine_pg
...
test=develop
6 years ago
Yancey1989
f3463ecb6e
refine pg execution
6 years ago
dzhwinter
f9ac88e1a0
Merge pull request #15694 from liuwei1031/fix_security_issue
...
Fix security issue
6 years ago
tensor-tang
e49706c80e
Merge pull request #15659 from GBuella/add_to_string
...
Tests - add some missing to_string calls
6 years ago
liuwei1031
b1f97a6fa9
fix security issue 27, 38 test=develop
6 years ago
Gabor Buella
da9c94da33
Clang build fixes ( #15628 )
...
* Remove some superfluous std::move calls
The std:move triggered a build error (with -Werror):
```
[ 9%] Building CXX object paddle/fluid/memory/allocation/CMakeFiles/allocator_facade.dir/allocator_facade.cc.o
/home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: error: moving a temporary object prevents copy elision [-Werror,-Wpessimizing-move]
[this] { return std::move(CreateAllocatorWithChunk()); }, capacity);
^
/home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: note: remove std::move call here
[this] { return std::move(CreateAllocatorWithChunk()); }, capacity);
^~~~~~~~~~ ~
1 error generated.
```
See: https://reviews.llvm.org/D7633
* Remove a superfluous lambda capture from framework/operator.h
```
[ 10%] Building CXX object paddle/fluid/platform/CMakeFiles/device_context.dir/init.cc.o
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/platform/init.cc:19:
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.h:229:21: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture]
[this](Variable* var) { return var; });
^~~~
1 error generated.
```
Changing it to `return it->second;`, as is in the function below.
* Rethrow an exception (instead of copying it)
```
[ 11%] Building CXX object paddle/fluid/framework/CMakeFiles/operator.dir/operator.cc.o
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: error: local variable 'exception' will be copied despite being thrown by name [-Werror,-Wreturn-std-move]
throw exception;
^~~~~~~~~
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: note: call 'std::move' explicitly to avoid copying
throw exception;
^~~~~~~~~
std::move(exception)
```
See https://reviews.llvm.org/D43322 for an explanation of this diagnostic message.
* Remove an unused variable
```
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:884:16: error: private field 'scope_' is not used [-Werror,-Wunused-private-field]
const Scope& scope_;
^
```
* struct ComputationOpHandle -> class ComputationOpHandle
```
[ 13%] Building CXX object paddle/fluid/framework/details/CMakeFiles/memory_early_delete_pass.dir/memory_early_delete_pass.cc.o
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/memory_early_delete_pass.cc:21:
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: error: class 'ComputationOpHandle' was previously declared as a struct; this is valid, but may result in linker errors under the Microsoft C++ ABI [-Werror,-Wmismatched-tags]
class ComputationOpHandle;
^
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/computation_op_handle.h:29:8: note: previous use is here
struct ComputationOpHandle : public OpHandleBase {
^
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: note: did you mean struct here?
class ComputationOpHandle;
^~~~~
struct
1 error generated.
```
* Fix name() methods under fluid/operators
```
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.cc:15:
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.h:19:
/home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/jitcode.h:71:23: error: 'name' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override]
virtual const char* name() const = 0;
^
/home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen_base.h:31:23: note: overridden virtual function is here
virtual const char* name() const = 0;
^
```
test=develop
6 years ago
Dun Liang
1905f1a108
bug fix && test=develop
6 years ago
Qiao Longfei
5cf0092825
add more log and fix test_dist_base in multi_batch_merge_pass
6 years ago
Gabor Buella
4975a9050a
Tests - add some missing to_string calls
...
```
/home/tej/code/gbuella_paddle/paddle/fluid/framework/ir/seqpool_concat_fuse_pass_tester.cc:167:40: error: adding 'int' to a string does not append to the string [-Werror,-Wstring-plus-int]
std::string prefix = "seqpool_op_" + i;
~~~~~~~~~~~~~~^~~
/home/tej/code/gbuella_paddle/paddle/fluid/framework/ir/seqpool_concat_fuse_pass_tester.cc:167:40: note: use array indexing to silence this warning
std::string prefix = "seqpool_op_" + i;
^
& [ ]
1 error generated.
```
test=develop
6 years ago
peizhilin
238ef94702
fix the build issue on gpu mode for win
...
test=develop
6 years ago
Xin Pan
74bc55c2a6
Merge pull request #14975 from dzhwinter/ir_inplace_pass
...
Ir inplace pass
6 years ago
Yan Chunwei
dc5e25fc7f
remove dot marked node ( #15606 )
6 years ago
dzhwinter
9c9ad7d40b
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
...
test=develop
6 years ago
Yan Chunwei
897789b16e
fix save_inferece_model bug ( #15365 )
6 years ago
dzhwinter
6f9904e99a
rerun windows ci. test=develop
6 years ago
dzhwinter
a52be7c081
refine build strategy. test=develop
6 years ago
Krzysztof Binias
b1bdcd4de8
Make separate folders for mkldnn codes
...
test=develop
6 years ago
dzhwinter
ee3aae56cd
merge develop branch. test=develop
6 years ago
dzhwinter
d6d3e6afe2
add more skip strategy
6 years ago
nhzlx
0779e35544
fix two bug:
...
1. graph and program_desc alignment
2. trt stream
test=develop
6 years ago
WangZhen
3ce6172052
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
dzhwinter
8f3b252392
squash commits. test=develop
6 years ago
Yan Chunwei
885c4e57ab
fea/infer memory optim2 ( #14953 )
6 years ago
Dun
9f8f0fc2d3
Memory optimization of depthwise conv op and group norm op ( #15313 )
...
* mem opt
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* refine code test=develop
* refine code test=develop
* refine code test=develop
* refine code test=develop
* refine with cub test=develop
* fix mkldnn test && remove comments && test=develop
* polish code && test=develop
* add only_forward test && test=develop
6 years ago
WangZhen
451896fce4
init quantization.
6 years ago
tensor-tang
d618e48309
fix fuse square mat order and refine test
...
test=develop
6 years ago
tensor-tang
a5d2a6d1ad
add fuse pass of sequared mat sub fusion
6 years ago
tensor-tang
ca6fdc6e33
refine and fix test
...
test=develop
6 years ago
tensor-tang
a89296ac1f
add repeated fc relu pass
6 years ago
Zhaolong Xing
98e85f3735
add_transpose_flatten_concat_fuse ( #15121 )
6 years ago
tensor-tang
c3a9f3c4b2
fix typo and refine
...
test=develop
6 years ago
tensor-tang
ab9c4b2a9f
refine seqpool concat pass and remove unused nodes
...
test=develop
6 years ago
tensor-tang
ce909664d8
Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
6 years ago
flame
fb63cd89d4
Add python ir graph API ( #14917 )
6 years ago
tensor-tang
a0a27bd240
add seqpool concat fuse pass tester
...
test=develop
6 years ago
tensor-tang
8e086a8521
follow comment and fix typo
...
test=develop
6 years ago
tensor-tang
72d2a1801e
add seqpool concat fuse pass
...
test=develop
6 years ago
minqiyang
b76695418a
Polish log
...
test=develop
6 years ago
minqiyang
4bfa110fd8
Add no lock optimize pass
...
test=develop
6 years ago
Wu Yi
e26cced7cc
refine batch merge pass ( #14777 )
...
* refine batch merge pass
* refine batch merge pass test=develop
6 years ago
Zhaolong Xing
4048cfa9da
Merge pull request #15048 from NHZlX/add_affine_channel_fuse
...
Add conv+ affine channel fuse pass
6 years ago
chengduo
fe8495a758
[WIP] Refine MultiDevSSAGraph ( #15040 )
...
* refine parallel_exe
test=develop
* rename shared_var_device
* code refine
* add test_weight_decay
* remove Sort
test=develop
* Add SortForReduce
test=develop
* code refine
test=develop
* follow comment
test=develop
6 years ago
nhzlx
a6aa8ea771
faster rcnn input is presistable. (fix it in paddle-trt)
...
test=develop
6 years ago
hjchen2
956cf92145
Fix conv_elementwise_add2_act pass
...
test=develop
6 years ago
nhzlx
ce3782c193
add affine_channel fuse.
...
fix conv+elemenwise fuse bug.
6 years ago
Tomasz Patejko
e9eee0de6a
MKLDNN residual connection fuse: fixing accuracy problem ( #14874 )
...
* MKLDNN residual connection fuse: conv op reused
test=develop
* MKLDNN residual connection fuse: added prints for checking fuse
test=develop
* MKLDNN residual connection fuse: add more prints
test=develop
* MKLDNN residual connection fuse: add hash function. test=develop
* MKLDNN residual connection fuse: add hash to elementwise_add
test=develop
* MKLDNN residual connection fuse: add more hashes. test=develop
* MKLDNN residual connection fuse: added hashes to relu
test=develop
* MKLDNN residual connection fuse: do not fuse when fuse_relu is on
* MKLDNN residual connection fuse: check if fuse_relu attribute is set
test=develop
* MKLDNN residual connection fuse: comment out some printouts
* MKLDNN residual connection fuse: remove unused functions in the pass code
* MKLDNN residual connection fuse: delete commented hashes and printouts
* MKLDNN residual connection fuse: remove unnecessary includes. test=develop
6 years ago
Zhaolong Xing
a9fb34fad8
Merge pull request #14903 from NHZlX/add_conv_elementwise_pass
...
Add conv + elementwiseAdd pass
6 years ago
dzhwinter
7cd24b1318
add ir memory optimize. ( #14530 )
...
* follow comments. test=develop
* Fix typo
* fix compile error. test=develop
* merge develop branch. test=develop
* Remove set_equal
* Polish code
* Delete unused functions
test=develop
* polish code. test=develop
* follow comment
* polish code.
* fix windows compile error. test=develop
* fix op handle.
* rerun ci. test=develop
* rerun ci. test=develop
* rerun macci. test=develop
* polish code. test=develop
* rewrite sort code. test=develop
* remove unused code. test=develop
* fix tests. test=develop
* fix conflict. test=develop
* follow comment. test=develop
* merge develop branch. test=develop
* fix tests. test=develop
* remove ToTypeIndex. test=develop
* rerun ci. test=develop
6 years ago
nhzlx
c0c9fcd9c7
add source file
...
test=develop
6 years ago
nhzlx
4e4a777243
add conv+elementwiseadd pass
...
test=develop
6 years ago
Yan Chunwei
a985949be9
Fea/fuse conv elementwise add fuse ( #14669 )
6 years ago
sneaxiy
7923042365
merge develop
...
test=develop
6 years ago
Tao Luo
3243b45bb2
Merge pull request #14651 from luotao1/has_attr
...
add Opdesc's HasProtoAttr
6 years ago
Tao Luo
067ed70f2d
add HasProtoAttr function in op_desc.h, clean node.h
...
test=develop
6 years ago
Xin Pan
748549b2e3
Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"
...
This reverts commit b1d3a1c8b4
, reversing
changes made to f1fb64b17f
.
6 years ago
bingyanghuang
943ad4781f
One possible solution to add flexibility for mkldnn placement pass ( #14768 )
...
* Choose to turn on use_mkldnn attribute v1
* Fix mkldnn_op empty bug
* format change test=develop
* fix ci test=develop
* fix ci test and add test in dam test=develop
* add example to dam compare test test=develop
* review changes test=develop
6 years ago
Xin Pan
c049fa7cf7
Revert "Revert "Revert "Imperative"""
6 years ago
Tao Luo
c83d5b7a16
Merge pull request #14709 from yihuaxu/develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
...
Implement the fusion of convolution 3D and bias for mkldnn
6 years ago
Yihua Xu
3821fc3950
Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
...
test=develop
6 years ago
Yihua Xu
240d974ac5
Clean Code
...
test=develop
6 years ago
Xin Pan
2538ef64f1
Revert "Revert "Imperative""
6 years ago
Xin Pan
6217f42ab7
Revert "Imperative"
6 years ago
Xin Pan
b52f5d2870
Merge pull request #14398 from panyx0718/imperative
...
Imperative
6 years ago
Xin Pan
0591ba96ec
fix hack
...
test=develop
6 years ago
sneaxiy
64ad051b9a
merge develop
...
test=develop
6 years ago
Yihua Xu
64e261c6cd
Implement the fusion of convolution and bias for mkldnn
...
(test=develop)
6 years ago
Tao Luo
8d6984eb9b
change OpHasAttr to RuntimeHasAttr, add some comments
...
test=develop
6 years ago
Xin Pan
aeb74af54c
allow operator to run imperatively
6 years ago
sneaxiy
096673f675
refactor eager deletion
...
test=develop
6 years ago
Tao Luo
c856ac8721
add OpHasAttr in node.h, update is_test_pass and mkldnn_placement_pass
...
test=develop
6 years ago
minqiyang
be04d99fe4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
...
test=develop
6 years ago
minqiyang
53433d7f2e
Revert the changes of VLOG
...
test=develop
6 years ago
peizhilin
7c8c9dc9bf
fix unit test cases
6 years ago
chengduo
a94a7355f0
Refine the GraphNum check ( #14144 )
...
* refine GraphCheck
test=develop
* fix ci fail
test=develop
6 years ago
Tomasz Patejko
53da846d1e
MKLDNN residual connections fuse pass: initial implementation of fusion for projection pass
...
test=develop
6 years ago
Tomasz Patejko
dbc4fcd722
MKLDNN residual connections fuse pass: unit tests enabled and added
6 years ago
Tomasz Patejko
4224089354
MKLDNN residual connections fuse pass: Maybe removed and boost::optional used where it makes sense
6 years ago
Tomasz Patejko
86fd3b32be
MKLDNN residual connections fuse pass: counting statistics added to the pass
6 years ago
Tomasz Patejko
ee6f778beb
MKLDNN residual connections fuse pass: further refactoring
6 years ago
Tomasz Patejko
7423748e37
MKLDNN residual connections fuse pass:
...
* implements reachability check between identity node and non-identity argument to elementwise_add
* implements handling identity node as x and as y argument to elementwise_add
6 years ago
Sylwester Fraczek
8a1eeec579
add mkldnn prop_kind phase for inference-only case to pooling and activations ( #14278 )
...
* add is_test to pooling and activations
add prop_kind support for layers activation. conv and pooling
add a pass that sets is_test to true
add transpiler version of is_test pass
test=develop
* patch test and pass
test=develop
* add pass to analyzer.h
test=develop
* add is_test attr description & pass only on mkldnn
in:
activation_op.cc
batch_norm_op.cc
conv_op.cc
dropout_op.cc
lrn_op.cc
pool_op.cc
sequence_pool_op.cc
softmax_op.cc
* fix is_test handling for activation pool and conv
* change description of is_test for all layers again
* remove GetAttr(use_mkldnn) from pass
* rename correct_mkldnn_test_phase to is_test
and remove dependency on MKLDNN
test=develop
* review fix magic number
* two if(..)s into one
* Check is_test once and pass mkldnn forward prop kind
* dereference shared_ptr with * (without get())
test=develop
* add is_test_pass back
test=develop
6 years ago
Tao Luo
5ef123c778
Merge branch 'develop' into dam_fc
6 years ago
Tao Luo
980a6753a8
fix typo to pass the ci
...
test=develop
6 years ago
peizhilin
1a9008c420
code style fix
...
test=develop
6 years ago
Tao Luo
8ea13e336a
add in_num_col_dims for fc
6 years ago
peizhilin
08d1dc84a9
fix
6 years ago
peizhilin
30ddc07a7e
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yan Chunwei
9f252e0032
Combine Inference Analysis with IR ( #13914 )
6 years ago
Tao Luo
6c32945556
Merge pull request #14372 from luotao1/speedup_analysis
...
speedup DetectPatterns
6 years ago
Tao Luo
668ae523d2
speedup DetectPatterns
...
test=develop
6 years ago
Yan Chunwei
9a6e239281
fix mac graph detector sort ( #14356 )
6 years ago
peizhilin
dc339b78d7
fix code style
6 years ago
peizhilin
ca60e1d34d
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
d08334011a
fix merge issue
6 years ago
peizhilin
52f7644f53
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
qingqing01
abe209234f
Exhaustive search for cuDNN conv. ( #14286 )
...
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
* Fix compiling test=develop
6 years ago
minqiyang
87450b9ad4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
6 years ago
peizhilin
4ffa92d4f0
Merge branch 'develop' into windows/build
6 years ago
Tao Luo
813e54efbd
Merge pull request #14328 from PaddlePaddle/revert-14046-windows/debug
...
Revert "cherry picked windows patches."
6 years ago
minqiyang
3db9fad764
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
6 years ago
Xin Pan
ff6c809bfc
Merge pull request #14251 from panyx0718/fix
...
Make OpHandle/VarHandle and ir::Node works cleaner
6 years ago
Zhaolong Xing
ba8b5619a3
Revert "cherry picked windows patches."
6 years ago
minqiyang
fcc0452c8b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
6 years ago
minqiyang
0c3227a523
Change the origin VLOG level to 10 times
...
Fix code to support cpplint syntax check
test=develop
6 years ago
Xin Pan
59c66532e7
add more logs and comments
...
test=develop
6 years ago
peizhilin
869487a2b7
Merge remote-tracking branch 'origin/develop' into windows/build
6 years ago
dzhwinter
2835e04409
merge develop branch. test=develop
6 years ago
qingqing01
db8c52da5e
Revert " Exhaustive search for cuDNN conv. ( #14043 )"
...
This reverts commit ce7d9b0799
.
6 years ago
qingqing01
ce7d9b0799
Exhaustive search for cuDNN conv. ( #14043 )
...
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Clean code
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
6 years ago
Xin Pan
25123a3b7e
add tests
...
test=develop
6 years ago
Xin Pan
8c11d3fed6
clean up
6 years ago
Xin Pan
adf5615e54
clean kGraphOp
...
test=develop
6 years ago
Xin Pan
fb576cb5cb
allow to compare type
...
test=develop
6 years ago
Xin Pan
ead94bfc6c
fix destructor
...
test=develop
6 years ago
Xin Pan
2e14999942
clean1
...
test=develop
6 years ago
peizhilin
9d67c1fb69
cpu build support
6 years ago
Xin Pan
c2d70fca30
fix to only check block 0
...
test=develop
6 years ago
Xin Pan
aaeedd0ff3
make it warn
...
test=develop
6 years ago
Xin Pan
ddd2225b56
add more debug info.
...
test=develop
6 years ago
Tao Luo
d2a56f7909
Merge pull request #14159 from sfraczek/sfraczek/depthwise-conv-mkldnn-pass
...
add depthwise conv mkldnn pass
6 years ago
dzhwinter
eb2f7ed21b
refine tests. test=develop
6 years ago
Xin Pan
08d22cf7e1
Merge pull request #14091 from panyx0718/fix2
...
add program check
6 years ago
Yan Chunwei
f76fee644c
fix graph pattern detector ( #14186 )
6 years ago
Sylwester Fraczek
4e2aaf01bc
add depthwise conv mkldnn pass
...
added depthwise conv mkldnn pass which for MKLDNN changes depthwise_conv operator to conv operator because for mkldnn this is the same api
test=develop
6 years ago
Xin Pan
a943134a97
fix a few more tests
...
test=develop
6 years ago
Xin Pan
5839e3236b
add program check
...
test=develop
6 years ago
dzhwinter
ebfe5a02b3
merge develop branch
6 years ago
Wu Yi
26200f2e42
[1.1] [project] train imagenet using large batch size ( #13766 )
...
* fix nccl2 lars dist support
* put lars in momentum op
* add tests lars
* fix ci
* fix cpu kernel
* soft warning
* remove lars in test_recognize_digits.py
* move to another op
* add file
* update api.spec test=develop
* update test=develop
* fix api.spec test=develop
* wip
* wip, finish grad merge ops
* wip, finish graph build
* wip test running
* work on 1 gpu
* workable version
* update
* fix tests
* fuse broadcast op
* fix compile failed
* refine
* add batch merge test mnist
* fix CI test=develop
* fix build
* use independent bn params for batch merge test=develop
* update api.spec
* follow comments and for test
* wip
* refine tests test=develop
* follow comments test=develop
* remove startup bn modify test=develop
* follow comments test=develop
* fix merge test=develop
6 years ago
Xin Pan
0a80f06ec4
Merge pull request #14086 from panyx0718/fix6
...
delete unused codes.
6 years ago
Wu Yi
9da9b1926b
[1.1] fix graph num hang ( #14072 )
...
* fix graph num hang test=develop
* re-enable tests test=develop
* re-enable graph num check test=develop
* fix multi device pass role check test=develop
6 years ago
Xin Pan
bba0c4a9f2
delete unused codes.
...
test=develop
6 years ago
wanghaoshuang
78cf76a1ca
fix linux compile
6 years ago
Tao Luo
316bc9bfc9
fix typo and warning in analyzer_resnet50_test
...
test=develop
6 years ago
Tao Luo
42aa1d409d
Merge pull request #13485 from tpatejko/tpatejko/capi-resnet-conv-elementwise-fusion
...
MKLDNN conv+elementwise_add fusion for residual connections in Resnet
6 years ago
Tomasz Patejko
ce2464fd98
MKLDNN conv + elementwise_add fusion: UT for missing bias added. UTs refactored. Some minor changes in the pass
6 years ago
Tomasz Patejko
4e72ab411e
MKLDNN conv + elementwise_add fusion: fix for crash when bias is not present
6 years ago
Tomasz Patejko
415b261555
MKLDNN conv + elementwise_add fusion: fusion options added
6 years ago
Tomasz Patejko
0fe3079c46
MKLDNN conv + elementwise_add fusion: fix for order of parameters in elementwise_add in resnet50
...
test=develop
6 years ago
Tomasz Patejko
b73b868366
MKLDNN conv + elementwise_add fusion: bias in tests made persistent.
...
test=develop
6 years ago
Tomasz Patejko
a1fa203287
MKLDNN conv + elementwise_add fusion: name of the pass reused with name_scope_
6 years ago
Tomasz Patejko
2c43419db1
MKLDNN conv + elementwise_add fusion: comment explaining CorrectGraphEdges added
6 years ago
Tomasz Patejko
8fb29b2ca9
MKLDNN conv + elementwise_add fusion: new nodes marked as input or output
...
test=develop
6 years ago
Tomasz Patejko
cc1c8e37c1
MKLDNN conv + elementwise_add fusion: attributes in new conv op copied from old op
6 years ago
Tomasz Patejko
a27a8c5da8
MKLDNN conv + elementwise_add fusion: bias in test marked as persistable
6 years ago
Tomasz Patejko
af8c71317c
MKLDNN conv + elementwise_add fusion: CorrectGraphEdges refactored
6 years ago
Tomasz Patejko
3e033087f1
MKLDNN conv + elementwise_add fusion: LinkNodes function removed and
...
macro used.
test=develop
6 years ago
Tomasz Patejko
4be45af1cc
MKLDNN conv + elementwise_add fusion: skip connection attribute renamed. Comments about patterns added.
...
test=develop
6 years ago
Tomasz Patejko
9a335e0277
MKLDNN conv + elementwise_add fusion: changed a name of a formal argument in ElementwiseAdd pattern
6 years ago
Tomasz Patejko
fb7a50b230
MKLDNN conv + elementwise_add fusion: removed commented code. Internal functions marked as static.
...
test=develop
6 years ago
Tomasz Patejko
efd76614fb
MKLDNN conv + elementwise_add fusion: implementation changed to conform with Paddle API
6 years ago
Tomasz Patejko
347bf90412
MKLDNN conv + elementwise_add fusion: bias is also handled
6 years ago
Tomasz Patejko
bf95ac36a7
MKLDNN conv + elementwise_add fusion: further reformatting
6 years ago
Tomasz Patejko
cbe122ae2e
MKLDNN conv + elementwise_add fusion: correcting formatting
6 years ago
Tomasz Patejko
2a251bbf27
MKLDNN conv + elementwise_add fusion: some refactoring: consts, function calls instead of constant values
6 years ago
Tomasz Patejko
b8e54ab5cc
MKLDNN conv + elementwise_add fusion: parameter name changed to ResidualData
6 years ago
Tomasz Patejko
27573ece03
MKLDNN conv + elementwise_add fusion: trailing spaces removed
6 years ago
Tomasz Patejko
7f5c8a95e8
MKLDNN conv + elementwise_add fusion: arguments are replaced for many parameters in operator
6 years ago
Tomasz Patejko
5996bd39e8
MKLDNN conv + elementwise_add fusion: graph is corrected based on actual argument name, not formal argument name
6 years ago
Tomasz Patejko
41f3d78fdf
MKLDNN conv + elementwise_add fusion: output and elemwise param share data in conv primitive. Output is properly allocated
6 years ago
Tomasz Patejko
07a62ddc08
MKLDNN conv + elementwise_add fusion: inputs in pass modified. Support for new conv parameter. UTs corrected
6 years ago
Tomasz Patejko
42f569fdfd
MKLDNN conv + elementwise_add fusion: use_mkldnn attribute added
6 years ago
Tomasz Patejko
441d3a4726
MKLDNN conv + elementwise_add: added some refactoring in the pass
6 years ago
Tomasz Patejko
38b7b34b1c
MKLDNN conv + elementwise_add fusion: added reachability tests, inputs and outputs in graph nodes are transformed
6 years ago
Tomasz Patejko
16eaaf3fbe
MKLDNN conv + elementwise_add fusion: added one more UT, found and corrected bugs in pass
6 years ago
Tomasz Patejko
604bad08bc
MKLDNN conv + elementwise_add fusion: implementation of patterns refarctored, applied to graph. UTs added
6 years ago
Tomasz Patejko
9ce343f868
MKLDNN conv + elementwise_add fusion: initial implementation of patterns
6 years ago
tensor-tang
40f8456a4f
refine fuse pattern and attr
...
test=develop
6 years ago
tensor-tang
cbbacb2534
Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
...
test=develop
6 years ago
tensor-tang
603ba5e01d
add seqconv eltadd relu pass
6 years ago
Michal Gallus
c504a5a1b7
Adjust Conv+bias to placement pass
...
test=develop
6 years ago
Michal Gallus
d7509d63f1
Conv+Bias: Support non-null bias
...
test=develop
6 years ago
Michal Gallus
582f59c190
Conv+Bias fuse
6 years ago
Wojciech Uss
c3b70aece9
Add MKL-DNN placement pass ( #13958 )
...
* add MKL-DNN placement pass
This patch also refactors conv+bn (includes changes from PR
https://github.com/PaddlePaddle/Paddle/pull/13926 )
updated to use the mkldnn-placement-pass.
test=develop
* remove redundant pass list
* add comment on the default first pass
* fix test for conv+relu mkldnn fuse
6 years ago
Tao Luo
7b11162ab5
Merge pull request #13949 from PaddlePaddle/wojtuss/unique-patterns-request-comment
...
Added comment with request for enhancement
6 years ago
Wojciech Uss
55fd136ab0
Added comment with request for enhancement
...
This adds a `TODO` comment according to https://github.com/PaddlePaddle/Paddle/issues/13550#issuecomment-430133585
test=develop
6 years ago
Tao Luo
e47f4186ae
fix some compiler warning
6 years ago
dzhwinter
e41a3fcd68
fix update to develop hang problem.
6 years ago
Sylwester Fraczek
50c5e9b0c6
reshape_2d used from ddim.h
...
test=develop
6 years ago
wanghaoshuang
3ae9645084
compile in linux
6 years ago
Sylwester Fraczek
55d6950a1a
rewrite conv_bn fuse pass to eigen
...
test=develop
6 years ago
Tao Luo
9b11a17502
Revert "[MKLDNN] Pass: Fuse Conv + Bias"
6 years ago
Tao Luo
ce248a15d9
Merge pull request #13368 from Sand3r-/mgallus/conv-bias-pass
...
[MKLDNN] Pass: Fuse Conv + Bias
6 years ago
Michal Gallus
40b17be4b0
Pass: Fuse Conv + Bias
...
test=develop
6 years ago
Sylwester Fraczek
3fcca40909
eigen sqrt fix and change 1e-5 to epsilon
...
test=develop
6 years ago
Sylwester Fraczek
78f98294c2
conv bn fuse pass
...
review fix
review from hshen14 fix
test=develop
fix error in broadcast and code cleanup
rename bias -> eltwise and added macro to shorten code
formatting
6 years ago
Tao Luo
a89afd4c22
Merge pull request #13685 from luotao1/naive_cmake
...
update libpaddle_fluid.a/so
6 years ago
luotao1
a989a4e7c2
refine paddle_inference_helper.h
6 years ago
tensor-tang
ede4b230be
Merge pull request #13553 from jczaja/prv-fused_embedding_fc_lstm_op
...
Adding fused_embedding_fc_lstm op
6 years ago
Jacek Czaja
e202f33aa9
- Yet another clarification to comment
...
test=develop
7 years ago
Jacek Czaja
1df69f7c9d
- Fix to comment
...
test=develop
7 years ago
Yan Chunwei
c8744d118d
fea/infer executor and concurrency performance issue bug fix ( #13451 )
...
- add naive executor
- fix concurrency performance issue
7 years ago
chengduo
5175b3cb2b
Add GraphChecker ( #13580 )
...
* add GraphNum
test=develop
* add graph number check in parallelExecutor
test=develop
* fix transformer_model bug
test=develop
* fix graph num
7 years ago
Xin Pan
7cd2761736
Merge pull request #13416 from panyx0718/ir
...
PassBuilder
7 years ago
Jacek Czaja
d5114c60b0
- Reviewers suggesstions to fused_embedding_fc_lstm_op
7 years ago
Jacek Czaja
7ab5626dee
- Added initial pass for embedding-fc-lstm
...
- Added draft of new operator
- Added fused embedding fc lstm files
- First time embedding_fc_lstm_fuse_pass was invoked in
test_text_classification
- Added Embedding pattern
- Not crashing
- Enabled draft of embedding_fc_lstm pass (does it job)
- First working (Seqcompute only) version
- Removed diagnostic comment
- First enabling of BatchCompute
- Disabling pass for embedding with is_sparse and is_distributed
- Cosmetics
- Style
- Style
7 years ago
Yan Chunwei
9e8d372ff4
hide attention lstm fuse ( #13615 )
7 years ago
tensor-tang
6938e6cf06
Merge pull request #13603 from tensor-tang/refine/peephole
...
refine peephole
7 years ago
Wu Yi
16e73e0d28
hide operator API ( #12543 )
...
* hide operator API
* update
* update api.spec
* fix merge
* fix test
7 years ago
tensor-tang
209e9c3db1
refine peephole
...
test=develop
7 years ago
Sylwester Fraczek
e5d1bd1e93
remove unused variable nodes2delete
7 years ago
Sylwester Fraczek
a49aa4dac9
make bias unnecessary for ConvRelu fuse
7 years ago
Sylwester Fraczek
493ef0c8df
do not remove conv node just rewire the output
7 years ago
Sylwester Fraczek
667b661786
updated the test
7 years ago
Xin Pan
c2b3838cf5
add some comments
7 years ago
Xin Pan
b43e49fa31
fix
7 years ago
Xin Pan
36c2a9af27
pass builder allow cutomize pass in python.
7 years ago
chengduo
d402234ba8
Feature/op_fuse_pass ( #12440 )
...
* Add Preface
* Add demo code
* Save file
* Refine code
* seems can work
* use elementwise strategy
* Use ElementwiseComputeEx
* Add comments
* extract functions from operator
* Refine code
* Follow comment
* code refine
* add op_fuse pass
* add backward
* code refine
* use TopologySortOperations
* follow comments
* refine IsFusible
* code enhance
* fix op_fusion_pass
* refine code
* refine fuse_elemwise_act_op
* adjust the input and output
* refine logic
* add intermediate_edge
* disable inplace
* follow comments
* refine logic
* follow comments
* Remove the removable IntermediateOut
* change strategy
* code refine
* enable fuse backward
* code refine
* code refine
* rename unit test
* follow comments
7 years ago
Xin Pan
23ba766252
Merge pull request #13475 from panyx0718/ir5
...
avoid creating dangling ir::Node.
7 years ago
Xin Pan
0bd7a67eaf
avoid creating dangling ir::Node.
...
Node should be created by Graph::CreateXXX so that
they are managed by graph.
7 years ago
sneaxiy
d0b2453ecd
merge develop
7 years ago
sneaxiy
24ea39c4c6
feature/eager_delete_tensor
7 years ago
Tao Luo
9f03ce6c4a
Merge pull request #13318 from Superjomn/refine/simple_fix
...
simple fix
7 years ago
Sylwester Fraczek
dd149d469b
hotfix for conv-relu pass
7 years ago
luotao1
b12322ce95
fix fusion_lstm unique_name bug
7 years ago
Sylwester Fraczek
41de582bb0
create conv relu pass for MKLDNN ( #13258 )
7 years ago
Yan Chunwei
2fd1bf2ea6
fea/add color log ( #13305 )
7 years ago
superjomn
6b2f680d88
simple fix
7 years ago
Yan Chunwei
478a4e850e
refactor ir pattern ( #13304 )
7 years ago