Michal Gallus
c504a5a1b7
Adjust Conv+bias to placement pass
...
test=develop
7 years ago
Michal Gallus
d7509d63f1
Conv+Bias: Support non-null bias
...
test=develop
7 years ago
Michal Gallus
582f59c190
Conv+Bias fuse
7 years ago
Wojciech Uss
c3b70aece9
Add MKL-DNN placement pass ( #13958 )
...
* add MKL-DNN placement pass
This patch also refactors conv+bn (includes changes from PR
https://github.com/PaddlePaddle/Paddle/pull/13926 )
updated to use the mkldnn-placement-pass.
test=develop
* remove redundant pass list
* add comment on the default first pass
* fix test for conv+relu mkldnn fuse
7 years ago
Tao Luo
7b11162ab5
Merge pull request #13949 from PaddlePaddle/wojtuss/unique-patterns-request-comment
...
Added comment with request for enhancement
7 years ago
Wojciech Uss
55fd136ab0
Added comment with request for enhancement
...
This adds a `TODO` comment according to https://github.com/PaddlePaddle/Paddle/issues/13550#issuecomment-430133585
test=develop
7 years ago
Tao Luo
e47f4186ae
fix some compiler warning
7 years ago
dzhwinter
e41a3fcd68
fix update to develop hang problem.
7 years ago
Sylwester Fraczek
50c5e9b0c6
reshape_2d used from ddim.h
...
test=develop
7 years ago
wanghaoshuang
3ae9645084
compile in linux
7 years ago
Sylwester Fraczek
55d6950a1a
rewrite conv_bn fuse pass to eigen
...
test=develop
7 years ago
Tao Luo
9b11a17502
Revert "[MKLDNN] Pass: Fuse Conv + Bias"
7 years ago
Tao Luo
ce248a15d9
Merge pull request #13368 from Sand3r-/mgallus/conv-bias-pass
...
[MKLDNN] Pass: Fuse Conv + Bias
7 years ago
Michal Gallus
40b17be4b0
Pass: Fuse Conv + Bias
...
test=develop
7 years ago
Sylwester Fraczek
3fcca40909
eigen sqrt fix and change 1e-5 to epsilon
...
test=develop
7 years ago
Sylwester Fraczek
78f98294c2
conv bn fuse pass
...
review fix
review from hshen14 fix
test=develop
fix error in broadcast and code cleanup
rename bias -> eltwise and added macro to shorten code
formatting
7 years ago
Tao Luo
a89afd4c22
Merge pull request #13685 from luotao1/naive_cmake
...
update libpaddle_fluid.a/so
7 years ago
luotao1
a989a4e7c2
refine paddle_inference_helper.h
7 years ago
tensor-tang
ede4b230be
Merge pull request #13553 from jczaja/prv-fused_embedding_fc_lstm_op
...
Adding fused_embedding_fc_lstm op
7 years ago
Jacek Czaja
e202f33aa9
- Yet another clarification to comment
...
test=develop
7 years ago
Jacek Czaja
1df69f7c9d
- Fix to comment
...
test=develop
7 years ago
Yan Chunwei
c8744d118d
fea/infer executor and concurrency performance issue bug fix ( #13451 )
...
- add naive executor
- fix concurrency performance issue
7 years ago
chengduo
5175b3cb2b
Add GraphChecker ( #13580 )
...
* add GraphNum
test=develop
* add graph number check in parallelExecutor
test=develop
* fix transformer_model bug
test=develop
* fix graph num
7 years ago
Xin Pan
7cd2761736
Merge pull request #13416 from panyx0718/ir
...
PassBuilder
7 years ago
Jacek Czaja
d5114c60b0
- Reviewers suggesstions to fused_embedding_fc_lstm_op
7 years ago
Jacek Czaja
7ab5626dee
- Added initial pass for embedding-fc-lstm
...
- Added draft of new operator
- Added fused embedding fc lstm files
- First time embedding_fc_lstm_fuse_pass was invoked in
test_text_classification
- Added Embedding pattern
- Not crashing
- Enabled draft of embedding_fc_lstm pass (does it job)
- First working (Seqcompute only) version
- Removed diagnostic comment
- First enabling of BatchCompute
- Disabling pass for embedding with is_sparse and is_distributed
- Cosmetics
- Style
- Style
7 years ago
Yan Chunwei
9e8d372ff4
hide attention lstm fuse ( #13615 )
7 years ago
tensor-tang
6938e6cf06
Merge pull request #13603 from tensor-tang/refine/peephole
...
refine peephole
7 years ago
Wu Yi
16e73e0d28
hide operator API ( #12543 )
...
* hide operator API
* update
* update api.spec
* fix merge
* fix test
7 years ago
tensor-tang
209e9c3db1
refine peephole
...
test=develop
7 years ago
Sylwester Fraczek
e5d1bd1e93
remove unused variable nodes2delete
7 years ago
Sylwester Fraczek
a49aa4dac9
make bias unnecessary for ConvRelu fuse
7 years ago
Sylwester Fraczek
493ef0c8df
do not remove conv node just rewire the output
7 years ago
Sylwester Fraczek
667b661786
updated the test
7 years ago
Xin Pan
c2b3838cf5
add some comments
7 years ago
Xin Pan
b43e49fa31
fix
7 years ago
Xin Pan
36c2a9af27
pass builder allow cutomize pass in python.
7 years ago
chengduo
d402234ba8
Feature/op_fuse_pass ( #12440 )
...
* Add Preface
* Add demo code
* Save file
* Refine code
* seems can work
* use elementwise strategy
* Use ElementwiseComputeEx
* Add comments
* extract functions from operator
* Refine code
* Follow comment
* code refine
* add op_fuse pass
* add backward
* code refine
* use TopologySortOperations
* follow comments
* refine IsFusible
* code enhance
* fix op_fusion_pass
* refine code
* refine fuse_elemwise_act_op
* adjust the input and output
* refine logic
* add intermediate_edge
* disable inplace
* follow comments
* refine logic
* follow comments
* Remove the removable IntermediateOut
* change strategy
* code refine
* enable fuse backward
* code refine
* code refine
* rename unit test
* follow comments
7 years ago
Xin Pan
23ba766252
Merge pull request #13475 from panyx0718/ir5
...
avoid creating dangling ir::Node.
7 years ago
Xin Pan
0bd7a67eaf
avoid creating dangling ir::Node.
...
Node should be created by Graph::CreateXXX so that
they are managed by graph.
7 years ago
sneaxiy
d0b2453ecd
merge develop
7 years ago
sneaxiy
24ea39c4c6
feature/eager_delete_tensor
7 years ago
Tao Luo
9f03ce6c4a
Merge pull request #13318 from Superjomn/refine/simple_fix
...
simple fix
7 years ago
Sylwester Fraczek
dd149d469b
hotfix for conv-relu pass
7 years ago
luotao1
b12322ce95
fix fusion_lstm unique_name bug
7 years ago
Sylwester Fraczek
41de582bb0
create conv relu pass for MKLDNN ( #13258 )
7 years ago
Yan Chunwei
2fd1bf2ea6
fea/add color log ( #13305 )
7 years ago
superjomn
6b2f680d88
simple fix
7 years ago
Yan Chunwei
478a4e850e
refactor ir pattern ( #13304 )
7 years ago
tensor-tang
3ea19b7596
fix bug and fc pass ut
7 years ago
tensor-tang
df0c695618
fix fusion gru pass and enable it
7 years ago
tensor-tang
c9bd2d50f1
refine fc and gru pattern
7 years ago
tensor-tang
353d7d1be6
Merge branch 'develop' into refine/ut/lac
7 years ago
Yan Chunwei
e2d325ac08
refactor pass_library ( #13261 )
7 years ago
tensor-tang
0618077971
Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
tensor-tang
40dbd97f8e
Merge remote-tracking branch 'ups/develop' into refine/op/peephole
7 years ago
tensor-tang
4d774953c6
enable fc gru fuse pass
7 years ago
tensor-tang
74f95b8da0
fix redefine macro
7 years ago
Tao Luo
7dd8adb5fd
Merge pull request #13211 from luotao1/multi-thread2
...
add multi-thread ut for ditu-rnn
7 years ago
tensor-tang
f057077c3a
add fuse fc gru pass
7 years ago
Yan Chunwei
f143c9a916
port VisualDL's graphviz theme to IR ( #13246 )
7 years ago
tensor-tang
3eb55f0643
Merge remote-tracking branch 'ups/develop' into refine/op/peephole
7 years ago
luotao1
35cff5e00d
Merge branch 'develop' into multi-thread2
7 years ago
Yan Chunwei
9df2d8b5ba
test/add text-classification test ( #13081 )
7 years ago
tensor-tang
2f3b498949
refine fusion seq lstm peephole
7 years ago
luotao1
2dc23ffaa8
Merge branch 'develop' into multi-thread2
7 years ago
Brian Liu
04272c0d41
Enable lstm peephole ( #13160 )
...
* Refine fusion lstm op code for better readability
* Enable peephole in fusion lstm op (seq_mode part) and add unit test
* Enable peephole in fused lstop op (batch_mode part)
Set batch_mode as default as well
* Use pre-commit to clean format
* Follow up review comments as well as adding more unit tests for seq mode
7 years ago
luotao1
459d4cc811
Merge branch 'develop' into multi-thread2
7 years ago
Tao Luo
907696709f
Merge pull request #13133 from luotao1/library
...
add static and shared Library for analysis and IR
7 years ago
Jiabin Yang
d091dd02a0
fix mac compile error 0903 ( #13184 )
7 years ago
Yan Chunwei
2ef34c64c6
refine fc with pattern reusing ( #13187 )
7 years ago
Yan Chunwei
796c87d563
bugfix/fusion lstm ( #13185 )
7 years ago
luotao1
6f18217386
fix codestyle
7 years ago
luotao1
9790acbde6
Merge branch 'library' of https://github.com/luotao1/Paddle into luotao1-library
7 years ago
luotao1
d7b4965785
auto generate paddle_inference_pass.h
7 years ago
dzhwinter
379b471ee2
squash commit
7 years ago
luotao1
0639a32477
Merge branch 'develop' into library
7 years ago
luotao1
37d1a6685c
Merge branch 'develop' into multi-thread2
7 years ago
tensor-tang
83f4bc4ecf
follow comment and refine code
7 years ago
tensor-tang
9838bacb35
Merge branch 'develop' into optimize/op/fusion_lstm
7 years ago
Yan Chunwei
597b73053d
refine/fc lstm fusion link ( #13158 )
7 years ago
Qiao Longfei
1835e91183
Merge pull request #13099 from jacquesqiao/fix-mac-compile
...
fix mac compie
7 years ago
dzhwinter
75681c0a79
switch to 9.2
7 years ago
dzhwinter
dbe90cc0f6
merge develop branch
7 years ago
Jiabin Yang
6ba2b22279
Merge pull request #13096 from JiabinYang/fix_mac
...
Fix Mac compile error
7 years ago
luotao1
fb077c17e6
add shared library for analysis
7 years ago
luotao1
0fbe0a7a28
add multi-thread ut for ditu-rnn
7 years ago
tensor-tang
80edd7ef29
enable run with fuse pass
7 years ago
Xin Pan
c709a04ae2
Merge pull request #13018 from panyx0718/name
...
support op_namescope for better debugging
7 years ago
Yan Chunwei
af15f6f038
fea/refine fuse ( #13076 )
7 years ago
qiaolongfei
583e21e60e
fix mac compie
7 years ago
Jiabin Yang
a9be969cf9
Update attention_lstm_fuse_pass.cc
7 years ago
JiabinYang
7c7d3d6172
Fix mac
7 years ago
Yan Chunwei
cfa6bbb755
move nodeid from graph to node ( #13065 )
7 years ago
Xin Pan
7dff0b11c6
fix
7 years ago
Xin Pan
0aa8135b11
Merge pull request #13045 from panyx0718/ir5
...
fix graph_to_program_test to be platform-independent
7 years ago
Xin Pan
d1058003fe
fix style
7 years ago
Xin Pan
51ef0ad766
allow to use name_scope for debugging and visiualization
7 years ago
Yan Chunwei
902f19b46a
fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand ( #13006 )
7 years ago
Xin Pan
c720ef972b
fix graph_to_program_test to be platform-independent
7 years ago
Xin Pan
2bb15f437c
Merge pull request #12791 from panyx0718/ir3
...
graph to program pass
7 years ago
Wu Yi
0ee6fed05b
Refine dist rpc deps ( #12899 )
...
* refine dist train RPC deps
* clean up
* clean up
* fix ut
* remove input for fetch_barrier
* follow comments
7 years ago
Xin Pan
08352fe56a
fix
7 years ago
Xin Pan
6fdb7f5348
add test
7 years ago
Xin Pan
1a67061fee
graph to program pass
...
fix a few other things
7 years ago
dzhwinter
efd0884fa9
add op registry
7 years ago
Tao Luo
ef2da86b4f
Merge pull request #12885 from luotao1/test_ditu_rnn
...
enhance test_analyzer to profile ditu inference demo
7 years ago
luotao1
9c7fde45a7
enhance test_analyzer to profile ditu inference demo
7 years ago
Wu Yi
b8da70c370
Resovle multi gpu async deps ( #12828 )
...
* dist transpiler add control dependency var between send and recv
* fix async deps
* follow comments and refine
* fix deps connect for rpc ops
7 years ago
Xin Pan
698c926ce5
copy program and fix op_desc
7 years ago
Yan Chunwei
9ee698e605
enhance/ditu rnn with fc fuse ( #12831 )
...
* make fc fuse work with ditu rnn
* add ditu rnn data download to CMAKE
7 years ago
Yan Chunwei
896a37b6e3
fea/link ir to inference analysis and fc fuse support ( #12789 )
...
* link IR graph to analysis graph
* add clean code and update
* add infer_clean_pass
* add ir_pass_manager
* support fc fuse executation
* fix ir circle
7 years ago
minqiyang
beb93bb901
Fix ut bug for graph_test
...
Port dist_transpiler new added codes
Port ut for clone desc
7 years ago
Xin Pan
e525aa232e
Merge pull request #12780 from panyx0718/ir4
...
fix ProgramToGraph
7 years ago
Xin Pan
1d3343240e
fix
7 years ago
Xin Pan
17b88811e0
fix ProgramToGraph
...
when while_grad, it writes multiple @EMPTY@ with no VarDesc.
7 years ago
Xin Pan
a9217031ba
small fix
7 years ago
Qiao Longfei
64d48f4d6a
fix mac compile ( #12751 )
7 years ago
Yan Chunwei
5d2834fcf7
fea/ir support fuse, based on graph pattern detection helper ( #12636 )
7 years ago
Xin Pan
891c3c0f9a
test and doc IR Graph
7 years ago
chengduo
64824ac73f
Add write after write dependence ( #12632 )
...
* Add write after write
* follow comment
7 years ago
nhzlx
a2749adf5d
fix warning
7 years ago
Xin Pan
4b8ae523c4
Merge pull request #12367 from panyx0718/ir_pass
...
Ir pass
7 years ago
Xin Pan
25706d0868
properly set up dep of concat and fetch_bar
7 years ago
Xin Pan
5fff8d7a55
add distributed training deps.
7 years ago
Xin Pan
99c0c20468
add pass test
7 years ago
Xin Pan
ab72d28a5e
clean up and correctness check
7 years ago
Xin Pan
aa1085ddc5
all passes
...
add doc
7 years ago
Xin Pan
e4d7d7ae8f
pass refactoring
7 years ago
Xin Pan
142e832d21
pass registration
7 years ago
Xin Pan
5b183557f3
graph viz pass
7 years ago
Xin Pan
5173a53c8a
fix reorder issue.
7 years ago
Xin Pan
21a45420f0
polish and test
7 years ago
Xin Pan
2782e71a11
fix graph_test
7 years ago
Xin Pan
93355cc0d2
fix control deps
7 years ago
Xin Pan
f6d99d1f73
polish
7 years ago
Xin Pan
c3f6e0e8a2
add namespace to Graph
7 years ago
Xin Pan
0b3465d215
better
7 years ago
Xin Pan
dcaf183daa
builder SSA graph at the beginning.
7 years ago
Xin Pan
1a3b84369c
disable warning
7 years ago
Dang
6f72b3b486
Fix ir CMake dependence.
7 years ago
Qiao Longfei
37713f2282
Merge pull request #12244 from jacquesqiao/fix-mac-build
...
fix mac build
7 years ago
qiaolongfei
0e30c9d6fb
fix mac build
7 years ago
yuyang18
7cd5f906b7
Correct IR dependencies
7 years ago
Xin Pan
950585f419
follow comments
7 years ago
Xin Pan
5daad16218
polish
7 years ago
Xin Pan
ff5a7b67ed
polish
7 years ago
Xin Pan
62e2aa115d
add a graph_test
7 years ago
Xin Pan
9c9e28b57b
fix program to graph
7 years ago
Xin Pan
64eaa4c829
clean
7 years ago