Paddle

Commit Graph

Author	SHA1	Message	Date
Sylwester Fraczek	bccb0ba49a	fix quantize_squash_pass segfault when no tensor linked to Bias (#17292 ) * fix quantize_squash_pass segfault when there is no tensor linked to Bias input test=develop * add googlenet test test=develop * fix concat CreateKey not using input format test=develop	6 years ago
Qiao Longfei	58f7695ab2	Async exe support communicator (#17386 ) Async exe support communicator	6 years ago
guomingz	2281ebf0f3	Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130 ) * Relu6 is the bottleneck op for Mobilenet-v2. As mkldnn supports the conv/relu6 fusion, we implement this fusion via the cpass way. Because int8 enabling for this fusion will be supported in MKLDNN v0.20, this PR is focused on the fp32 optimization. Benchmark (FPS) measured on skx-8180 (28 cores): batch size 1: 214.7 with fusion, 53.4 without fusion; batch size 50: 1219.727 with fusion, 137.280 without fusion. test=develop * Fix the format issue test=develop * Add the missing nolint comments. test=develop * Fix the typos. test=develop * Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine. test=develop * Adjust the indentation. test=develop * Add the test_conv_brelu_mkldnn_fuse_pass case. test=develop * Slightly update the code per Baidu comments. Let the parameter definition be embedded into the code. That will make the code easy to understand. test=develop	6 years ago
liuwei1031	c3949f5699	remove two useless flags: enable_subgraph_optimize, memory_optimize_debug, test=develop (#17491 )	6 years ago
Tao Luo	32da5e9c3d	remove unused expected_kernel_cache_pass (#17486 ) test=develop	6 years ago
Zeng Jinle	712bfb17cb	fix recurrent_op,test=develop (#17433 )	6 years ago
Zhen Wang	4a1b7fec96	Add setting Scope function for the graph class (#17417 ) * add set_not_owned function for graph * add scope set. test=develop * add scope_ptr enforce not null before setting.test=develop	6 years ago
chengduo	04bd413acb	Code Clean: Move all pass to paddle::framework::ir (#17228 ) * move pass to ir * polish code test=develop * fix dependency test=develop	6 years ago
Zeng Jinle	4f8594088d	Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225 ) * add use_cuda to inplace pass,test=develop * add test softmax_with_xe_inplace test,test=develop * fix potential inplace bug test=develop * add more skip vars in mem opt pass,test=develop * follow comment,test=develop * follow comments,move duplicate out arg check to program->graph,test=develop	6 years ago
石晓伟	a72dbe9abf	Cherry-pick benchmark related changes from release/1.4 (#17156 ) * cherry-pick commit from `8877054` * cherry-pick commit from `3f0b97d` * cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn (cherry picked from commit `8643dbc233`) * Cherry-Pick from 16662 : Anakin subgraph cpu support (cherry picked from commit `7ad182e16c`) * Cherry-pick from 1662, 16797.. : add anakin int8 support (cherry picked from commit `e14ab180fe`) * Cherry-pick from 16813 : change singleton to graph RegistBlock test=release/1.4 (cherry picked from commit `4b9fa42307`) * Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2 Support ShuffleNet and MobileNet-v2, test=release/1.4 (cherry picked from commit `a6fb066f90`) * Cherry-pick : anakin subgraph add opt config layout argument #16846 test=release/1.4 (cherry picked from commit `8121b3eccb`) * 1. add shuffle_channel_detect (cherry picked from commit `6efdea8997`) * update shuffle_channel op convert, test=release/1.4 (cherry picked from commit `e4726a066f`) * Modify symbol export rules test=develop	6 years ago
tensor-tang	79ed1c76cd	fix bn fuse vardesc and add model saver (#17143 ) * fix bn fuse vardesc and add model saver test=develop * unify save model in test helper test=develop * fix mkdir on windows test=develop * remove magic number use bn bias var desc test=develop	6 years ago
Zeng Jinle	4e1bc6e805	Rewrite inplace pass and fix gc bug (#17126 ) * fix op graph view test=develop * rewrite inplace pass and fix reference count pass bug test=develop * fix unittest failed test=develop * follow comments, test=develop	6 years ago
Yihua Xu	93cedfdb9c	Fix the order while sorting the operators (#16756 ) * Fix the order when sorting operators. test=develop * Enable transformer compare test item. test=develop * Use set to replace vector. test=develop	6 years ago
Yiqun Liu	112f16143b	Add an option to enable the cache of expected kernel in train phase. (#16724 ) * Add an option to enable the cache of expected kernel in train phase. test=develop * Change the default value of cache_expected_kernel to true.	6 years ago
Tao Luo	ad4a1bd13c	Merge pull request #16339 from luotao1/core_opt_choose_kernel Cache the chosen kernel of operators	6 years ago
Yiqun Liu	3fe8cb0dd7	Enable the runtime_context_cache pass in train phase (#16640 ) * Try to enable the runtime_context_cache pass in train phase. * Put the append of runtime_context_cache pass ahead of multi_dev passes. test=develop	6 years ago
luotao1	695f2db6a0	update expected_kernel_cache_pass test=develop	6 years ago
luotao1	226596a296	Merge branch 'develop' into core_opt_choose_kernel	6 years ago
gongweibao	423bc515da	fix batch merge bug (#16601 )	6 years ago
Qiao Longfei	baf02328b2	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator test=develop	6 years ago
Qiao Longfei	d8974e6da0	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator test=develop	6 years ago
nhzlx	d065b5bf2b	Anakin ssd support refine trt first run add quant dequant fuse pass omit simplify_anakin_priorbox_detection template omit transpose_flatten_concat_fuse template test=develop	6 years ago
chengduo	ed61d67c73	Fix the interface of Pass::Apply (#16484 ) * modify the interface of Pass::Apply test=develop * Polish code test=develop * Fix Travis CI test=develop * fix Pass::Apply interface test=develop * Fix Travis CI test=develop	6 years ago
Qiao Longfei	392e97aae5	fix cpplint test=develop	6 years ago
Qiao Longfei	30618409db	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator	6 years ago
nhzlx	953bdde058	Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD test=develop	6 years ago
Wojciech Uss	46677fb080	Move cpu_quantize_* passes into mkldnn subfolder test=develop	6 years ago
nhzlx	3df7b98a0f	Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD	6 years ago
luotao1	056599a738	add expected_kernel_cache_pass test=develop	6 years ago
Wojciech Uss	cbe2dbf0db	Add enabling quantization (#16326 ) * Add enabling quantization test=develop * remove unused (here) function	6 years ago
Tao Luo	9a05859179	Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass fix pattern matching conv2d with(out) ResidualData	6 years ago
nhzlx	c407dfa3cb	cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276	6 years ago
nhzlx	a25331bc26	cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189	6 years ago
nhzlx	69d37f81d7	cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157 support change input size	6 years ago
nhzlx	a1d200a5de	cherry-pick from feature/anakin-engine: Anakin support facebox #16111	6 years ago
Wojciech Uss	104a9f1e27	fix pattern matching conv2d with(out) ResidualData test=develop	6 years ago
luotao1	82af8031d9	add runtime_context_cache_pass test=develop	6 years ago
Tao Luo	7d2740db83	Revert "cache runtime_context"	6 years ago
Qiyang Min	c7f1f3ed0c	Merge pull request #16214 from velconia/imperative_infer_var_type Implement imperative infer var type	6 years ago
Wojciech Uss	af03008890	Add cpu_quantize_placement_pass for C-API quantization (#16265 ) * Add cpu_quantize_placement_pass for C-API quantization test=develop * added a comment on required pass attributes test=develop	6 years ago
minqiyang	b40e41fbd1	Polish code style test=develop	6 years ago
minqiyang	36dce65bb3	Take DataType and VarType apart test=develop	6 years ago
luotao1	cc0ae1f1a1	refine with comments test=develop	6 years ago
luotao1	a275fd6e0c	Merge branch 'develop' into runtime_context	6 years ago
Wojciech Uss	2579ade45f	Add cpu_quantize_pass for C-API quantization (#16127 ) * Add cpu_quantize_pass for C-API quantization test=develop * add cpu_quantize_pass test * fix lint: add include memory unordered_map and unordered_set test=develop * fuse_relu 1 test=develop * tuned 2 without squash * fixes test=develop * remove unused vars test=develop * refactored test=develop * fix lint c-style cast -> C++ style cast test=develop * remove QuantMax and c style casts test=develop * last usage of QuantMax removed test=develop * Fix Analysis Predictor UT Check if memory_optimize_pass has already been added to the analysis config before adding a new one, so that it is not added multiple times. test=develop * change map to unordered_map fix the forgotten part of cpu_quantize_pass_tester.cc test=develop * removed quantized attribute * fixed cpu_quantize_pass_tester and op attr comments test=develop * removed redundant line test=debug * removed gmock test=develop * fix after merge	6 years ago
qingqing01	86e912c544	Fix windows compiling (#16230 ) test=develop	6 years ago
luotao1	1b59bed989	Merge branch 'develop' into runtime_context	6 years ago
luotao1	6ce25c99a0	Merge branch 'develop' into runtime_context	6 years ago
qingqing01	8ad672a287	Support sync batch norm. (#16121 ) * Support Sync Batch Norm. * Note, do not enable it in one device. Usage: build_strategy = fluid.BuildStrategy() build_strategy.sync_batch_norm = True binary = fluid.compiler.CompiledProgram(tp).with_data_parallel( loss_name=loss_mean.name, build_strategy=build_strategy)	6 years ago
minqiyang	ca392c7e97	Implement infer var type context	6 years ago
Wojciech Uss	b9252f3df8	Add cpu_quantize_squash_pass for C-API quantization (#16128 ) * Add cpu_quantize_squash_pass for C-API quantization test=develop * add cpu_quantize_squash_pass teste * fix lint: add include memory unordered_map and unordered_set test=develop * lint fix 2 * fixes test=develop * refactored test=develop * fix windows ci test=develop	6 years ago
luotao1	d94fd97230	add runtime_context_cache_pass test=develop	6 years ago
Zhen Wang	41b8cf0bae	Merge pull request #16162 from wzzju/fix_nan_static_quant Fix NaN bugs for static quantization strategy (multi-cards train).	6 years ago
Zhen Wang	5685a48c23	Add some fixme. test=develop	6 years ago
Zhen Wang	ac6ef06ffa	Add the Clone method in Graph. test=develop	6 years ago
Zhen Wang	01eddf125c	Not add graph copy construction method. test=develop	6 years ago
Zhen Wang	1b9c8d5f06	add clone function for IrGraph. test=develop	6 years ago
Yihua Xu	40f1dd818b	Fix the node's order issue when the content of graph is changed (#16088 ) * Fix the node's sort issue when the graph is changed. test=develop * Clean code test=develop	6 years ago
Qiao Longfei	fab1b54d99	Merge branch 'add-communicator' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator	6 years ago
nhzlx	2eff3e26b6	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_static_model_load_for_trt	6 years ago
nhzlx	06a088a199	fix comments and fix cpplint test=develop	6 years ago
Krzysztof Binias	72253391b6	Add MKL-DNN placement pass tester test=develop	6 years ago
Qiao Longfei	49f2f4f91d	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator	6 years ago
Michal Gallus	6a2bc9a275	Add Conv Residual Connection UT for Projection test=develop	6 years ago
Xin Pan	a6e3cd5eb7	Merge pull request #15425 from panyx0718/api Pass graph to parallel executor instead of program	6 years ago
Qiao Longfei	b8491bfd4e	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator	6 years ago
Xin Pan	0362ef75f4	fix test=develop	6 years ago
Xin Pan	12a0e2ed9d	polish codes test=develop	6 years ago
Xin Pan	19d78f6797	polish test=develop	6 years ago
Xin Pan	32d5a16036	resolve conflicts test=develop	6 years ago
Michał Gallus	c4faf36e7a	MKL-DNN: Add test for conv bias fuse pass (#15824 ) * MKL-DNN: Add test for conv bias fuse pass test=develop * Remove const cast from Conv Bias Pass Test * Add conv with bias test case for conv+bias fuse ut test=develop	6 years ago
Xin Pan	26e32e095a	allow compiler to use graph test=develop	6 years ago
Sylwester Fraczek	0b926114c0	add override to ApplyImpl and #pragma once in edited headers add #include<string> in edited headers test=develop	6 years ago
Xin Pan	6019054cdd	Merge pull request #15716 from Yancey1989/refine_pg Refine ParallelGraph Execution	6 years ago
tensor-tang	e1c707fe9c	fix warnings (#15790 ) * fix warnings test=develop * fix enforce test test=develop	6 years ago
Yancey1989	4b193db14c	polish code test=develop	6 years ago
Yancey1989	642fd68ce0	update by comment test=develop	6 years ago
Yan Chunwei	077d12b939	fix scale cleaner (#15742 )	6 years ago
nhzlx	ecc12fb430	3. when running in trt mode, do not allocate memory for parameters in fluid. test=develop	6 years ago
Yancey1989	ecdd1166b8	cleanup code test=develop	6 years ago
Yancey1989	73005ee00d	cleanup code test=develop	6 years ago
Yancey1989	88d3dc949e	Merge branch 'develop' of github.com:PaddlePaddle/Paddle into refine_pg test=develop	6 years ago
Yancey1989	f3463ecb6e	refine pg execution	6 years ago
dzhwinter	f9ac88e1a0	Merge pull request #15694 from liuwei1031/fix_security_issue Fix security issue	6 years ago
tensor-tang	e49706c80e	Merge pull request #15659 from GBuella/add_to_string Tests - add some missing to_string calls	6 years ago
liuwei1031	b1f97a6fa9	fix security issue 27, 38 test=develop	6 years ago
Gabor Buella	da9c94da33	Clang build fixes (#15628 ) * Remove some superfluous std::move calls The std::move triggered a build error (with -Werror): ``` [ 9%] Building CXX object paddle/fluid/memory/allocation/CMakeFiles/allocator_facade.dir/allocator_facade.cc.o /home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: error: moving a temporary object prevents copy elision [-Werror,-Wpessimizing-move] [this] { return std::move(CreateAllocatorWithChunk()); }, capacity); ^ /home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: note: remove std::move call here [this] { return std::move(CreateAllocatorWithChunk()); }, capacity); ^~~~~~~~~~ ~ 1 error generated. ``` See: https://reviews.llvm.org/D7633 * Remove a superfluous lambda capture from framework/operator.h ``` [ 10%] Building CXX object paddle/fluid/platform/CMakeFiles/device_context.dir/init.cc.o In file included from /home/tej/code/gbuella_paddle/paddle/fluid/platform/init.cc:19: /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.h:229:21: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture] [this](Variable* var) { return var; }); ^~~~ 1 error generated. ``` Changing it to `return it->second;`, as is in the function below. * Rethrow an exception (instead of copying it) ``` [ 11%] Building CXX object paddle/fluid/framework/CMakeFiles/operator.dir/operator.cc.o /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: error: local variable 'exception' will be copied despite being thrown by name [-Werror,-Wreturn-std-move] throw exception; ^~~~~~~~~ /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: note: call 'std::move' explicitly to avoid copying throw exception; ^~~~~~~~~ std::move(exception) ``` See https://reviews.llvm.org/D43322 for an explanation of this diagnostic message. * Remove an unused variable ``` /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:884:16: error: private field 'scope_' is not used [-Werror,-Wunused-private-field] const Scope& scope_; ^ ``` * struct ComputationOpHandle -> class ComputationOpHandle ``` [ 13%] Building CXX object paddle/fluid/framework/details/CMakeFiles/memory_early_delete_pass.dir/memory_early_delete_pass.cc.o In file included from /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/memory_early_delete_pass.cc:21: /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: error: class 'ComputationOpHandle' was previously declared as a struct; this is valid, but may result in linker errors under the Microsoft C++ ABI [-Werror,-Wmismatched-tags] class ComputationOpHandle; ^ /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/computation_op_handle.h:29:8: note: previous use is here struct ComputationOpHandle : public OpHandleBase { ^ /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: note: did you mean struct here? class ComputationOpHandle; ^~~~~ struct 1 error generated. ``` * Fix name() methods under fluid/operators ``` In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.cc:15: In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.h:19: /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/jitcode.h:71:23: error: 'name' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override] virtual const char* name() const = 0; ^ /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen_base.h:31:23: note: overridden virtual function is here virtual const char* name() const = 0; ^ ``` test=develop	6 years ago
Dun Liang	1905f1a108	bug fix && test=develop	6 years ago
Qiao Longfei	5cf0092825	add more log and fix test_dist_base in multi_batch_merge_pass	6 years ago
Gabor Buella	4975a9050a	Tests - add some missing to_string calls ``` /home/tej/code/gbuella_paddle/paddle/fluid/framework/ir/seqpool_concat_fuse_pass_tester.cc:167:40: error: adding 'int' to a string does not append to the string [-Werror,-Wstring-plus-int] std::string prefix = "seqpool_op_" + i; ~~~~~~~~~~~~~~^~~ /home/tej/code/gbuella_paddle/paddle/fluid/framework/ir/seqpool_concat_fuse_pass_tester.cc:167:40: note: use array indexing to silence this warning std::string prefix = "seqpool_op_" + i; ^ & [ ] 1 error generated. ``` test=develop	6 years ago
peizhilin	238ef94702	fix the build issue on gpu mode for win test=develop	6 years ago
Xin Pan	74bc55c2a6	Merge pull request #14975 from dzhwinter/ir_inplace_pass Ir inplace pass	6 years ago
Yan Chunwei	dc5e25fc7f	remove dot marked node (#15606 )	6 years ago
dzhwinter	9c9ad7d40b	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass test=develop	6 years ago
Yan Chunwei	897789b16e	fix save_inference_model bug (#15365 )	6 years ago
dzhwinter	6f9904e99a	rerun windows ci. test=develop	6 years ago
dzhwinter	a52be7c081	refine build strategy. test=develop	6 years ago
Krzysztof Binias	b1bdcd4de8	Make separate folders for mkldnn codes test=develop	6 years ago
dzhwinter	ee3aae56cd	merge develop branch. test=develop	6 years ago
dzhwinter	d6d3e6afe2	add more skip strategy	6 years ago
nhzlx	0779e35544	fix two bug: 1. graph and program_desc alignment 2. trt stream test=develop	6 years ago
WangZhen	3ce6172052	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization	6 years ago
dzhwinter	8f3b252392	squash commits. test=develop	6 years ago
Yan Chunwei	885c4e57ab	fea/infer memory optim2 (#14953 )	6 years ago
Dun	9f8f0fc2d3	Memory optimization of depthwise conv op and group norm op (#15313 ) * mem opt * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine with cub test=develop * fix mkldnn test && remove comments && test=develop * polish code && test=develop * add only_forward test && test=develop	6 years ago
WangZhen	451896fce4	init quantization.	6 years ago
tensor-tang	d618e48309	fix fuse square mat order and refine test test=develop	6 years ago
tensor-tang	a5d2a6d1ad	add fuse pass of squared mat sub fusion	6 years ago
tensor-tang	ca6fdc6e33	refine and fix test test=develop	6 years ago
tensor-tang	a89296ac1f	add repeated fc relu pass	6 years ago
Zhaolong Xing	98e85f3735	add_transpose_flatten_concat_fuse (#15121 )	6 years ago
tensor-tang	c3a9f3c4b2	fix typo and refine test=develop	6 years ago
tensor-tang	ab9c4b2a9f	refine seqpool concat pass and remove unused nodes test=develop	6 years ago
tensor-tang	ce909664d8	Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed	6 years ago
flame	fb63cd89d4	Add python ir graph API (#14917 )	6 years ago
tensor-tang	a0a27bd240	add seqpool concat fuse pass tester test=develop	6 years ago
tensor-tang	8e086a8521	follow comment and fix typo test=develop	6 years ago
tensor-tang	72d2a1801e	add seqpool concat fuse pass test=develop	6 years ago
minqiyang	b76695418a	Polish log test=develop	6 years ago
minqiyang	4bfa110fd8	Add no lock optimize pass test=develop	6 years ago
Wu Yi	e26cced7cc	refine batch merge pass (#14777 ) * refine batch merge pass * refine batch merge pass test=develop	6 years ago
Zhaolong Xing	4048cfa9da	Merge pull request #15048 from NHZlX/add_affine_channel_fuse Add conv+ affine channel fuse pass	6 years ago
chengduo	fe8495a758	[WIP] Refine MultiDevSSAGraph (#15040 ) * refine parallel_exe test=develop * rename shared_var_device * code refine * add test_weight_decay * remove Sort test=develop * Add SortForReduce test=develop * code refine test=develop * follow comment test=develop	6 years ago
nhzlx	a6aa8ea771	faster rcnn input is persistable. (fix it in paddle-trt) test=develop	6 years ago
hjchen2	956cf92145	Fix conv_elementwise_add2_act pass test=develop	6 years ago
nhzlx	ce3782c193	add affine_channel fuse. fix conv+elementwise fuse bug.	6 years ago
Tomasz Patejko	e9eee0de6a	MKLDNN residual connection fuse: fixing accuracy problem (#14874 ) * MKLDNN residual connection fuse: conv op reused test=develop * MKLDNN residual connection fuse: added prints for checking fuse test=develop * MKLDNN residual connection fuse: add more prints test=develop * MKLDNN residual connection fuse: add hash function. test=develop * MKLDNN residual connection fuse: add hash to elementwise_add test=develop * MKLDNN residual connection fuse: add more hashes. test=develop * MKLDNN residual connection fuse: added hashes to relu test=develop * MKLDNN residual connection fuse: do not fuse when fuse_relu is on * MKLDNN residual connection fuse: check if fuse_relu attribute is set test=develop * MKLDNN residual connection fuse: comment out some printouts * MKLDNN residual connection fuse: remove unused functions in the pass code * MKLDNN residual connection fuse: delete commented hashes and printouts * MKLDNN residual connection fuse: remove unnecessary includes. test=develop	6 years ago
Zhaolong Xing	a9fb34fad8	Merge pull request #14903 from NHZlX/add_conv_elementwise_pass Add conv + elementwiseAdd pass	6 years ago
dzhwinter	7cd24b1318	add ir memory optimize. (#14530 ) * follow comments. test=develop * Fix typo * fix compile error. test=develop * merge develop branch. test=develop * Remove set_equal * Polish code * Delete unused functions test=develop * polish code. test=develop * follow comment * polish code. * fix windows compile error. test=develop * fix op handle. * rerun ci. test=develop * rerun ci. test=develop * rerun macci. test=develop * polish code. test=develop * rewrite sort code. test=develop * remove unused code. test=develop * fix tests. test=develop * fix conflict. test=develop * follow comment. test=develop * merge develop branch. test=develop * fix tests. test=develop * remove ToTypeIndex. test=develop * rerun ci. test=develop	6 years ago
nhzlx	c0c9fcd9c7	add source file test=develop	6 years ago
nhzlx	4e4a777243	add conv+elementwiseadd pass test=develop	6 years ago
Yan Chunwei	a985949be9	Fea/fuse conv elementwise add fuse (#14669 )	6 years ago
sneaxiy	7923042365	merge develop test=develop	6 years ago
Tao Luo	3243b45bb2	Merge pull request #14651 from luotao1/has_attr add Opdesc's HasProtoAttr	6 years ago
Tao Luo	067ed70f2d	add HasProtoAttr function in op_desc.h, clean node.h test=develop	6 years ago
Xin Pan	748549b2e3	Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative" This reverts commit `b1d3a1c8b4`, reversing changes made to `f1fb64b17f`.	6 years ago
bingyanghuang	943ad4781f	One possible solution to add flexibility for mkldnn placement pass (#14768 ) * Choose to turn on use_mkldnn attribute v1 * Fix mkldnn_op empty bug * format change test=develop * fix ci test=develop * fix ci test and add test in dam test=develop * add example to dam compare test test=develop * review changes test=develop	6 years ago
Xin Pan	c049fa7cf7	Revert "Revert "Revert "Imperative"""	6 years ago
Tao Luo	c83d5b7a16	Merge pull request #14709 from yihuaxu/develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl Implement the fusion of convolution 3D and bias for mkldnn	6 years ago
Yihua Xu	3821fc3950	Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl test=develop	6 years ago
Yihua Xu	240d974ac5	Clean Code test=develop	6 years ago
Xin Pan	2538ef64f1	Revert "Revert "Imperative""	6 years ago
Xin Pan	6217f42ab7	Revert "Imperative"	6 years ago
Xin Pan	b52f5d2870	Merge pull request #14398 from panyx0718/imperative Imperative	6 years ago
Xin Pan	0591ba96ec	fix hack test=develop	6 years ago
sneaxiy	64ad051b9a	merge develop test=develop	6 years ago
Yihua Xu	64e261c6cd	Implement the fusion of convolution and bias for mkldnn (test=develop)	6 years ago
Tao Luo	8d6984eb9b	change OpHasAttr to RuntimeHasAttr, add some comments test=develop	6 years ago
Xin Pan	aeb74af54c	allow operator to run imperatively	6 years ago
sneaxiy	096673f675	refactor eager deletion test=develop	6 years ago
Tao Luo	c856ac8721	add OpHasAttr in node.h, update is_test_pass and mkldnn_placement_pass test=develop	6 years ago
minqiyang	be04d99fe4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog test=develop	6 years ago
minqiyang	53433d7f2e	Revert the changes of VLOG test=develop	6 years ago
peizhilin	7c8c9dc9bf	fix unit test cases	6 years ago
chengduo	a94a7355f0	Refine the GraphNum check (#14144 ) * refine GraphCheck test=develop * fix ci fail test=develop	6 years ago
Tomasz Patejko	53da846d1e	MKLDNN residual connections fuse pass: initial implementation of fusion for projection pass test=develop	6 years ago
Tomasz Patejko	dbc4fcd722	MKLDNN residual connections fuse pass: unit tests enabled and added	6 years ago
Tomasz Patejko	4224089354	MKLDNN residual connections fuse pass: Maybe removed and boost::optional used where it makes sense	6 years ago
Tomasz Patejko	86fd3b32be	MKLDNN residual connections fuse pass: counting statistics added to the pass	6 years ago
Tomasz Patejko	ee6f778beb	MKLDNN residual connections fuse pass: further refactoring	6 years ago
Tomasz Patejko	7423748e37	MKLDNN residual connections fuse pass: * implements reachability check between identity node and non-identity argument to elementwise_add * implements handling identity node as x and as y argument to elementwise_add	6 years ago
Sylwester Fraczek	8a1eeec579	add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278 ) * add is_test to pooling and activations add prop_kind support for layers activation. conv and pooling add a pass that sets is_test to true add transpiler version of is_test pass test=develop * patch test and pass test=develop * add pass to analyzer.h test=develop * add is_test attr description & pass only on mkldnn in: activation_op.cc batch_norm_op.cc conv_op.cc dropout_op.cc lrn_op.cc pool_op.cc sequence_pool_op.cc softmax_op.cc * fix is_test handling for activation pool and conv * change description of is_test for all layers again * remove GetAttr(use_mkldnn) from pass * rename correct_mkldnn_test_phase to is_test and remove dependency on MKLDNN test=develop * review fix magic number * two if(..)s into one * Check is_test once and pass mkldnn forward prop kind * dereference shared_ptr with * (without get()) test=develop * add is_test_pass back test=develop	6 years ago
Tao Luo	5ef123c778	Merge branch 'develop' into dam_fc	6 years ago
Tao Luo	980a6753a8	fix typo to pass the ci test=develop	6 years ago
peizhilin	1a9008c420	code style fix test=develop	6 years ago
Tao Luo	8ea13e336a	add in_num_col_dims for fc	6 years ago
peizhilin	08d1dc84a9	fix	6 years ago
peizhilin	30ddc07a7e	Merge remote-tracking branch 'upstream/develop' into windows/build	6 years ago
Yan Chunwei	9f252e0032	Combine Inference Analysis with IR (#13914 )	6 years ago
Tao Luo	6c32945556	Merge pull request #14372 from luotao1/speedup_analysis speedup DetectPatterns	6 years ago
Tao Luo	668ae523d2	speedup DetectPatterns test=develop	6 years ago
Yan Chunwei	9a6e239281	fix mac graph detector sort (#14356 )	6 years ago
peizhilin	dc339b78d7	fix code style	6 years ago
peizhilin	ca60e1d34d	Merge remote-tracking branch 'upstream/develop' into windows/build	6 years ago
peizhilin	d08334011a	fix merge issue	6 years ago
peizhilin	52f7644f53	Merge remote-tracking branch 'upstream/develop' into windows/build	6 years ago
qingqing01	abe209234f	Exhaustive search for cuDNN conv. (#14286 ) * exhaustive search for cuDNN conv. * Refine code and add unit testing. * Fix model load in fluid/inference and unit testing in conv2d * Follow comments. * Fix compiling test=develop	6 years ago
minqiyang	87450b9ad4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog test=develop	6 years ago
peizhilin	4ffa92d4f0	Merge branch 'develop' into windows/build	6 years ago
Tao Luo	813e54efbd	Merge pull request #14328 from PaddlePaddle/revert-14046-windows/debug Revert "cherry picked windows patches."	6 years ago
minqiyang	3db9fad764	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog test=develop	6 years ago
Xin Pan	ff6c809bfc	Merge pull request #14251 from panyx0718/fix Make OpHandle/VarHandle and ir::Node works cleaner	6 years ago
Zhaolong Xing	ba8b5619a3	Revert "cherry picked windows patches."	6 years ago
minqiyang	fcc0452c8b	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog test=develop	6 years ago
minqiyang	0c3227a523	Change the origin VLOG level to 10 times Fix code to support cpplint syntax check test=develop	6 years ago
Xin Pan	59c66532e7	add more logs and comments test=develop	6 years ago
peizhilin	869487a2b7	Merge remote-tracking branch 'origin/develop' into windows/build	6 years ago
dzhwinter	2835e04409	merge develop branch. test=develop	6 years ago
qingqing01	db8c52da5e	Revert " Exhaustive search for cuDNN conv. (#14043 )" This reverts commit `ce7d9b0799`.	6 years ago
qingqing01	ce7d9b0799	Exhaustive search for cuDNN conv. (#14043 ) * exhaustive search for cuDNN conv. * Refine code and add unit testing. * Clean code * Fix model load in fluid/inference and unit testing in conv2d * Follow comments.	6 years ago
Xin Pan	25123a3b7e	add tests test=develop	6 years ago
Xin Pan	8c11d3fed6	clean up	6 years ago
Xin Pan	adf5615e54	clean kGraphOp test=develop	6 years ago
Xin Pan	fb576cb5cb	allow to compare type test=develop	6 years ago
Xin Pan	ead94bfc6c	fix destructor test=develop	6 years ago
Xin Pan	2e14999942	clean1 test=develop	6 years ago
peizhilin	9d67c1fb69	cpu build support	6 years ago
Xin Pan	c2d70fca30	fix to only check block 0 test=develop	6 years ago
Xin Pan	aaeedd0ff3	make it warn test=develop	6 years ago
Xin Pan	ddd2225b56	add more debug info. test=develop	6 years ago
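
Note on commit 4975a9050a ("Tests - add some missing to_string calls"): the quoted clang diagnostic fires because adding an `int` to a string literal is pointer arithmetic, not concatenation. The following is a minimal C++ sketch of that behaviour and of the `std::to_string` fix named in the commit title. The helper names `PointerOffsetPrefix` and `MakePrefix` are hypothetical illustrations and are not taken from the Paddle sources.

```cpp
#include <string>

// Hypothetical helper: spells out what `"seqpool_op_" + i` actually means.
// The literal decays to a const char*, and adding i advances that pointer
// by i characters. Only valid for 0 <= i <= 11 (the literal's length).
std::string PointerOffsetPrefix(int i) {
  const char* base = "seqpool_op_";
  return std::string(base + i);
}

// Hypothetical helper: the intended behaviour. The integer is converted to
// text first, so operator+ appends its digits to the prefix.
std::string MakePrefix(int i) {
  return "seqpool_op_" + std::to_string(i);
}
```

For example, `MakePrefix(3)` yields `seqpool_op_3`, whereas the pointer-offset form with `i == 3` yields `pool_op_`, which is why clang rejects the original expression under `-Werror,-Wstring-plus-int`.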


561 Commits (5b97b8556eb21698f621504c9f7031db1de58bb5)