Commit Graph

479 Commits (bb8d778358e97702d9cc232e322444fb4fc4d617)

Author SHA1 Message Date
chengduo 7453857324 Make fuse_all_reduce_op_pass support mix_precision (#17652)
6 years ago
Michał Gallus 7023a86c3a Fix Pooling output scale (#18186)
6 years ago
Sylwester Fraczek 9252e8fa08 add int8 mkldnn prior_box (#17242)
6 years ago
chengduo 14e1e165df
update alloc_continuous_space_for_grad_pass (#18287)
6 years ago
gongweibao f5caf3443c
Fix reinitialized ncclid error! (#18025)
6 years ago
gongweibao da9143c1cc
Polish codes of old prs. (#17938)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago
Zeng Jinle 3ece61f71e
Remove attribute in Allocator::Allocate (#17878)
6 years ago
gongweibao 972c54cd70
Fix FLAGS_fuse_parameter_memory_size unit from Bytes to MBytes. (#17924)
6 years ago
gongweibao fbbdc9ccad
Add backward and optimizer operator dependency pass. (#17746)
6 years ago
Yiqun Liu 8fd39f3e99
Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236)
6 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
6 years ago
Jacek Czaja 6d8075ecef [MKL-DNN] conv_transpose mkldnn bias pass (#17644)
6 years ago
Sylwester Fraczek 96845d2168 add Concat quantization (#17448)
6 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
Michał Gallus 0c39b97b4e [MKL-DNN] Add Fully Connected Op for inference only(#15226)
6 years ago
Sylwester Fraczek 5b2a3c4b12 Conv concat relu quantization (#17466)
6 years ago
Sylwester Fraczek bccb0ba49a fix quantize_squash_pass segfault when no tensor linked to Bias (#17292)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
guomingz 2281ebf0f3 Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130)
6 years ago
liuwei1031 c3949f5699
remove two useless flags: enable_subgraph_optimize, memory_optimize_debug, test=develop (#17491)
6 years ago
Tao Luo 32da5e9c3d
remove unused expected_kernel_cache_pass (#17486)
6 years ago
Zeng Jinle 712bfb17cb
fix recurrent_op,test=develop (#17433)
6 years ago
Zhen Wang 4a1b7fec96
Add setting Scope function for the graph class (#17417)
6 years ago
chengduo 04bd413acb
Code Clean: Move all pass to paddle::framework::ir (#17228)
6 years ago
Zeng Jinle 4f8594088d
Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225)
6 years ago
石晓伟 a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 (#17156)
6 years ago
tensor-tang 79ed1c76cd
fix bn fuse vardesc and add model saver (#17143)
6 years ago
Zeng Jinle 4e1bc6e805
Rewrite inplace pass and fix gc bug (#17126)
6 years ago
Yihua Xu 93cedfdb9c Fix the order while sorting the operators (#16756)
6 years ago
Yiqun Liu 112f16143b
Add an option to enable the cache of expected kernel in train phase. (#16724)
6 years ago
Tao Luo ad4a1bd13c
Merge pull request #16339 from luotao1/core_opt_choose_kernel
6 years ago
Yiqun Liu 3fe8cb0dd7
Enable the runtime_context_cache pass in train phase (#16640)
6 years ago
luotao1 695f2db6a0 update expected_kernel_cache_pass
6 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
gongweibao 423bc515da
fix batch merge bug (#16601)
6 years ago
Qiao Longfei baf02328b2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Qiao Longfei d8974e6da0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
nhzlx d065b5bf2b Anakin ssd support
6 years ago
chengduo ed61d67c73
Fix the interface of Pass::Apply (#16484)
6 years ago
Qiao Longfei 392e97aae5 fix cpplint test=develop
6 years ago
Qiao Longfei 30618409db Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
nhzlx 953bdde058 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
Wojciech Uss 46677fb080 Move cpu_quantize_* passes into mkldnn subfolder
6 years ago
nhzlx 3df7b98a0f Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
7 years ago
luotao1 056599a738 add expected_kernel_cache_pass
7 years ago
Wojciech Uss cbe2dbf0db Add enabling quantization (#16326)
7 years ago
Tao Luo 9a05859179
Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass
7 years ago
nhzlx c407dfa3cb cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276
7 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
7 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
7 years ago
nhzlx a1d200a5de cherry-pick from feature/anakin-engine: Anakin support facebox #16111
7 years ago
Wojciech Uss 104a9f1e27 fix pattern maching conv2d with(out) ResidualData
7 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
7 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
7 years ago
Qiyang Min c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
7 years ago
Wojciech Uss af03008890 Add cpu_quantize_placement_pass for C-API quantization (#16265)
7 years ago
minqiyang b40e41fbd1 Polish code style
7 years ago
minqiyang 36dce65bb3 Take DataType and VarType apart
7 years ago
luotao1 cc0ae1f1a1 refine with comments
7 years ago
luotao1 a275fd6e0c Merge branch 'develop' into runtime_context
7 years ago
Wojciech Uss 2579ade45f Add cpu_quantize_pass for C-API quantization (#16127)
7 years ago
qingqing01 86e912c544 Fix windows compiling (#16230)
7 years ago
luotao1 1b59bed989 Merge branch 'develop' into runtime_context
7 years ago
luotao1 6ce25c99a0 Merge branch 'develop' into runtime_context
7 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
7 years ago
minqiyang ca392c7e97 Implement infer var type context
7 years ago
Wojciech Uss b9252f3df8 Add cpu_quantize_squash_pass for C-API quantization (#16128)
7 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
7 years ago
Zhen Wang 41b8cf0bae
Merge pull request #16162 from wzzju/fix_nan_static_quant
7 years ago
Zhen Wang 5685a48c23 Add some fixme. test=develop
7 years ago
Zhen Wang ac6ef06ffa Add the Clone method in Graph. test=develop
7 years ago
Zhen Wang 01eddf125c Not add graph copy construction method. test=develop
7 years ago
Zhen Wang 1b9c8d5f06 add clone function for IrGraph. test=develop
7 years ago
Yihua Xu 40f1dd818b Fix the node's order issue when the content of graph is changed (#16088)
7 years ago
Qiao Longfei fab1b54d99 Merge branch 'add-communicator' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator
7 years ago
nhzlx 2eff3e26b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_static_model_load_for_trt
7 years ago
nhzlx 06a088a199 fix comments and fix cpplint
7 years ago
Krzysztof Binias 72253391b6 Add MKL-DNN placement pass tester
7 years ago
Qiao Longfei 49f2f4f91d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
7 years ago
Michal Gallus 6a2bc9a275 Add Conv Residual Connection UT for Projection
7 years ago
Xin Pan a6e3cd5eb7
Merge pull request #15425 from panyx0718/api
7 years ago
Qiao Longfei b8491bfd4e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
7 years ago
Xin Pan 0362ef75f4 fix
7 years ago
Xin Pan 12a0e2ed9d polish codes
7 years ago
Xin Pan 19d78f6797 polish
7 years ago
Xin Pan 32d5a16036 resolve conflicts
7 years ago
Michał Gallus c4faf36e7a MKL-DNN: Add test for conv bias fuse pass (#15824)
7 years ago
Xin Pan 26e32e095a allow compiler to use graph
7 years ago
Sylwester Fraczek 0b926114c0 add override to ApplyImpl
7 years ago
Xin Pan 6019054cdd
Merge pull request #15716 from Yancey1989/refine_pg
7 years ago
tensor-tang e1c707fe9c
fix warnings (#15790)
7 years ago
Yancey1989 4b193db14c polish code test=develop
7 years ago
Yancey1989 642fd68ce0 update by comment test=develop
7 years ago
Yan Chunwei 077d12b939
fix scale cleaner (#15742)
7 years ago
nhzlx ecc12fb430 3. when runing in trt mode, do not allocate memory for parameters in fluid.
7 years ago
Yancey1989 ecdd1166b8 cleanup code test=develop
7 years ago
Yancey1989 73005ee00d cleanup code test=develop
7 years ago
Yancey1989 88d3dc949e Merge branch 'develop' of github.com:PaddlePaddle/Paddle into refine_pg
7 years ago