Commit Graph

438 Commits (22fa4c2d2440208870eb94de8694c05f2605cfe8)

Author SHA1 Message Date
Zeng Jinle 8008ab4e6b
Remove legacy C++ memory optimization codes (#18834)
7 years ago
chengduo 4140fe11a4
Open fuse optimization ops (#18741)
7 years ago
Zeng Jinle a802da650b
Feature/mem opt pass refactor (#18735)
7 years ago
Zhaolong Xing 26ae6d49e4
Update trt5 for paddle-trt (#18645)
7 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
7 years ago
Huihuang Zheng 89bc3fd841
Support memory eager deletion on recurrent OP (#17710)
7 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
7 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
7 years ago
gongweibao 160ddc980c
Regroup fusion by date type. (#18496)
7 years ago
chengduo 7453857324 Make fuse_all_reduce_op_pass support mix_precision (#17652)
7 years ago
Michał Gallus 7023a86c3a Fix Pooling output scale (#18186)
7 years ago
Sylwester Fraczek 9252e8fa08 add int8 mkldnn prior_box (#17242)
7 years ago
chengduo 14e1e165df
update alloc_continuous_space_for_grad_pass (#18287)
7 years ago
gongweibao f5caf3443c
Fix reinitialized ncclid error! (#18025)
7 years ago
gongweibao da9143c1cc
Polish codes of old prs. (#17938)
7 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
7 years ago
Zeng Jinle 3ece61f71e
Remove attribute in Allocator::Allocate (#17878)
7 years ago
gongweibao 972c54cd70
Fix FLAGS_fuse_parameter_memory_size unit from Bytes to MBytes. (#17924)
7 years ago
gongweibao fbbdc9ccad
Add backward and optimizer operator dependency pass. (#17746)
7 years ago
Yiqun Liu 8fd39f3e99
Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236)
7 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
7 years ago
Jacek Czaja 6d8075ecef [MKL-DNN] conv_transpose mkldnn bias pass (#17644)
7 years ago
Sylwester Fraczek 96845d2168 add Concat quantization (#17448)
7 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
7 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
7 years ago
Michał Gallus 0c39b97b4e [MKL-DNN] Add Fully Connected Op for inference only(#15226)
7 years ago
Sylwester Fraczek 5b2a3c4b12 Conv concat relu quantization (#17466)
7 years ago
Sylwester Fraczek bccb0ba49a fix quantize_squash_pass segfault when no tensor linked to Bias (#17292)
7 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
7 years ago
guomingz 2281ebf0f3 Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130)
7 years ago
liuwei1031 c3949f5699
remove two useless flags: enable_subgraph_optimize, memory_optimize_debug, test=develop (#17491)
7 years ago
Tao Luo 32da5e9c3d
remove unused expected_kernel_cache_pass (#17486)
7 years ago
Zeng Jinle 712bfb17cb
fix recurrent_op,test=develop (#17433)
7 years ago
Zhen Wang 4a1b7fec96
Add setting Scope function for the graph class (#17417)
7 years ago
chengduo 04bd413acb
Code Clean: Move all pass to paddle::framework::ir (#17228)
7 years ago
Zeng Jinle 4f8594088d
Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225)
7 years ago
石晓伟 a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 (#17156)
7 years ago
tensor-tang 79ed1c76cd
fix bn fuse vardesc and add model saver (#17143)
7 years ago
Zeng Jinle 4e1bc6e805
Rewrite inplace pass and fix gc bug (#17126)
7 years ago
Yihua Xu 93cedfdb9c Fix the order while sorting the operators (#16756)
7 years ago
Yiqun Liu 112f16143b
Add an option to enable the cache of expected kernel in train phase. (#16724)
7 years ago
Tao Luo ad4a1bd13c
Merge pull request #16339 from luotao1/core_opt_choose_kernel
7 years ago
Yiqun Liu 3fe8cb0dd7
Enable the runtime_context_cache pass in train phase (#16640)
7 years ago
luotao1 695f2db6a0 update expected_kernel_cache_pass
7 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
7 years ago
gongweibao 423bc515da
fix batch merge bug (#16601)
7 years ago
Qiao Longfei baf02328b2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
7 years ago
Qiao Longfei d8974e6da0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
7 years ago
nhzlx d065b5bf2b Anakin ssd support
7 years ago
chengduo ed61d67c73
Fix the interface of Pass::Apply (#16484)
7 years ago