56 Commits (b1bd483a7ded7bb54294a4c6dbff76245f471640)

Author SHA1 Message Date
Tao Luo 32da5e9c3d remove unused expected_kernel_cache_pass (#17486)
6 years ago
Tao Luo 68ec0a6f74 make parallel_executor support FLAGS_use_mkldnn (#17341)
6 years ago
Yiqun Liu 112f16143b Add an option to enable the cache of expected kernel in train phase. (#16724)
6 years ago
liuwei1031 2e07c19a9c disable memory_optimize and inpalce strategy by default, test=develop (#16760)
6 years ago
Yiqun Liu 3fe8cb0dd7 Enable the runtime_context_cache pass in train phase (#16640)
6 years ago
chengduo b75a69bad6 Add Stream for fetch op handle (#16600)
6 years ago
Qiao Longfei d8974e6da0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
chengduo 1096746cbf Fuse Adam And SGD ops (#15933)
6 years ago
chengduo ed61d67c73 Fix the interface of Pass::Apply (#16484)
6 years ago
Qiao Longfei 30618409db Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
chengduo f26ba5bddd Fuse AllReduce (#15921)
6 years ago
qingqing01 8ad672a287 Support sync batch norm. (#16121)
6 years ago
Qiao Longfei 4e218dabc5 code format test=develop
6 years ago
liuwei1031 caadd0581d add IfElse test case for ir memory optimize (#15998)
6 years ago
Qiao Longfei cf0511f21e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Xin Pan 26e32e095a allow compiler to use graph
6 years ago
Qiao Longfei cc71e89499 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
dzhwinter 04e9776aef add details. test=develop
6 years ago
Qiao Longfei 16af1dbc7b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
dzhwinter 0a63234c85 follow comments. test=develop
6 years ago
dzhwinter ee3aae56cd merge develop branch. test=develop
6 years ago
Qiao Longfei ada43e89c3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
sneaxiy d8568acd19 turn on remove_unnecessary_lock
6 years ago
dzhwinter 8f3b252392 squash commits. test=develop
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
Qiao Longfei ea66979684 can run
6 years ago
chengduo eabb2105fa Refactor MultiDevSSAGraphBuilder (#15090)
6 years ago
Yancey1989 ca8c77d966 selecte execution according to strategy test=develop
6 years ago
Yancey1989 845bfd5807 cleanup code
6 years ago
Yancey1989 41a64f6a2a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
chengduo 550e7e410b Code Clean parallel_executor.py (#14849)
6 years ago
dzhwinter 7cd24b1318 add ir memory optimize. (#14530)
6 years ago
Yancey1989 fd144954ed redefine api test=develop
6 years ago
gongweibao f1fb64b17f Add reduce sparse tensor feature. (#14757)
6 years ago
gongweibao 867c312bc4 Fix allreduce dependency order. (#14586)
6 years ago
peizhilin 7c8c9dc9bf fix unit test cases
6 years ago
Xin Pan 759ffca423 some improvements
6 years ago
Xin Pan 99dffb91d6 allow to repeatedly share and update BuildStrategy
6 years ago
sneaxiy faac8a76ce remove unnecessary codes
6 years ago
sneaxiy 203027ca86 test=develop
6 years ago
sneaxiy cef8cc81db merge develop
6 years ago
Wu Yi 26200f2e42 [1.1] [project] train imagenet using large batch size (#13766)
6 years ago
sneaxiy 2414f92f54 test=develop
6 years ago
sneaxiy a314a80cdb merge develop
6 years ago
sneaxiy d87569134c test=develop
6 years ago
Xin Pan f117feab0c modify comments
6 years ago
Xin Pan c2b3838cf5 add some comments
6 years ago
Xin Pan b43e49fa31 fix
6 years ago
Xin Pan 36c2a9af27 pass builder allow cutomize pass in python.
6 years ago
chengduo d402234ba8 Feature/op_fuse_pass (#12440)
7 years ago