Commit Graph

785 Commits (8ebffc78c9f999759a35921c71b83226200d8561)

Author SHA1 Message Date
Zeng Jinle 4f8594088d
Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225)
6 years ago
Zeng Jinle ee2028a110
Add use_cuda to inplace pass (#17205)
6 years ago
chengduo 950aec55fd
It doesn't need sync when fetch_list nit not empty (#17201)
6 years ago
Zeng Jinle 4e1bc6e805
Rewrite inplace pass and fix gc bug (#17126)
6 years ago
chengduo 794a195881
fix fuse optimizer ops (#17102)
6 years ago
Zeng Jinle 842ded14b0
fix reference_count_pass,test=develop (#17060)
6 years ago
chengduo cc31681687
use fast executor as default (#17044)
6 years ago
chengduo a2be4b4d91
Add fuse momenutum ops (#16745)
6 years ago
Zeng Jinle 1202d3fc74
Refine model gpu memory (#16993)
6 years ago
gongweibao cbdb8a17b1
Polish DGC code (#16818)
6 years ago
乔龙飞 Qiao Longfei 82cff5ec42
Merge pull request #16762 from jacquesqiao/add-async_sparse_param_update_recorder
6 years ago
chengduo e9409665f7
Refine Fuse Optimize Ops (#16810)
6 years ago
chengduo d105c06b50
Replace ThreadedExecutor with FastThreadedExecutor (#16650)
6 years ago
Qiao Longfei afc56949c1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
Yiqun Liu 112f16143b
Add an option to enable the cache of expected kernel in train phase. (#16724)
6 years ago
liuwei1031 2e07c19a9c
disable memory_optimize and inpalce strategy by default, test=develop (#16760)
6 years ago
Qiao Longfei 0608f8ca56 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
Zeng Jinle 9f7b027dce
fix activation grad op desc maker (#16715)
6 years ago
liuwei1031 fdb719a1bf
avoid optimize variable used in subblock, test=develop (#16739)
6 years ago
liuwei1031 a18ef10c87
only use the latest version variable for inplace strategy (#16736)
6 years ago
chengduo 55b15db5af
Add unit test for fuse all_reduce ops (#16699)
6 years ago
gongweibao 8b793d0efd
Fix DGC bug. (#16697)
6 years ago
Yiqun Liu 3fe8cb0dd7
Enable the runtime_context_cache pass in train phase (#16640)
6 years ago
chengduo ea2a2f778a Fix the bug of AllReduceDepPass (#16393)
6 years ago
chengduo b75a69bad6
Add Stream for fetch op handle (#16600)
6 years ago
chengduo 1342e2ea04
Fix the bug of the fast threaded executor (#16514)
6 years ago
liuwei1031 bd193781df
fix the bug of reusing different types of variables in memory_optimiz… (#16547)
6 years ago
乔龙飞 Qiao Longfei 21622ca30b
Merge pull request #16172 from jacquesqiao/add-async-ssa-graph-executor-communicator
6 years ago
sneaxiy 10249c0b78 Merge develop
6 years ago
Qiao Longfei fb6cc3a1bd follow commnet, optimize code and add comment test=develop
6 years ago
Qiao Longfei baf02328b2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Qiao Longfei 9db1a9e128 change log level test=develop
6 years ago
gongweibao a61ed9782e
fix log level test=develop (#16554)
6 years ago
Qiao Longfei 8342f12e31 fix set remote_prefetch test=develop
6 years ago
Qiao Longfei df45c8c538 update nce and hierarchical_sigmoid remote_prefetch
6 years ago
Qiao Longfei a1821a0449 remote remote_prefetch in embedding layer test=develop
6 years ago
gongweibao fea91164b7 Fix windows compilation error! (#16546)
6 years ago
sneaxiy 33473890f3 Merge develop
6 years ago
liuwei1031 278debab71
fix comments of 16410, test=develop (#16499)
6 years ago
Qiao Longfei d8974e6da0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
chengduo 1096746cbf
Fuse Adam And SGD ops (#15933)
6 years ago
chengduo 2265d091e6
Fix threaded executor bug (#16508)
6 years ago
sneaxiy 2c836ff914 check default grad maker
6 years ago
chengduo ed61d67c73
Fix the interface of Pass::Apply (#16484)
6 years ago
Zeng Jinle 2aa18e2bda
Merge pull request #16496 from sneaxiy/fix_gc_bug
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
sneaxiy c4c6205268 fix gc bug
6 years ago
Zeng Jinle c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
6 years ago
Qiao Longfei 33be014535 fix distribute compile problem test=develop
6 years ago
Qiao Longfei b542639dc0 code clean test=develop
6 years ago
liuwei1031 8d22bc17a4
Memory optimize (#16410)
6 years ago
sneaxiy a0f4fefb60 delete source file no_need_buffer_vars_inference.cc
6 years ago
Qiao Longfei 392e97aae5 fix cpplint test=develop
6 years ago
Qiao Longfei 37f6b9ab7a fix build test=develop
6 years ago
Qiao Longfei 30618409db Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Wu Yi 9ffd5eecef
test fix fetch bar place for ce (#16406)
6 years ago
gongweibao ec6519e806
Fix allreducedep bug (#16443)
6 years ago
sneaxiy 78fb3a62e0 fix env variable settting bug
6 years ago
sneaxiy 7000ec85d9 fix some op grad maker
6 years ago
sneaxiy f8ed2c229e try to fix ci error
6 years ago
Qiao Longfei be0c482304 update trainer_id
6 years ago
sneaxiy 072d95d8f6 Merge develop
6 years ago
sneaxiy a93a9eef8f add op registry type
6 years ago
chengduo a6a3b2fbbc
[Speed]Refine ParallelExecutor (#16190)
6 years ago
chengduo 33965527fd
Add unit test for fuse all reduce (#16354)
6 years ago
chengduo f26ba5bddd
Fuse AllReduce (#15921)
6 years ago
Wu Yi 6382b62f6b
Collective ops (#15572)
6 years ago
Qiyang Min c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
6 years ago
minqiyang b40e41fbd1 Polish code style
6 years ago
minqiyang 36dce65bb3 Take DataType and VarType apart
6 years ago
minqiyang 438bca9c3d Implement Runtime Var Type Inference
6 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
6 years ago
minqiyang ca392c7e97 Implement infer var type context
6 years ago
liuwei1031 1c6caf8466
1. disable reuse SELECTED_ROWS type variable (#16150)
6 years ago
Zhen Wang 5685a48c23 Add some fixme. test=develop
6 years ago
Zeng Jinle 472f16b5aa
Merge pull request #16063 from sneaxiy/enhance_gc
6 years ago
Qiao Longfei ff8054c5a7 can run
6 years ago
Qiao Longfei 3225e19591 fix remove recv op
6 years ago
Qiao Longfei fe6a840924 fix delete recv ops
6 years ago
Wu Yi d206582337
add parallel graph dist test (#16076)
6 years ago
Qiao Longfei 446fdf9563 fix compile problem
6 years ago
Qiao Longfei a23f1ee85a optimize code
6 years ago
Qiao Longfei a0bb18beec Merge branch 'add-async-ssa-graph-executor' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
liuwei1031 045e5911bf
fix a code bug which cause crash when empty variable is used, test=develop (#16080)
6 years ago
sneaxiy 7b608396fe fix travis-ci format check
6 years ago
Qiao Longfei 255b36dad2 can run
6 years ago
Qiao Longfei 5e8de51409 code format test=develop
6 years ago
Qiao Longfei 4e218dabc5 code format test=develop
6 years ago
sneaxiy 814a759061 merge develop
6 years ago
sneaxiy 597dc65e76 enhance gc
6 years ago
liuwei1031 caadd0581d
add IfElse test case for ir memory optimize (#15998)
6 years ago
Qiao Longfei f28c258453 code clean test=develop
6 years ago
Qiao Longfei 8c38aca954 tmp commit
6 years ago
Qiao Longfei fab1b54d99 Merge branch 'add-communicator' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Qiao Longfei e70b1727ef Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
sneaxiy a9ea99d700 merge develop
6 years ago
chengduo ae37f82964
Unified ParallelExecutor and Compiler (#15970)
6 years ago
Qiao Longfei 847e4f4e85 pure async mode train
6 years ago
Qiao Longfei 49f2f4f91d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
Qiao Longfei f768fbf715 support multi graph
6 years ago
Qiao Longfei ff01d70583 fix style
6 years ago
Qiao Longfei dab7f36909 optimize code test=develop
6 years ago
Qiao Longfei cf0511f21e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei 43c82376cb use one graph
6 years ago
dzhwinter 660e410655
Merge pull request #15855 from dzhwinter/fix/nightly_test
6 years ago
Xin Pan a6e3cd5eb7
Merge pull request #15425 from panyx0718/api
6 years ago
Qiao Longfei b8491bfd4e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
dzhwinter a71f2fbe4f fix default value. test=develop
6 years ago
乔龙飞 Qiao Longfei ec8e878200
Merge pull request #15840 from jacquesqiao/revert-15684-revert-15661-fix-cpu-broadcast
6 years ago
Qiao Longfei 2b7931d5c9 refine code test=develop
6 years ago
Xin Pan 0362ef75f4 fix
6 years ago
Xin Pan 12a0e2ed9d polish codes
6 years ago
Xin Pan 19d78f6797 polish
6 years ago
Qiao Longfei ecedd531c1 fix code bug test=develop
6 years ago
Qiao Longfei f4f4816b0c fix gpu error test=develop
6 years ago
Xin Pan 32d5a16036 resolve conflicts
6 years ago
Qiao Longfei 3f9263f67e optimize style test=develop
6 years ago
Qiao Longfei 4233d0a820 add more comment test=develop
6 years ago
Qiao Longfei 3bccc1e6e2 optimize broadcast logic test=develop
6 years ago
Xin Pan 26e32e095a allow compiler to use graph
6 years ago
Qiao Longfei 12f6b8c3d6 change the include of ThreadPool.h test=develop
6 years ago
Qiao Longfei 7f3be09045 fix multi graph test=develop
6 years ago
Qiao Longfei 31a05d3efd Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei 62f1248ff5 fix use gpu test=develop
6 years ago
Xin Pan 6019054cdd
Merge pull request #15716 from Yancey1989/refine_pg
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
Qiao Longfei cc71e89499 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
dzhwinter f2e8409f5a
Merge pull request #15795 from dzhwinter/fix/block_desc
6 years ago
tensor-tang e1c707fe9c
fix warnings (#15790)
6 years ago
Yancey1989 4b193db14c polish code test=develop
6 years ago
dzhwinter 6deb17ed8c fix default value. test=develop
6 years ago
dzhwinter 089d262c41 fix default value. test=develop
6 years ago
dzhwinter 9c92d0304f fix default value. test=develop
6 years ago
Yancey1989 d5090c892d polish code test=develop
6 years ago
Yancey1989 0f8bd73cc9 cleanup code test=develop
6 years ago
Yancey1989 5677c9d4ee update comment test=develop
6 years ago
Yancey1989 642fd68ce0 update by comment test=develop
6 years ago
dzhwinter d94a314db5 add reference. test=develop
6 years ago
dzhwinter 591ad33e32 polish code for reading. test=develop
6 years ago
dzhwinter 18afb77e78 polish code for reading. test=develop
6 years ago
dzhwinter 684b572307 polish code for reading. test=develop
6 years ago
dzhwinter 3787e61fca polish code for reading. test=develop
6 years ago
dzhwinter c1455e606d Merge remote-tracking branch 'origin/develop' into cherry-pick/memory
6 years ago
dzhwinter d376cf71b7 polish code for reading. test=develop
6 years ago
Yancey1989 7cd6de37f5 fix cpu test=develop
6 years ago
Yancey1989 bd0d44af24 fix build failed test=develop
6 years ago
Yancey1989 ecdd1166b8 cleanup code test=develop
6 years ago
Yancey1989 73005ee00d cleanup code test=develop
6 years ago
Yancey1989 88d3dc949e Merge branch 'develop' of github.com:PaddlePaddle/Paddle into refine_pg
6 years ago
Yancey1989 f3463ecb6e refine pg execution
6 years ago