Qiao Longfei
61912e879d
test_dist_base set runtime_split_send_recv to false test=develop
6 years ago
Qiao Longfei
d8974e6da0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
test=develop
6 years ago
gongweibao
eb83abeac3
Add DGC(Deep Gradient Compression) interface. ( #15841 )
6 years ago
Qiao Longfei
30618409db
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Wu Yi
8bebfe5640
add resnet nccl2 dist training, mp training unit test ( #16167 )
* add resnet nccl2 test=develop
* test dist train test=develop
* update test=develop
* increase timeout test=develop
* test on CI env test=develop
6 years ago
Wu Yi
6382b62f6b
Collective ops ( #15572 )
* wip allreduce in op
* wip
* wip
* wip
* wip adding test
* wip for conflict with mp mode
* fix tests test=develop
* fix cpu build test=develop
* fix travis clang format test=develop
* fix cpu build test=develop
* update api.spec test=develop
* delete comment test=develop
* fix cpplint test=develop
* fix test=develop
* follow comment test=develop
* add file test=develop
* fix build test=develop
* update test=develop
* to be compatible with sync_bn, and fix mp mode in develop test=develop
6 years ago
liuwei1031
caadd0581d
add IfElse test case for ir memory optimize ( #15998 )
* add ir memory optimize test case for IfElse op, test=develop
* fix some unittest failure by force using the python memory_optimize, test=develop
* tweak comments, test=develop
* fix unittest, test=develop
* fix unittest, test=develop
6 years ago
Qiao Longfei
5cf0092825
add more log and fix test_dist_base in multi_batch_merge_pass
6 years ago
Qiao Longfei
4356f186b4
complete parameter_send
6 years ago
Zeng Jinle
dec89bd7ed
Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock
Turn on remove_unnecessary_lock by default
6 years ago
tangwei12
8b50ad80ff
checkpoint at distributed training ( #14854 )
checkpoint for distributed training.
6 years ago
sneaxiy
ef788603d4
merge develop
test=develop
6 years ago
WangZhen
bac08c4a26
Fix some bugs caused by set functions of the Pass class. test=develop
6 years ago
sneaxiy
d8568acd19
turn on remove_unnecessary_lock
test=develop
6 years ago
Xin Pan
7526ac14e3
add comments
test=develop
6 years ago
Xin Pan
beaae61a16
polish
test=develop
6 years ago
Xin Pan
5e928e579a
try unify Executor and ParallelExecutor
test=develop
6 years ago
Yancey1989
8cad371a60
fix nccl unittest acc test=develop
6 years ago
Yan Xu
5384206aec
Merge pull request #14869 from Yancey1989/fix_dist_unittest
fix dist unit test
6 years ago
Yancey1989
fa1f77e20c
enable ci test=develop
6 years ago
Wu Yi
f95ee9c09f
fix nccl dist test acc ( #14867 )
* fix nccl dist test acc test=develop
* fix test=develop
6 years ago
Wu Yi
554bcdbdfc
add more log for dist test for ci test=develop ( #14813 )
* add more log for dist test for ci test=develop
* increase deadline test=develop
6 years ago
Wu Yi
aebc175cd4
add nccl2 dist tests ( #14755 )
* add nccl2 dist tests test=develop
* fix dist_base test=develop
* fix tests test=develop
* fix test on mac test=develop
6 years ago
Wu Yi
e2011f1353
test dist ut fixes test=develop ( #14706 )
* test dist ut fixes test=develop
* fix cmake
* for test
6 years ago
Xin Pan
44ecf9a481
fix
test=develop
6 years ago
Xin Pan
9735e3016a
fix test
the build strategy is finalized after create_passes. So future
change of build strategy has no effects.
test=develop
6 years ago
Wu Yi
306236c2c0
feature/DC asgd ( #12722 )
* wip
* add ref_by_trainer_id op
* ready to test
* fix ref inputs
* refine rpc_op_handle
* fix merge bug
6 years ago
Wu Yi
d186e7434e
Refine dist ut ( #14118 )
* fix use_reader_alloc uts
* dist ut fixes test=develop
* update test=develop
* fix test for py3 test=develop
6 years ago
minqiyang
59420d5bd2
Polish code
test=develop
7 years ago
minqiyang
2cc939bbfa
Fix Mac Python3 CI job
test=develop
7 years ago
Wu Yi
26200f2e42
[1.1] [project] train imagenet using large batch size ( #13766 )
* fix nccl2 lars dist support
* put lars in momentum op
* add tests lars
* fix ci
* fix cpu kernel
* soft warning
* remove lars in test_recognize_digits.py
* move to another op
* add file
* update api.spec test=develop
* update test=develop
* fix api.spec test=develop
* wip
* wip, finish grad merge ops
* wip, finish graph build
* wip test running
* work on 1 gpu
* workable version
* update
* fix tests
* fuse broadcast op
* fix compile failed
* refine
* add batch merge test mnist
* fix CI test=develop
* fix build
* use independent bn params for batch merge test=develop
* update api.spec
* follow comments and for test
* wip
* refine tests test=develop
* follow comments test=develop
* remove startup bn modify test=develop
* follow comments test=develop
* fix merge test=develop
7 years ago
tangwei12
b35239df2b
fix dist ut with place, test=develop ( #13647 )
7 years ago
Wu Yi
7a5f3f750b
Fix memory optimization with dist train ( #13535 )
* show detail error log on ci
* test
* fix memopt and dist
* update apispec
* will fix different batch issue test=develop
7 years ago
tangwei12
97cf1eb6d7
Add distributed unit tests about text_classification/simnet-bow/ctr ( #12812 )
* add dist ut for text_classification
* add dist ut for text_classification
* add simnet bow unittest
* add dist ut for simnet bow
* add training data url for simnet bow
* add training data url for simnet bow
* modify simnet test_reader to train reader
* add test_dist_ctr
* test_dist_ctr can run now
* dense update is good
* add unit test for selected rows
* debug unit test
* fix dist sparse update problem
* Constant args at init
* optimize code
* simnet optimize
* fix DebugStringEx
* optimize sum_op.h
* add ScaleOpVarTypeInference
* clean code
* fix test_dist_transpiler.py
* code optimize
* modify delta
* fix sparse update bug
* dist test use one cpu
* update some data
* remove unused code
* add use cuda config
* unit test fix
* unit test fix
* unit test fix
* unit test fix
* dist_word2vec use CPU
* unit test fix
* unit test fix
* code clean
* code clean
* merge develop
* api spec update
* Revert: api spec update
* replace simnet data with fake
* replace simnet data with fake
* update dim
* add batch auc
* code clean
* code clean
* modify print to stderr
* update simnet delta -> 1e-5
* update RUN_STEP
* add use_reader_alloc
* add use_reader_alloc
* add use_reader_alloc
* modify delta
* add use_reader_alloc
* fix stderr write
* python3 compatibility
test=develop
* python3 compatibility, test=develop
* Update dist_text_classification.py
* test=develop
7 years ago
Wu Yi
437debf40e
Fix mac ci dist ( #13393 )
7 years ago
Yancey1989
a267155006
fix parallel run dist unit test
7 years ago
Wu Yi
0b8067c0dc
fix dist train reduce mode ( #13068 )
* fix dist train reduce mode
* fix previous fix
7 years ago
Wu Yi
a615ad46e4
Add test for dist and memopt ( #13049 )
* add test for dist and memopt
* update transformer too
7 years ago
Wu Yi
f63368db5e
Add async dist tests ( #12798 )
* add async dist tests
* update delta
* fix transformer test
* refine rmsprop transpile
* update
* fix dist seresnet
7 years ago
minqiyang
99d3f08920
Add print_function for all python files
7 years ago
minqiyang
77f12e000f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
gongweibao
842fb021b3
Fix clone() bug. ( #12583 )
7 years ago
minqiyang
e0d5f8a820
Move compat module to python/paddle
7 years ago
minqiyang
ae39709e59
Polish code
7 years ago
minqiyang
92aa20616d
Polish the code style
7 years ago
minqiyang
68b221401d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
typhoonzero
069ff14756
polish dist unittest
7 years ago
minqiyang
e4057d071b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
minqiyang
6abe819f07
Fix pybind11 problem
Fix str and bytes problem
Fix sorted problem
Fix math problem
Fix CI problem
7 years ago
Wu Yi
2d036c47cd
polish dist unit test code ( #12512 )
* polish dist se resnext ut
* update
* update
* update
* avoid cpu initializer differ
* change to use executor for now
* update by comment
* remove lr decay use para exe, should fix para exe bug later
* update by comment
7 years ago