Commit Graph

70 Commits (59357f4fb94d4589b9b51a33d1f3febb00653779)

Author SHA1 Message Date
gongweibao 8c9119afcd add logs and fix a bug (#5074)
8 years ago
Helin Wang 60238a1bfb Go master, pserver, trainer: switch to log15, away from logrus
8 years ago
Helin Wang 05176bd1bb master server will wait etcd forever
8 years ago
Helin Wang 5270585e10 fix according to comment
8 years ago
Helin Wang da7a1f2f6c master client: retry connecting to etcd
8 years ago
Helin Wang f64539bef9 use random port for embed etcd to avoid port collision
8 years ago
Helin Wang b8461c79fc implement init parameters selection with etcd
8 years ago
Helin Wang 01a62511b4 add curPass into log, remove JobTasks
8 years ago
Helin Wang 10794cf4de Master persist more states to etcd, schedule pending timeout after load pending state.
8 years ago
Yancey 53ea896996 Add master server unit test (#3086)
8 years ago
Helin Wang 54eac40f64 fix according to comments
8 years ago
Helin Wang 42fe3e88c7 gracefully shutdown pserver, fix gometalinter errors
8 years ago
Helin Wang cb5c7526e5 shutdown master server gracefully
8 years ago
武毅 c10121e13c [Done] Sync master client between passes and fix recordio split (#2948)
8 years ago
Helin Wang c67d8276b7 fix according to comments
8 years ago
Helin Wang 3ff0a9fbb1 Implement distributed training save model, improve master.NewClient interface
8 years ago
dongzhihong e1e7309789 boring copyright
8 years ago
Helin Wang 25e57949cc add more linters, fix errors found by them.
8 years ago
Helin Wang 2b1cac4113 Handle all unchecked errors
8 years ago
武毅 23b8346072 Fault tolerant distributed training, just work version, with etcd (#2849)
8 years ago
Helin Wang e4be077ffa Add go testing into cmake and fix libpaddle_go_optimizer.a link path
8 years ago
gongweibao d05d19ba03 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into taskfail
8 years ago
gongweibao b64c7a635d fix by helin's comments
8 years ago
gongweibao a40a7a5cb1 fix by helin's comments
8 years ago
gongweibao 8f7088590c fix bugs
8 years ago
gongweibao a94d217487 add TaskID
8 years ago
gongweibao 7663a40c88 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into taskfail
8 years ago
gongweibao 108b0fad2f fix by helin and wuyi's comments
8 years ago
gongweibao 24dc0d1c7c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cloudandlocal
8 years ago
gongweibao 3f5e5a24c4 fix cmake error
8 years ago
Qiao Longfei 9045063b53 pserver etcd client (#2559)
8 years ago
gongweibao 421d9f12a3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cloudandlocal
8 years ago
gongweibao 52cc601b48 fix bugs
8 years ago
gongweibao e25c155f39 add taskfail interface
8 years ago
gongweibao 26e661bc51 fix by helin's comments
8 years ago
gongweibao af5ac2c474 merge with upstream develop
8 years ago
yi.wu 9c853c269d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cmake_go_vendor
8 years ago
helinwang fae606fc96 Merge pull request #2659 from helinwang/cmake
8 years ago
gongweibao 97bbd17956 rm cloud EOF
8 years ago
gongweibao b3c5808e13 rm cloud EOF
8 years ago
Helin Wang 59cf5e7796 Fix Go cmake
8 years ago
gongweibao 0fa409246b fix bugs
8 years ago
Yancey 9af8d86b7c Trainer library discover master by etcd (#2551)
8 years ago
gongweibao 4874810ba5 fix bugs
8 years ago
gongweibao 183a5d44ee Merge branch 'cloudandlocal' of https://github.com/gongweibao/Paddle into cloudandlocal
8 years ago
gongweibao fc3d031425 first add
8 years ago
gongweibao 3919b75884 modify cmake
8 years ago
Helin Wang 4cc9680cc6 Make pserver able to get server index without etcd (decouple pserver with etcd)
8 years ago
Helin Wang 7dad02661f Master server registers itself to etcd.
8 years ago
Helin Wang 42313a3c35 rename EtcdStore to Etcd
8 years ago