Commit Graph

26 Commits (638bbb6153ccf47d23c74e58719a86be2ace5c13)

Author SHA1 Message Date
Thunderbrook 0cb60c700d
add heter ps mode (#25682)
5 years ago
lilong12 e39aa70ec7
add the support for pipeline (#24560)
5 years ago
hutuxian 0ec3a42e97
Random Dump (#24477)
6 years ago
xujiaqi01 3a45767d49
add fleet pslib pull and push sparse op and push dense op (#23139)
6 years ago
hutuxian 175954d894
PaddleBox Framework Part2 (#22466)
6 years ago
123malin 00594c1c88
support dumping params/grads in transpiler mode (#22490)
6 years ago
Wilber a90fa54092
Compile without nccl deps. [1/2] (#22509)
6 years ago
hutuxian c5aec2fe68
Paddlebox Related to Framework (#21586)
6 years ago
Thunderbrook 59bcdc8a19
support dump param of model into afs (#20302)
6 years ago
Chengmo 940c6ff1c8
Fix communicator slow bug & fix communicator stop bug (#20366)
6 years ago
Thunderbrook f76a32df4a
dump fix dov vec file num (#20539)
6 years ago
tangwei12 c9139c3db3
trainer from dataset fetch targets (#19760)
6 years ago
yaoxuefeng 10ca3f9609
add thread scope stat accurate metrics test=develop (#19480)
6 years ago
Thunderbrook 1fe468d319
support debug each output of each ins (#19004)
6 years ago
jiaqi 3f8031e256
dataset (#17973)
7 years ago
hutuxian 969e6378b9
Pipeline Concurrency (#17402)
7 years ago
dongdaxiang 9e51ad4a65 fix io and fs compile on mac
7 years ago
dongdaxiang 4ce35815fb fix windows GLOG problem
7 years ago
dongdaxiang 6af697adb0 add trainfileswithprofiler for downpour worker
7 years ago
xujiaqi01 39449ba0b9 fix bug && add DestroyReaders in trainer
7 years ago
dongdaxiang ff87698a44 refactor downpour optimization
7 years ago
xjqbest dd67ad08a2 modify c++ and python dataset related code & fix bug
7 years ago
dongdaxiang 2486389793 add RunFromDataset in executor
7 years ago
xjqbest 824b84d185 add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
7 years ago
dongdaxiang c165012031 refine device_worker and trainer code
7 years ago
dongdaxiang 855bf579d2 add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
7 years ago