2421 Commits (4a1b7fec967837ca283210aa669da3513c9f1aa1)

Author SHA1 Message Date
dongdaxiang ea07eb8cd2 remove comment in data_feed.cc
6 years ago
dongdaxiang 05464e7c5c add gpu training for Executor.train_from_dataset
6 years ago
Qiao Longfei 0608f8ca56 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
Zeng Jinle 9f7b027dce fix activation grad op desc maker (#16715)
6 years ago
liuwei1031 fdb719a1bf avoid optimize variable used in subblock, test=develop (#16739)
6 years ago
liuwei1031 a18ef10c87 only use the latest version variable for inplace strategy (#16736)
6 years ago
Tao Luo 5c364cda3c Merge pull request #16711 from luotao1/has_attr
6 years ago
chengduo 55b15db5af Add unit test for fuse all_reduce ops (#16699)
6 years ago
luotao1 4098ba29ed reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 f89a9c5d95 Merge branch 'develop' into has_attr
6 years ago
Tao Luo ad4a1bd13c Merge pull request #16339 from luotao1/core_opt_choose_kernel
6 years ago
luotao1 6afc97ca6b reduce hasAttr elapsed time in RunImpl
6 years ago
gongweibao 8b793d0efd Fix DGC bug. (#16697)
6 years ago
Yiqun Liu 3fe8cb0dd7 Enable the runtime_context_cache pass in train phase (#16640)
6 years ago
xjqbest 6a57e8075a remove trainer_id in datafeed and dataset
6 years ago
luotao1 695f2db6a0 update expected_kernel_cache_pass
6 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
xjqbest 5e5139283b fix runtime error
6 years ago
xjqbest 271b7147cc fix dataset bug
6 years ago
Zeng Jinle 1c526e1d1a Fix some grad op desc makers (#16633)
6 years ago
chengduo ea2a2f778a Fix the bug of AllReduceDepPass (#16393)
6 years ago
chengduo b75a69bad6 Add Stream for fetch op handle (#16600)
6 years ago
chengduo 1342e2ea04 Fix the bug of the fast threaded executor (#16514)
6 years ago
gongweibao 423bc515da fix batch merge bug (#16601)
6 years ago
liuwei1031 bd193781df fix the bug of reusing different types of variables in memory_optimiz… (#16547)
6 years ago
乔龙飞 Qiao Longfei 21622ca30b Merge pull request #16172 from jacquesqiao/add-async-ssa-graph-executor-communicator
6 years ago
sneaxiy 10249c0b78 Merge develop
6 years ago
Qiao Longfei 9861a92f6f change the return type of NewTempScope to unique ptr test=develop
6 years ago
Qiao Longfei fb6cc3a1bd follow commnet, optimize code and add comment test=develop
6 years ago
Qiao Longfei adf272bcec Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
guru4elephant 76b49f02ee Merge pull request #16539 from guru4elephant/train_with_pipe_reader_merge_develop
6 years ago
Qiao Longfei baf02328b2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Qiao Longfei 9db1a9e128 change log level test=develop
6 years ago
gongweibao a61ed9782e fix log level test=develop (#16554)
6 years ago
Qiao Longfei 8342f12e31 fix set remote_prefetch test=develop
6 years ago
Qiao Longfei df45c8c538 update nce and hierarchical_sigmoid remote_prefetch
6 years ago
Qiao Longfei a1821a0449 remote remote_prefetch in embedding layer test=develop
6 years ago
dongdaxiang 718ea6dbd5 fix fleet code style
6 years ago
xjqbest 782ab2e2bd add some doc
6 years ago
xjqbest a99c8d0c29 fix client to client communication bug
6 years ago
gongweibao fea91164b7 Fix windows compilation error! (#16546)
6 years ago
Zhaolong Xing 3e6aa498d6 Merge pull request #16526 from NHZlX/refine_trt_anakin
6 years ago
sneaxiy 33473890f3 Merge develop
6 years ago
dongdaxiang ade9337486 fix API.spec
6 years ago
liuwei1031 278debab71 fix comments of 16410, test=develop (#16499)
6 years ago
dongdaxiang 720647e17f rebase current develop and fix conflict
6 years ago
dongdaxiang 98dda08a85 fix pull sparse slow problem
6 years ago
dongdaxiang d739bab844 fix async_executor problem and remove some unnecessary testcase, fix trainer_desc import problem
6 years ago
dongdaxiang 241d8808be add timer to distributed executor
6 years ago
dongdaxiang 3c73859eec add trainer_desc.proto to distributed executor
6 years ago
dongdaxiang 60b7bf6fa6 add infer_from_dataset for inference
6 years ago
xjqbest 030c7e7e9d fix FillSparseValue error
6 years ago
dongdaxiang 88880d9b69 fix import trainer_desc_pb2 error
6 years ago
dongdaxiang 0030eb2a61 fix distributed building
6 years ago
dongdaxiang ed31874397 undefine rand_r()
6 years ago
dongdaxiang f7e4813804 add WIN32 for rand_r and usleep
6 years ago
dongdaxiang cedbc161da add more _LINUX maroc on data_feed.cc for mac and window compile
6 years ago
dongdaxiang c5980c3566 add _LINUX macro
6 years ago
dongdaxiang 433301fbc2 remove glog in shell.h
6 years ago
dongdaxiang 9e51ad4a65 fix io and fs compile on mac
6 years ago
dongdaxiang 6eca88ac76 fix io and fs compile on mac
6 years ago
dongdaxiang 2708108a08 fix fleet_wrapper compile on windows
6 years ago
dongdaxiang 4ce35815fb fix windows GLOG problem
6 years ago
dongdaxiang e3107a6ae0 fix windows compile problem
6 years ago
dongdaxiang 398004ece0 disable sys/wait.h to fix windows compile problem, include scope in lodtensor_printer
6 years ago
dongdaxiang d4514949bf remove local random engine in fleet with rand_r()
6 years ago
dongdaxiang 45eb6f0765 run pre-commit check files and fix code style problem
6 years ago
dongdaxiang d87ba58c14 refine document of python API, make device_worker and trainer's API private
6 years ago
dongdaxiang 5687f234bf fix trainer_desc.proto error
6 years ago
dongdaxiang b95b80bc76 add doc string for executor and update API.spec
6 years ago
dongdaxiang 6be9f719e2 make string_helper dependency work
6 years ago
xjqbest e95cafd9a7 fix code style & add dataset testcase
6 years ago
dongdaxiang ba15d6b164 move root_scope->DropKids() into Finalize() so that we do not have to drop all the kids
6 years ago
xjqbest be74de2c61 fix code style & fix register bug & add release_memory
6 years ago
dongdaxiang a0b59773af fix code style
6 years ago
dongdaxiang f39b323ed7 remove trainer_library in CMakeLists
6 years ago
dongdaxiang 365be5d559 support win32 flag in io.cc shell.cc, fix code style problem in fleet_wrapper, fix lodtensor_printer_test problem
6 years ago
dongdaxiang 6bf796df14 refine print fetch list
6 years ago
xjqbest 589467f24c fix bug
6 years ago
xjqbest b7940c2918 fix bug of gen_worker_desc and set_filelist, add some doc
6 years ago
dongdaxiang 68d7bf3de5 add fetch var function
6 years ago
xjqbest a34fe6248f add some doc
6 years ago
xujiaqi01 f5c6a14b54 fix runtime error
6 years ago
xujiaqi01 a5b1a0e12b support multi dataset && add init model && fix bug
6 years ago
dongdaxiang 3c65cc1bbd add document for role_maker and fleet parameter, data_generator
6 years ago
dongdaxiang f6c9232a3d fix dataset float32 type problem
6 years ago
dongdaxiang 73b1f396d7 add data_generator into paddle.fluid.incubate.data_generator, add op run log in hogwild_device_worker and downpour_device_worker
6 years ago
dongdaxiang 73544e8b8d add training speed log
6 years ago
dongdaxiang 9419de521f add IO percent for multi_trainer
6 years ago
dongdaxiang 6af697adb0 add trainfileswithprofiler for downpour worker
6 years ago
dongdaxiang 2644b88685 add comment for MPI Symetric role maker
6 years ago
dongdaxiang cf45c54340 add distributed optimizer factory
6 years ago
dongdaxiang b7a202aa38 add distributed optimizer factory
6 years ago
xujiaqi01 70a5d4f797 fix error
6 years ago
xujiaqi01 d25389fefd add some log && fix error
6 years ago
dongdaxiang 317eb0aad3 add incubate for unified API
6 years ago
xujiaqi01 39449ba0b9 fix bug && add DestroyReaders in trainer
6 years ago
dongdaxiang e657c127a8 hide opt_info in distirbuted optimizer
6 years ago
xujiaqi01 ecfc7df913 add dataset factory && fix style
6 years ago
dongdaxiang 328f11b8b6 refactor downpour optimization
6 years ago