Commit Graph

2488 Commits (ab57d3893ea2cfe8b002ed4a82e88a0d40b2f1e8)

Author SHA1 Message Date
gongweibao a61ed9782e
fix log level test=develop (#16554)
6 years ago
Qiao Longfei 8342f12e31 fix set remote_prefetch test=develop
6 years ago
Qiao Longfei df45c8c538 update nce and hierarchical_sigmoid remote_prefetch
6 years ago
Qiao Longfei a1821a0449 remote remote_prefetch in embedding layer test=develop
6 years ago
dongdaxiang 718ea6dbd5 fix fleet code style
6 years ago
xjqbest 782ab2e2bd add some doc
6 years ago
xjqbest a99c8d0c29 fix client to client communication bug
6 years ago
gongweibao fea91164b7 Fix windows compilation error! (#16546)
6 years ago
Zhaolong Xing 3e6aa498d6
Merge pull request #16526 from NHZlX/refine_trt_anakin
6 years ago
sneaxiy 33473890f3 Merge develop
6 years ago
dongdaxiang ade9337486 fix API.spec
6 years ago
liuwei1031 278debab71
fix comments of 16410, test=develop (#16499)
6 years ago
dongdaxiang 720647e17f rebase current develop and fix conflict
6 years ago
dongdaxiang 98dda08a85 fix pull sparse slow problem
6 years ago
dongdaxiang d739bab844 fix async_executor problem and remove some unnecessary testcase, fix trainer_desc import problem
6 years ago
dongdaxiang 241d8808be add timer to distributed executor
6 years ago
dongdaxiang 3c73859eec add trainer_desc.proto to distributed executor
6 years ago
dongdaxiang 60b7bf6fa6 add infer_from_dataset for inference
6 years ago
xjqbest 030c7e7e9d fix FillSparseValue error
6 years ago
dongdaxiang 88880d9b69 fix import trainer_desc_pb2 error
6 years ago
dongdaxiang 0030eb2a61 fix distributed building
6 years ago
dongdaxiang ed31874397 undefine rand_r()
6 years ago
dongdaxiang f7e4813804 add WIN32 for rand_r and usleep
6 years ago
dongdaxiang cedbc161da add more _LINUX maroc on data_feed.cc for mac and window compile
6 years ago
dongdaxiang c5980c3566 add _LINUX macro
6 years ago
dongdaxiang 433301fbc2 remove glog in shell.h
6 years ago
dongdaxiang 9e51ad4a65 fix io and fs compile on mac
6 years ago
dongdaxiang 6eca88ac76 fix io and fs compile on mac
6 years ago
dongdaxiang 2708108a08 fix fleet_wrapper compile on windows
6 years ago
dongdaxiang 4ce35815fb fix windows GLOG problem
6 years ago
dongdaxiang e3107a6ae0 fix windows compile problem
6 years ago
dongdaxiang 398004ece0 disable sys/wait.h to fix windows compile problem, include scope in lodtensor_printer
6 years ago
dongdaxiang d4514949bf remove local random engine in fleet with rand_r()
6 years ago
dongdaxiang 45eb6f0765 run pre-commit check files and fix code style problem
6 years ago
dongdaxiang d87ba58c14 refine document of python API, make device_worker and trainer's API private
6 years ago
dongdaxiang 5687f234bf fix trainer_desc.proto error
6 years ago
dongdaxiang b95b80bc76 add doc string for executor and update API.spec
6 years ago
dongdaxiang 6be9f719e2 make string_helper dependency work
6 years ago
xjqbest e95cafd9a7 fix code style & add dataset testcase
6 years ago
dongdaxiang ba15d6b164 move root_scope->DropKids() into Finalize() so that we do not have to drop all the kids
6 years ago
xjqbest be74de2c61 fix code style & fix register bug & add release_memory
6 years ago
dongdaxiang a0b59773af fix code style
6 years ago
dongdaxiang f39b323ed7 remove trainer_library in CMakeLists
6 years ago
dongdaxiang 365be5d559 support win32 flag in io.cc shell.cc, fix code style problem in fleet_wrapper, fix lodtensor_printer_test problem
6 years ago
dongdaxiang 6bf796df14 refine print fetch list
6 years ago
xjqbest 589467f24c fix bug
6 years ago
xjqbest b7940c2918 fix bug of gen_worker_desc and set_filelist, add some doc
6 years ago
dongdaxiang 68d7bf3de5 add fetch var function
6 years ago
xjqbest a34fe6248f add some doc
6 years ago
xujiaqi01 f5c6a14b54 fix runtime error
6 years ago
xujiaqi01 a5b1a0e12b support multi dataset && add init model && fix bug
6 years ago
dongdaxiang 3c65cc1bbd add document for role_maker and fleet parameter, data_generator
6 years ago
dongdaxiang f6c9232a3d fix dataset float32 type problem
6 years ago
dongdaxiang 73b1f396d7 add data_generator into paddle.fluid.incubate.data_generator, add op run log in hogwild_device_worker and downpour_device_worker
6 years ago
dongdaxiang 73544e8b8d add training speed log
6 years ago
dongdaxiang 9419de521f add IO percent for multi_trainer
6 years ago
dongdaxiang 6af697adb0 add trainfileswithprofiler for downpour worker
6 years ago
dongdaxiang 2644b88685 add comment for MPI Symetric role maker
6 years ago
dongdaxiang cf45c54340 add distributed optimizer factory
6 years ago
dongdaxiang b7a202aa38 add distributed optimizer factory
6 years ago
xujiaqi01 70a5d4f797 fix error
6 years ago
xujiaqi01 d25389fefd add some log && fix error
6 years ago
dongdaxiang 317eb0aad3 add incubate for unified API
6 years ago
xujiaqi01 39449ba0b9 fix bug && add DestroyReaders in trainer
6 years ago
dongdaxiang e657c127a8 hide opt_info in distirbuted optimizer
6 years ago
xujiaqi01 ecfc7df913 add dataset factory && fix style
6 years ago
dongdaxiang 328f11b8b6 refactor downpour optimization
6 years ago
xujiaqi01 3cea00bd52 store memory data in Dataset && fix bug
6 years ago
dongdaxiang ff87698a44 refactor downpour optimization
6 years ago
dongdaxiang b66f0074b6 fix data reading bugs in api, add VLOG(3) log for setup
6 years ago
dongdaxiang b415ec27e8 make Dataset* as an argument
6 years ago
xjqbest dd67ad08a2 modify c++ and python dataset related code & fix bug
6 years ago
dongdaxiang cc4def6ba5 fix some conflict for compilation
6 years ago
heqiaozhi 9bca1926c1 refactor & fix bug
6 years ago
xjqbest 2e9a836c6f add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang 2486389793 add RunFromDataset in executor
6 years ago
dongdaxiang e36bbcc871 fix some typo and CMakefile.txt
6 years ago
xjqbest 824b84d185 add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang 08c25995a2 add run from dataset in executor.
6 years ago
dongdaxiang c28bbdf8ba add dataset_generator.py
6 years ago
dongdaxiang be757096da add pybind for fleet
6 years ago
dongdaxiang 687cb79dbb add pipe command io interface
6 years ago
dongdaxiang 1fe54416c9 move fs.cc and shell.cc into paddle/fluid/framework/io
6 years ago
dongdaxiang 53fbab5d33 add fs_local_open example
6 years ago
dongdaxiang afaf937010 add fs_local_open example
6 years ago
dongdaxiang cf1360643f add printer for fetch variable
6 years ago
dongdaxiang d65cb13ad5 add pslib flag on fleet_wrapper CMakefile
6 years ago
dongdaxiang 6de9ebc65c refine VLOG in fleet_wrapper.h
6 years ago
dongdaxiang 97d5cd30f0 make pull dense worker work
6 years ago
dongdaxiang 39014b9f9f fix class register problem
6 years ago
dongdaxiang f0dd1201cc fix destructor problem
6 years ago
dongdaxiang f2bde9c241 fix destructor problem
6 years ago
dongdaxiang 54f047a126 fix ngraph compile option
6 years ago
dongdaxiang dd1dc9bcf0 add common.h.in back
6 years ago
dongdaxiang 378037c535 make s_instance_ private to ensure singleton
6 years ago
dongdaxiang a446d26e8a add todo for asynce executor
6 years ago
dongdaxiang c165012031 refine device_worker and trainer code
6 years ago
dongdaxiang 8a335b50be add downpour device_worker pb configuration
6 years ago
dongdaxiang 24a8001142 make -DWITH_PSLIB=ON compilable
6 years ago
dongdaxiang 67b1d6d721 add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
6 years ago
dongdaxiang 855bf579d2 add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
6 years ago
Qiao Longfei d8974e6da0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
chengduo 1096746cbf
Fuse Adam And SGD ops (#15933)
6 years ago
Jacek Czaja 2632327429 [MKL-DNN] Tensor modifications revert (#16462)
6 years ago
chengduo 2265d091e6
Fix threaded executor bug (#16508)
6 years ago
sneaxiy 2c836ff914 check default grad maker
6 years ago
nhzlx d065b5bf2b Anakin ssd support
6 years ago
Zeng Jinle 69cb9792ea
Merge pull request #16506 from sneaxiy/revert-16424-fix_allocator_bug
6 years ago
chengduo ed61d67c73
Fix the interface of Pass::Apply (#16484)
6 years ago
Zeng Jinle 2aa18e2bda
Merge pull request #16496 from sneaxiy/fix_gc_bug
6 years ago
Zeng Jinle 174d0d0b90 Revert "Fix allocator bug"
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
Zeng Jinle 644e8af4cf
Merge pull request #16424 from sneaxiy/fix_allocator_bug
6 years ago
sneaxiy c4c6205268 fix gc bug
6 years ago
Zeng Jinle c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
6 years ago
Qiao Longfei 33be014535 fix distribute compile problem test=develop
6 years ago
Qiao Longfei b542639dc0 code clean test=develop
6 years ago
liuwei1031 8d22bc17a4
Memory optimize (#16410)
6 years ago
Zhaolong Xing fa1796a30a
Merge pull request #16330 from NHZlX/merge_anakin_branch_to_dev
6 years ago
sneaxiy a0f4fefb60 delete source file no_need_buffer_vars_inference.cc
6 years ago
Qiao Longfei 392e97aae5 fix cpplint test=develop
6 years ago
Qiao Longfei 37f6b9ab7a fix build test=develop
6 years ago
Qiao Longfei 30618409db Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Wu Yi 9ffd5eecef
test fix fetch bar place for ce (#16406)
6 years ago
nhzlx 953bdde058 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
Tao Luo e0a3a49096
Merge pull request #16438 from wojtuss/wojtuss/move-cpu-quantize-passes
6 years ago
gongweibao ec6519e806
Fix allreducedep bug (#16443)
6 years ago
sneaxiy 78fb3a62e0 fix env variable settting bug
6 years ago
sneaxiy 2d92b6be98 merge develop
6 years ago
sneaxiy 7000ec85d9 fix some op grad maker
6 years ago
sneaxiy f8ed2c229e try to fix ci error
6 years ago
Wojciech Uss 46677fb080 Move cpu_quantize_* passes into mkldnn subfolder
6 years ago
sneaxiy c20db6357b split PR
6 years ago
Qiao Longfei be0c482304 update trainer_id
6 years ago
sneaxiy 072d95d8f6 Merge develop
6 years ago
sneaxiy a93a9eef8f add op registry type
6 years ago
chengduo a6a3b2fbbc
[Speed]Refine ParallelExecutor (#16190)
6 years ago
nhzlx 3df7b98a0f Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
chengduo 33965527fd
Add unit test for fuse all reduce (#16354)
6 years ago
sneaxiy 953214ad97 add more unittest
6 years ago
luotao1 056599a738 add expected_kernel_cache_pass
6 years ago
Wojciech Uss cbe2dbf0db Add enabling quantization (#16326)
6 years ago
Tao Luo 9a05859179
Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass
6 years ago
nhzlx c407dfa3cb cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276
6 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
nhzlx a1d200a5de cherry-pick from feature/anakin-engine: Anakin support facebox #16111
6 years ago
luotao1 bfdab00e5b Merge branch 'develop' into core_opt_choose_kernel
6 years ago
Tao Luo a5124ee0bb
Merge pull request #16301 from luotao1/runtime_context_pass
6 years ago
luotao1 6c6a39222b Merge branch 'core_opt_choose_kernel' of https://github.com/Xreki/Paddle into core_opt_choose_kernel
6 years ago
chengduo f26ba5bddd
Fuse AllReduce (#15921)
6 years ago
Zeng Jinle d0ef682552
Merge pull request #16274 from sneaxiy/fix_grad_maker
6 years ago
Wojciech Uss 104a9f1e27 fix pattern maching conv2d with(out) ResidualData
6 years ago
Wu Yi 6382b62f6b
Collective ops (#15572)
6 years ago
sneaxiy 023a3a3d62 fix op grad maker
6 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
sneaxiy fd23262e0c merge develop, fix conflict
6 years ago
Qiyang Min c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
6 years ago
Jacek Czaja 13816dd4ac [MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)
6 years ago
Wojciech Uss af03008890 Add cpu_quantize_placement_pass for C-API quantization (#16265)
6 years ago
Tao Luo dbb92ee4b1
Merge pull request #16002 from luotao1/runtime_context
6 years ago
minqiyang b40e41fbd1 Polish code style
6 years ago
Qiyang Min 8e4ad008fb
Merge pull request #16198 from velconia/imperative_train_speed
6 years ago
minqiyang 36dce65bb3 Take DataType and VarType apart
6 years ago
luotao1 cc0ae1f1a1 refine with comments
6 years ago
luotao1 a275fd6e0c Merge branch 'develop' into runtime_context
6 years ago
Wojciech Uss 2579ade45f Add cpu_quantize_pass for C-API quantization (#16127)
6 years ago
qingqing01 86e912c544 Fix windows compiling (#16230)
6 years ago
minqiyang 362253732c Polish code
6 years ago
minqiyang c0ddb93ccc Polish code
6 years ago
minqiyang b5078c211a Make infer var type virtual
6 years ago
minqiyang 438bca9c3d Implement Runtime Var Type Inference
6 years ago
luotao1 46ee6bb1aa fix distributed unit-tests
6 years ago
luotao1 1b59bed989 Merge branch 'develop' into runtime_context
6 years ago
luotao1 6ce25c99a0 Merge branch 'develop' into runtime_context
6 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
6 years ago
minqiyang ca392c7e97 Implement infer var type context
6 years ago
liuwei1031 1c6caf8466
1. disable reuse SELECTED_ROWS type variable (#16150)
6 years ago
Wojciech Uss b9252f3df8 Add cpu_quantize_squash_pass for C-API quantization (#16128)
6 years ago
luotao1 b2898c0f57 Merge branch 'develop' into runtime_context
6 years ago
sneaxiy a7a4f053da Merge develop
6 years ago
Tao Luo 4ef6f738c3
Merge pull request #16154 from luotao1/infershape_example
6 years ago
minqiyang 42e96a029f Accelerate CPU part
6 years ago
sneaxiy 682f2dbf29 merge develop
6 years ago
sneaxiy 2c4fcaa683 merge develop
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
Yan Xu 30568473ec
fix broadcast on mp mode (#15951)
6 years ago
baojun e3c37bd564 remove const_cast and refactor ngraph engine code (#15925)
6 years ago
luotao1 b561ad1e55 Merge branch 'develop' into runtime_context
6 years ago
Zhen Wang 41b8cf0bae
Merge pull request #16162 from wzzju/fix_nan_static_quant
6 years ago
luotao1 fe78a92e6e refine with comments
6 years ago
Zhen Wang 94b7c1ea7b
Merge pull request #16107 from wzzju/add_graph_clone
6 years ago
wopeizl 85709f4378
restore the exception caught since it is necessary for python call stack (#16160)
6 years ago
Zhen Wang 5685a48c23 Add some fixme. test=develop
6 years ago
luotao1 8f6597aa0e Merge branch 'develop' into infershape_example
6 years ago
Zhen Wang ac6ef06ffa Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang 01eddf125c Not add graph copy construction method. test=develop
6 years ago
Zhen Wang 1b9c8d5f06 add clone function for IrGraph. test=develop
6 years ago
Zeng Jinle 472f16b5aa
Merge pull request #16063 from sneaxiy/enhance_gc
6 years ago