Commit Graph

2444 Commits (b4b169467ba27d04e6d4bd5c6bc3f7abbba04c65)

Author SHA1 Message Date
dongdaxiang 6bf796df14 refine print fetch list
6 years ago
xjqbest 589467f24c fix bug
6 years ago
xjqbest b7940c2918 fix bug of gen_worker_desc and set_filelist, add some doc
6 years ago
dongdaxiang 68d7bf3de5 add fetch var function
6 years ago
xjqbest a34fe6248f add some doc
6 years ago
xujiaqi01 f5c6a14b54 fix runtime error
6 years ago
xujiaqi01 a5b1a0e12b support multi dataset && add init model && fix bug
6 years ago
dongdaxiang 3c65cc1bbd add document for role_maker and fleet parameter, data_generator
6 years ago
dongdaxiang f6c9232a3d fix dataset float32 type problem
6 years ago
dongdaxiang 73b1f396d7 add data_generator into paddle.fluid.incubate.data_generator, add op run log in hogwild_device_worker and downpour_device_worker
6 years ago
dongdaxiang 73544e8b8d add training speed log
6 years ago
dongdaxiang 9419de521f add IO percent for multi_trainer
6 years ago
dongdaxiang 6af697adb0 add trainfileswithprofiler for downpour worker
6 years ago
dongdaxiang 2644b88685 add comment for MPI Symetric role maker
6 years ago
dongdaxiang cf45c54340 add distributed optimizer factory
6 years ago
dongdaxiang b7a202aa38 add distributed optimizer factory
6 years ago
xujiaqi01 70a5d4f797 fix error
6 years ago
xujiaqi01 d25389fefd add some log && fix error
6 years ago
dongdaxiang 317eb0aad3 add incubate for unified API
6 years ago
xujiaqi01 39449ba0b9 fix bug && add DestroyReaders in trainer
6 years ago
dongdaxiang e657c127a8 hide opt_info in distirbuted optimizer
6 years ago
xujiaqi01 ecfc7df913 add dataset factory && fix style
6 years ago
dongdaxiang 328f11b8b6 refactor downpour optimization
6 years ago
xujiaqi01 3cea00bd52 store memory data in Dataset && fix bug
6 years ago
dongdaxiang ff87698a44 refactor downpour optimization
6 years ago
dongdaxiang b66f0074b6 fix data reading bugs in api, add VLOG(3) log for setup
6 years ago
dongdaxiang b415ec27e8 make Dataset* as an argument
6 years ago
xjqbest dd67ad08a2 modify c++ and python dataset related code & fix bug
6 years ago
dongdaxiang cc4def6ba5 fix some conflict for compilation
6 years ago
heqiaozhi 9bca1926c1 refactor & fix bug
6 years ago
xjqbest 2e9a836c6f add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang 2486389793 add RunFromDataset in executor
6 years ago
dongdaxiang e36bbcc871 fix some typo and CMakefile.txt
6 years ago
xjqbest 824b84d185 add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang 08c25995a2 add run from dataset in executor.
6 years ago
dongdaxiang c28bbdf8ba add dataset_generator.py
6 years ago
dongdaxiang be757096da add pybind for fleet
6 years ago
dongdaxiang 687cb79dbb add pipe command io interface
6 years ago
dongdaxiang 1fe54416c9 move fs.cc and shell.cc into paddle/fluid/framework/io
6 years ago
dongdaxiang 53fbab5d33 add fs_local_open example
6 years ago
dongdaxiang afaf937010 add fs_local_open example
6 years ago
dongdaxiang cf1360643f add printer for fetch variable
6 years ago
dongdaxiang d65cb13ad5 add pslib flag on fleet_wrapper CMakefile
6 years ago
dongdaxiang 6de9ebc65c refine VLOG in fleet_wrapper.h
6 years ago
dongdaxiang 97d5cd30f0 make pull dense worker work
6 years ago
dongdaxiang 39014b9f9f fix class register problem
6 years ago
dongdaxiang f0dd1201cc fix destructor problem
6 years ago
dongdaxiang f2bde9c241 fix destructor problem
6 years ago
dongdaxiang 54f047a126 fix ngraph compile option
6 years ago
dongdaxiang dd1dc9bcf0 add common.h.in back
6 years ago
dongdaxiang 378037c535 make s_instance_ private to ensure singleton
6 years ago
dongdaxiang a446d26e8a add todo for asynce executor
6 years ago
dongdaxiang c165012031 refine device_worker and trainer code
6 years ago
dongdaxiang 8a335b50be add downpour device_worker pb configuration
6 years ago
dongdaxiang 24a8001142 make -DWITH_PSLIB=ON compilable
6 years ago
dongdaxiang 67b1d6d721 add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
6 years ago
dongdaxiang 855bf579d2 add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
6 years ago
Qiao Longfei d8974e6da0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
chengduo 1096746cbf
Fuse Adam And SGD ops (#15933)
6 years ago
Jacek Czaja 2632327429 [MKL-DNN] Tensor modifications revert (#16462)
6 years ago
chengduo 2265d091e6
Fix threaded executor bug (#16508)
6 years ago
sneaxiy 2c836ff914 check default grad maker
6 years ago
nhzlx d065b5bf2b Anakin ssd support
6 years ago
Zeng Jinle 69cb9792ea
Merge pull request #16506 from sneaxiy/revert-16424-fix_allocator_bug
6 years ago
chengduo ed61d67c73
Fix the interface of Pass::Apply (#16484)
6 years ago
Zeng Jinle 2aa18e2bda
Merge pull request #16496 from sneaxiy/fix_gc_bug
6 years ago
Zeng Jinle 174d0d0b90 Revert "Fix allocator bug"
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
Zeng Jinle 644e8af4cf
Merge pull request #16424 from sneaxiy/fix_allocator_bug
6 years ago
sneaxiy c4c6205268 fix gc bug
6 years ago
Zeng Jinle c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
6 years ago
Qiao Longfei 33be014535 fix distribute compile problem test=develop
6 years ago
Qiao Longfei b542639dc0 code clean test=develop
6 years ago
liuwei1031 8d22bc17a4
Memory optimize (#16410)
6 years ago
Zhaolong Xing fa1796a30a
Merge pull request #16330 from NHZlX/merge_anakin_branch_to_dev
6 years ago
sneaxiy a0f4fefb60 delete source file no_need_buffer_vars_inference.cc
6 years ago
Qiao Longfei 392e97aae5 fix cpplint test=develop
6 years ago
Qiao Longfei 37f6b9ab7a fix build test=develop
6 years ago
Qiao Longfei 30618409db Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Wu Yi 9ffd5eecef
test fix fetch bar place for ce (#16406)
6 years ago
nhzlx 953bdde058 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
Tao Luo e0a3a49096
Merge pull request #16438 from wojtuss/wojtuss/move-cpu-quantize-passes
6 years ago
gongweibao ec6519e806
Fix allreducedep bug (#16443)
6 years ago
sneaxiy 78fb3a62e0 fix env variable settting bug
6 years ago
sneaxiy 2d92b6be98 merge develop
6 years ago
sneaxiy 7000ec85d9 fix some op grad maker
6 years ago
sneaxiy f8ed2c229e try to fix ci error
6 years ago
Wojciech Uss 46677fb080 Move cpu_quantize_* passes into mkldnn subfolder
6 years ago
sneaxiy c20db6357b split PR
6 years ago
Qiao Longfei be0c482304 update trainer_id
6 years ago
sneaxiy 072d95d8f6 Merge develop
6 years ago
sneaxiy a93a9eef8f add op registry type
6 years ago
chengduo a6a3b2fbbc
[Speed]Refine ParallelExecutor (#16190)
6 years ago
nhzlx 3df7b98a0f Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
chengduo 33965527fd
Add unit test for fuse all reduce (#16354)
6 years ago
sneaxiy 953214ad97 add more unittest
6 years ago
luotao1 056599a738 add expected_kernel_cache_pass
6 years ago
Wojciech Uss cbe2dbf0db Add enabling quantization (#16326)
6 years ago
Tao Luo 9a05859179
Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass
6 years ago
nhzlx c407dfa3cb cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276
6 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
nhzlx a1d200a5de cherry-pick from feature/anakin-engine: Anakin support facebox #16111
6 years ago
luotao1 bfdab00e5b Merge branch 'develop' into core_opt_choose_kernel
6 years ago
Tao Luo a5124ee0bb
Merge pull request #16301 from luotao1/runtime_context_pass
6 years ago
luotao1 6c6a39222b Merge branch 'core_opt_choose_kernel' of https://github.com/Xreki/Paddle into core_opt_choose_kernel
6 years ago
chengduo f26ba5bddd
Fuse AllReduce (#15921)
6 years ago
Zeng Jinle d0ef682552
Merge pull request #16274 from sneaxiy/fix_grad_maker
6 years ago
Wojciech Uss 104a9f1e27 fix pattern maching conv2d with(out) ResidualData
6 years ago
Wu Yi 6382b62f6b
Collective ops (#15572)
6 years ago
sneaxiy 023a3a3d62 fix op grad maker
6 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
sneaxiy fd23262e0c merge develop, fix conflict
6 years ago
Qiyang Min c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
6 years ago
Jacek Czaja 13816dd4ac [MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)
6 years ago
Wojciech Uss af03008890 Add cpu_quantize_placement_pass for C-API quantization (#16265)
6 years ago
Tao Luo dbb92ee4b1
Merge pull request #16002 from luotao1/runtime_context
6 years ago
minqiyang b40e41fbd1 Polish code style
6 years ago
Qiyang Min 8e4ad008fb
Merge pull request #16198 from velconia/imperative_train_speed
6 years ago
minqiyang 36dce65bb3 Take DataType and VarType apart
6 years ago
luotao1 cc0ae1f1a1 refine with comments
6 years ago
luotao1 a275fd6e0c Merge branch 'develop' into runtime_context
6 years ago
Wojciech Uss 2579ade45f Add cpu_quantize_pass for C-API quantization (#16127)
6 years ago
qingqing01 86e912c544 Fix windows compiling (#16230)
6 years ago
minqiyang 362253732c Polish code
6 years ago
minqiyang c0ddb93ccc Polish code
6 years ago
minqiyang b5078c211a Make infer var type virtual
6 years ago
minqiyang 438bca9c3d Implement Runtime Var Type Inference
6 years ago
luotao1 46ee6bb1aa fix distributed unit-tests
6 years ago
luotao1 1b59bed989 Merge branch 'develop' into runtime_context
6 years ago
luotao1 6ce25c99a0 Merge branch 'develop' into runtime_context
6 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
6 years ago
minqiyang ca392c7e97 Implement infer var type context
6 years ago
liuwei1031 1c6caf8466
1. disable reuse SELECTED_ROWS type variable (#16150)
6 years ago
Wojciech Uss b9252f3df8 Add cpu_quantize_squash_pass for C-API quantization (#16128)
6 years ago
luotao1 b2898c0f57 Merge branch 'develop' into runtime_context
6 years ago
sneaxiy a7a4f053da Merge develop
6 years ago
Tao Luo 4ef6f738c3
Merge pull request #16154 from luotao1/infershape_example
6 years ago
minqiyang 42e96a029f Accelerate CPU part
6 years ago
sneaxiy 682f2dbf29 merge develop
6 years ago
sneaxiy 2c4fcaa683 merge develop
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
Yan Xu 30568473ec
fix broadcast on mp mode (#15951)
6 years ago
baojun e3c37bd564 remove const_cast and refactor ngraph engine code (#15925)
6 years ago
luotao1 b561ad1e55 Merge branch 'develop' into runtime_context
6 years ago
Zhen Wang 41b8cf0bae
Merge pull request #16162 from wzzju/fix_nan_static_quant
6 years ago
luotao1 fe78a92e6e refine with comments
6 years ago
Zhen Wang 94b7c1ea7b
Merge pull request #16107 from wzzju/add_graph_clone
6 years ago
wopeizl 85709f4378
restore the exception caught since it is necessary for python call stack (#16160)
6 years ago
Zhen Wang 5685a48c23 Add some fixme. test=develop
6 years ago
luotao1 8f6597aa0e Merge branch 'develop' into infershape_example
6 years ago
Zhen Wang ac6ef06ffa Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang 01eddf125c Not add graph copy construction method. test=develop
6 years ago
Zhen Wang 1b9c8d5f06 add clone function for IrGraph. test=develop
6 years ago
Zeng Jinle 472f16b5aa
Merge pull request #16063 from sneaxiy/enhance_gc
6 years ago
luotao1 31ccaf0916 add all_kernels_must_compute_runtime_shape example for speedup infershape
6 years ago
chengduo ad80bde824
Revert "Revert "Add Event for TensorCopy"" (#16035)
6 years ago
sneaxiy 732fa00eaf disable gc in recurrent_op currently
6 years ago
Qiao Longfei ff8054c5a7 can run
6 years ago
Yihua Xu 40f1dd818b Fix the node's order issue when the content of graph is changed (#16088)
6 years ago
Zhaolong Xing 3d63aa0a11
Merge pull request #15729 from NHZlX/add_static_model_load_for_trt
6 years ago
Qiao Longfei 3225e19591 fix remove recv op
6 years ago
Qiao Longfei fe6a840924 fix delete recv ops
6 years ago
Wu Yi d206582337
add parallel graph dist test (#16076)
6 years ago
Qiao Longfei 446fdf9563 fix compile problem
6 years ago
Qiao Longfei a23f1ee85a optimize code
6 years ago
Qiao Longfei a0bb18beec Merge branch 'add-async-ssa-graph-executor' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
sneaxiy 2a639d5c2a add allocator chain to fix bug
6 years ago
liuwei1031 045e5911bf
fix a code bug which cause crash when empty variable is used, test=develop (#16080)
6 years ago
sneaxiy 7b608396fe fix travis-ci format check
6 years ago
Qiao Longfei 255b36dad2 can run
6 years ago
Qiao Longfei 5e8de51409 code format test=develop
6 years ago
Qiao Longfei 4e218dabc5 code format test=develop
6 years ago
Tao Luo 6375fe45d7
Merge pull request #16039 from luotao1/execution_context
6 years ago
sneaxiy 814a759061 merge develop
6 years ago
sneaxiy 597dc65e76 enhance gc
6 years ago
liuwei1031 caadd0581d
add IfElse test case for ir memory optimize (#15998)
6 years ago
Qiao Longfei f28c258453 code clean test=develop
6 years ago
Qiao Longfei 8c38aca954 tmp commit
6 years ago
Tao Luo f4587789d8 remove legacy function in ExecutionContext
6 years ago
Liu Yiqun 1041e18c47 Refine codes.
6 years ago
luotao1 c0b240aa43 try to fix distributed unit-test
6 years ago
luotao1 784826a4f5 enhance cache runtime_context for different scope
6 years ago
Qiao Longfei fab1b54d99 Merge branch 'add-communicator' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Qiao Longfei 8744f9a083 fix parallel executor async mode
6 years ago
Qiao Longfei e70b1727ef Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Liu Yiqun d8a939d8a8 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
chengduo e2da3a5b22
Revert "Add Event for TensorCopy" (#16022)
6 years ago
luotao1 2fb38c108c Merge branch 'develop' into runtime_context
6 years ago
sneaxiy a9ea99d700 merge develop
6 years ago
Qiao Longfei 3691a46fa3 improve communicator
6 years ago
chengduo ae37f82964
Unified ParallelExecutor and Compiler (#15970)
6 years ago
chengduo 7235fd662b
Add Event for TensorCopy (#15953)
6 years ago
luotao1 82b0bb9d72 fix cpplint error
6 years ago
Liu Yiqun d4674dab13 Cache the chosen kernel of operators'.
6 years ago
luotao1 9773f38f99 cache runtime_context
6 years ago
tangwei12 6d5a04c1e7
add op type in check nan/inf (#15986)
6 years ago
Qiao Longfei 847e4f4e85 pure async mode train
6 years ago
Qiyang Min 187cffd019
Merge pull request #15928 from velconia/imperative_backward_hooks
6 years ago