Xin Pan
2349acea48
checkpoint
...
test=develop
6 years ago
Xin Pan
11d4d39cd7
forward working
...
test=develop
6 years ago
Xin Pan
b629133375
checkpoint runnable PyLayer
...
test=develop
6 years ago
peizhilin
a6f5ceee74
add the python callstack for debug support test=develop
6 years ago
Xin Pan
0d0bc61248
update api
...
test=develop
6 years ago
chengduo
eabb2105fa
Refactor MultiDevSSAGraphBuilder ( #15090 )
...
* Refactor ParallelExecutor
test=develop
* extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder
test=develop
* Refactor MultiDevSSAGraphBuilder
test=developt
* Remove enable_data_balance
test=develop
* code refine
test=develop
* remove data balance
test=develop
* refine ScaleLossGradOp
test=develop
* remove uncessary file
test=develop
* code refine
test=develop
* modify function name
test=develop
* follow comments
test=develop
* add is_distribution field
test=develop
* set is_distribution
test=develop
* fix DistSSAGraphBuilder
test=develop
6 years ago
Xin Pan
5e928e579a
try unify Executor and ParallelExecutor
...
test=develop
6 years ago
minqiyang
2547f9d1b8
Polish code
...
test=develop
6 years ago
minqiyang
09e2e66236
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
6 years ago
gongweibao
ce70229ba6
Add max_body_size flags to brpc ( #15084 )
6 years ago
minqiyang
6a5f604607
Support stop_gradients var in imperative backward
...
test=develop
6 years ago
minqiyang
9e3155e01d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
336160e651
Complete imperative optimizer implementation
...
test=develop
6 years ago
minqiyang
d4b9928c5a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
5822f7f1d8
Polish code
...
test=develop
6 years ago
Xin Pan
7b6bf9ddf2
make fill_constant kernel-based
...
test=develop
6 years ago
Xin Pan
61491ce250
clean
...
test=develop
6 years ago
Xin Pan
ce7e503cbe
refactor to avoid scope.
...
test=develop
6 years ago
Xin Pan
71a4a8e981
Merge pull request #15071 from wopeizl/revert/15035
...
Revert "cherry-pick the #12759"
6 years ago
minqiyang
68e9b841ab
Add support for optimizer
6 years ago
Zeng Jinle
9c6a0203e2
Merge pull request #15073 from sneaxiy/add_scope_pool
...
Add scope_pool
6 years ago
sneaxiy
3e917a934a
add scope_pool
...
add module cleanup
test=develop
6 years ago
peizhilin
2388d0e7d6
Revert "cherry-pick the #12759"
...
test=develop
This reverts commit 7f6d8acecb
.
6 years ago
peizhilin
7f6d8acecb
cherry-pick the #12759
...
test=develop
6 years ago
minqiyang
3de0f612e8
Polish code
...
test=develop
6 years ago
Zeng Jinle
95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
...
Featue/py_func op
6 years ago
minqiyang
1c116462cc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
29697c2e25
Add stop_gradient to VarBase to support loss function
...
test=develop
6 years ago
chengduo
550e7e410b
Code Clean parallel_executor.py ( #14849 )
...
* refine parallel_executor
* remove uncessary code
test=develop
6 years ago
minqiyang
3cd10a7c4f
Add Conv2D forward
...
test=develop
6 years ago
minqiyang
8d88c5a87d
Shameless copy
6 years ago
Xin Pan
6324032602
MLP forward backward
...
test=develop
6 years ago
colourful-tree
44ad2f4479
Merge pull request #14873 from colourful-tree/develop
...
add pslib(pserver) to paddle, an industrial scale high performance parameter server library
6 years ago
dzhwinter
7cd24b1318
add ir memory optimize. ( #14530 )
...
* follow comments. test=develop
* Fix typo
* fix compile error. test=develop
* merge develop branch. test=develop
* Remove set_equal
* Polish code
* Delete unused functions
test=develop
* polish code. test=develop
* follow comment
* polish code.
* fix windows compile error. test=develop
* fix op handle.
* rerun ci. test=develop
* rerun ci. test=develop
* rerun macci. test=develop
* polish code. test=develop
* rewrite sort code. test=develop
* remove unused code. test=develop
* fix tests. test=develop
* fix conflict. test=develop
* follow comment. test=develop
* merge develop branch. test=develop
* fix tests. test=develop
* remove ToTypeIndex. test=develop
* rerun ci. test=develop
6 years ago
guru4elephant
a79a3ea2f0
Merge branch 'develop' into develop
6 years ago
wopeizl
0f085f0a5a
Merge pull request #14892 from wopeizl/windows/port3
...
fix script issue
6 years ago
Yu Yang
bacf1d2399
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
gongweibao
0b1c7d838c
Add brpc serialization support. ( #11430 )
6 years ago
peizhilin
23dec78772
fix script issue
...
test=develop
6 years ago
dongdaxiang
e52bb816e3
add copyright to __init__.py in distributed folder
6 years ago
sneaxiy
f0df62f136
add more unittest case
...
test=develop
6 years ago
Yu Yang
4ecdb6f486
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
...
test=develop
6 years ago
sneaxiy
f6741df462
merge develop
...
fix bug
test=develop
6 years ago
heqiaozhi
f81957a753
refine cmake for pslib & pre_define
6 years ago
sneaxiy
deb0d41cea
fix cmake
...
fix cmake again
test=develop
6 years ago
sneaxiy
e7c5c9d2de
remove unnecesary code
...
test=develop
6 years ago
sneaxiy
db2daefe50
merge develop
...
test=develop
6 years ago
sneaxiy
ca84c2ca8f
merge develop
...
test=develop
6 years ago
sneaxiy
e240ba2918
implement backward
...
test=develop
6 years ago
Yu Yang
81520a24cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
6 years ago
Yu Yang
9bd70a1e04
Change tensor uses proto::VarType::type
...
test=develop
6 years ago
Qiao Longfei
1213e2838f
Merge pull request #14820 from jacquesqiao/fix-split-selected-rows
...
split selected rows op should always init output selected rows
6 years ago
sneaxiy
7923042365
merge develop
...
test=develop
6 years ago
Qiao Longfei
0a7c7e97af
test zero output of split_selected_rows_op
...
test=develop
6 years ago
Tao Luo
3243b45bb2
Merge pull request #14651 from luotao1/has_attr
...
add Opdesc's HasProtoAttr
6 years ago
heqiaozhi
016a068775
stop server
6 years ago
sneaxiy
8760d23c7d
featue/py_func
6 years ago
Tao Luo
067ed70f2d
add HasProtoAttr function in op_desc.h, clean node.h
...
test=develop
6 years ago
Xin Pan
748549b2e3
Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"
...
This reverts commit b1d3a1c8b4
, reversing
changes made to f1fb64b17f
.
6 years ago
Xin Pan
c049fa7cf7
Revert "Revert "Revert "Imperative"""
6 years ago
gongweibao
f1fb64b17f
Add reduce sparse tensor feature. ( #14757 )
6 years ago
heqiaozhi
575ae7c6c3
refine pslib inferface & fix some bugs
6 years ago
Xin Pan
2538ef64f1
Revert "Revert "Imperative""
6 years ago
Xin Pan
6217f42ab7
Revert "Imperative"
6 years ago
Qiao Longfei
daba57f752
complete ctr_reader
6 years ago
sneaxiy
64ad051b9a
merge develop
...
test=develop
6 years ago
sneaxiy
c47c451a00
fix bug
6 years ago
heqiaozhi
a77fa67bbd
async_thread_trainer & libmct & pslib.cmake
6 years ago
Xin Pan
b80fe8264a
polish
...
test=develop
6 years ago
Xin Pan
e5d64fd4d1
initial imperative
...
test=develop
6 years ago
Xin Pan
4d0df1fea7
add fields for autograd
...
test=develop
6 years ago
Xin Pan
8138391631
add OpBase and unify with VarBase
...
test=develop
6 years ago
Xin Pan
f6f0692451
clean up
...
test=develop
6 years ago
Xin Pan
aeb74af54c
allow operator to run imperatively
6 years ago
Xin Pan
b1f6fda5e5
run forward
6 years ago
Xin Pan
a6d23083f0
some tracing
...
test=develop
6 years ago
Xin Pan
dac92e560c
initial commit
6 years ago
Qiao Longfei
7f07dfa1a4
clean code
6 years ago
Wang Guibao
41e19eb431
AsyncExecutor ( #14627 )
...
* AsyncExecutor: C++ side
* Google naming conventions
* Rename MultiExecutor to AsyncExecutor
* pybind with async_executor
* Naming convention
* remove some flags and unused code
* add refactored file of async_executor and data_feed
* clear async executor interface and add data feed factory
* split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
* Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
* refine async_executor_refactor.cc
* add some files about datafeed
* Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
* Interface rework
* add MultiSlotDataFeed
* Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
* update data_feed for add MultiSlotDataFeed
* update datafeed and async_executor to run bow_net demo
* fix bug that finish_set_filelist failed in multithread
* delete finish_binding_memory_(flag), because it can not be marked under the current interface
* Fix bug
* update async_executor.py for support set_use_slots
* update async_executor.py for support set_use_slots and set set_dense_slots
* fix bug that when the number of files is less than the number of threads, it will fetch nan
* remove redundant code, and make executor exit when set a illegal queue size
* add batch_size check
* add MultiSlotDesc
* Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
* add some checkpoint in DataFeedDesc
* add CheckFile function in MultiSlotDataFeed
* update something error info
* fix deaded lock bug
* Fix fetch variable
* Merge error
* fix code style in async_executor
* using one lock blocking queue replace two lock blocking queue because of some bugs
* update code style
* add utest for data_feed
* Fix fetch var
* update utest for data_feed for multithread
* update SetFileList info
* fix bug in utest of data_feed
* Add comments for python
* Add comments for python code
* Fix pybind.cc with new pybind11 version
* add note for DataFeedDesc's set_use_slots function
* Add save_model
* update data_feed_test for multi-type
* add comment for executor_thread_worker
* Remove unused code
* update data_feed_test for generate test data file
* removed unnecessary interfaces and add comments
* c++ style check
* update data_feed.cc
* AsyncExecutor: C++ side
Google naming conventions
Rename MultiExecutor to AsyncExecutor
pybind with async_executor
Naming convention
remove some flags and unused code
add refactored file of async_executor and data_feed
clear async executor interface and add data feed factory
split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
refine async_executor_refactor.cc
add some files about datafeed
Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
add MultiSlotDataFeed
Interface rework
Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
update datafeed and async_executor to run bow_net demo
update async_executor.py for support set_use_slots
Fix bug
update async_executor.py for support set_use_slots and set set_dense_slots
fix bug that when the number of files is less than the number of threads, it will fetch nan
remove redundant code, and make executor exit when set a illegal queue size
add MultiSlotDesc
Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
add some checkpoint in DataFeedDesc
Fix fetch variable
fix code style in async_executor
Fix fetch var
add utest for data_feed
Add comments for python
update utest for data_feed for multithread
fix bug in utest of data_feed
Add comments for python code
Fix pybind.cc with new pybind11 version
add note for DataFeedDesc's set_use_slots function
update data_feed_test for multi-type
Add save_model
update data_feed_test for generate test data file
removed unnecessary interfaces and add comments
add comment for executor_thread_worker
Remove unused code
update data_feed.cc
c++ style check
* commit for code style
* commit for code style
* commit for code style
* commit for code style
* Comment away __init__ in async_executor.py
* clang-format fix test=develop
* use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
* commit for update code style
* commit for update code style
* Add async_executor demo; Remove some methods
test=develop
* commit for update code style
* commit for update code style
* commit for update code style
* update API.spec
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* Fix API.spec
test=develop
* Fix API.spec
test=develop
* Fix windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix code style
test=develop
* Fix code style
test=develop
* update datafeed
* Fix code style
test=develop
* update data_feed_test for test Tensor test=develop
* Fix code style
test=develop
* Fix windows build failure
test=develop
* Fix code style and windows build failure
test=develop
* Fix PYTHON3.5 build failure
test=develop
* AsyncExecutor API
test=develop
6 years ago
Qiao Longfei
75eba6108d
Add scope doc ( #14582 )
...
* add doc for scope
* update doc for force_init_on_cpu
test=develop
* follow comment test=develop
* update format test=develop
6 years ago
minqiyang
a02ce58f2c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
...
test=develop
6 years ago
wopeizl
db9284ecde
Merge pull request #14617 from wopeizl/windows/online
...
Windows/online
6 years ago
gongweibao
867c312bc4
Fix allreduce dependency order. ( #14586 )
6 years ago
peizhilin
511cc9024a
fix for build issue
6 years ago
minqiyang
be04d99fe4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
...
test=develop
6 years ago
minqiyang
53433d7f2e
Revert the changes of VLOG
...
test=develop
6 years ago
minqiyang
d2045260a5
Change visibilities of variant_visitor of pybind11
...
test=develop
6 years ago
minqiyang
b67229187e
Change to PYBIND11_MODULE because the deprecation of PYBIND11_PLUGIN
...
test=develop
6 years ago
minqiyang
81994e84e0
Change the include files because the version changes of pybind11
...
test=develop
6 years ago
sabreshao
61c5f13fcf
Fix cmake for AMDGPU platform ( #13801 )
...
* HIP cmake.
Enable whole archieve build for pybind library.
Disable two warning.
Rollback to C++11.
Link RCCL to WA gpu kernel loading issue.
Update eigen to fix build failure.
Add more include directories.
Fix O3 build failure.
Update eigen.
fix tensor_util_test segment fault issue
add more macro check in hip.cmake.
we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future.
Fix rocRAND load.
Update eigen to fix gru_unit_op and reduce_op.
Add HIP support to testing.
Update eigen to support int16 and int8 in arg min and arg max.
* add rocprim as cub library used by nv implementation
* Reduce build time in rocprim.
* Add rocprim introduction, remove useless cmake code.
* Remove useless flags and format cmake file.
6 years ago
Jiabin Yang
de2db11735
Merge pull request #14537 from reyoung/feature/fix_macos_ut
...
fix(Cpu): fix cpu compile and unittest
6 years ago
wopeizl
d9a1f3e58e
Windows/online ( #14474 )
...
* add recordio support
* disable the openblas multi-thread on windows since no support
adjust the python script
* code style
* code style
test=develop
* add create_recordio_file_reader back
* fix code style
test=develop
* fix the gtest.cmake on windows
* fix cc_test on windows
* fix the win build
test=develop
* remove fused compile support on windows
test=develop
* add the jit support
test=develop
* add the jit support, test=develop
* add the jit support, test=develop
* add the jit back
fix compile error on windows
* rollback test=develop
* test case fix
* disable DSO by default on windows
* exclude warpctc_op on windows
* exclude the dynload_warpctc out on windows
test=develop
* fix the scripts error
test=develop
* disable avx on windows by default
test=develop
* re-organize the cmake file
* disable mkl on windows by default
* add warp_ctc back
* fix the dependency
* fix the dependency
* fix the build issue on windows
* remove unsupported flag on windows
* code style
* code style
test=develop
* fix issue
* add profiler, parallel_executor back
* clean up the pre-definitions on windows
* fix build issue
* test=develop
6 years ago
Yu Yang
533c5d5803
fix(Cpu): fix cpu compile and unittest
...
test=develop
6 years ago
Yu Yang
a685f305f8
Merge pull request #14479 from reyoung/feature/fix_macos_ut
...
fix(Mac): fix unittest of macos
6 years ago
Yu Yang
98bbfc17be
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
Yu Yang
7486b0ddec
fix(Mac): fix unittest of macos
...
test=develop
6 years ago
Wu Yi
a2d9b34417
Refine operator cmake ( #14413 )
...
* wip simplify operator framework
* wip
* wip
* done test=develop
* clean test=develop
* fix test=develop
* fix deps test=develop
* fix cpu build test=develop
* fix tensorrt build test=develop
* fix tests test=develop
* fix test=develop
* fix cpu build test=develop
6 years ago
Yu Yang
c8f6e70ab4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
dzhwinter
d3aed98d86
Merge pull request #14320 from wopeizl/windows/online
...
Windows/online
6 years ago
peizhilin
1a9008c420
code style fix
...
test=develop
6 years ago
Xin Pan
bae3659714
more test
...
test=develop
6 years ago
Xin Pan
759ffca423
some improvements
...
test=develop
6 years ago
Xin Pan
99dffb91d6
allow to repeatedly share and update BuildStrategy
...
test=develop
6 years ago
Yu Yang
02631965c8
Refine
6 years ago
sneaxiy
d231e55065
merge develop
...
test=develop
6 years ago
peizhilin
ca60e1d34d
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
52f7644f53
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yu Yang
fdc689142c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
minqiyang
87450b9ad4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
6 years ago
chengduo
c5b6573a5a
Fix input<tensor> ( #14208 )
...
* fix input<tensor>
test=develop
* fix split_ids
test=develop
* ElementwiseMul should not support SelectedRows
* fix scale op
test=develop
* change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar()
* fix operator
* refine MultiOutput
* fix MultiOutput
test=develop
* disable test_dist_save_load
test=develop
* fix elementwise_op
test=develop
* add get_sparse_as_op
test=develop
* add info for check
test=develop
* rename get_sparse_as_op with extract_rows_as_op.
test=develop
* elementwise doesn't support selected_rows
* fix regularizer
* remove extract_rows_as
test=develop
* fix ci
test=develop
* add test for sum_op
* fix regularizer
test=develop
* test=develop
* fix pserver weight decay multi inputs test=develop
6 years ago
minqiyang
0c3227a523
Change the origin VLOG level to 10 times
...
Fix code to support cpplint syntax check
test=develop
6 years ago
peizhilin
869487a2b7
Merge remote-tracking branch 'origin/develop' into windows/build
6 years ago
Yu Yang
057a682ee9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Yu Yang
c28beb8a3c
test(Pe): add dry run tests for pe ( #14254 )
...
Dry run tests will skip `Op.Run` and just perform job scheduling. It helps to analysis dead lock in PE.
test=develop
6 years ago
peizhilin
1f12ba6192
gpu support, fix build issue:
...
1. Non utf-8 characters within comments of OPs may lead to protobuf fail to parse_from_string
2. comment out some ops which not supported on windows
3. cuda libs may not be correctly linked to target on windows
6 years ago
sneaxiy
faac8a76ce
remove unnecessary codes
...
test=develop
6 years ago
peizhilin
71d7980f69
fix build issue 1
6 years ago
peizhilin
9d67c1fb69
cpu build support
6 years ago
sneaxiy
2494ca83ab
test=develop
6 years ago
sneaxiy
cef8cc81db
merge develop
6 years ago
Yu Yang
c01696f8c2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
Qiao Longfei
d26ff8cb2d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu-for-1.1-merge-with-shape
6 years ago
Wu Yi
26200f2e42
[1.1] [project] train imagenet using large batch size ( #13766 )
...
* fix nccl2 lars dist support
* put lars in momentum op
* add tests lars
* fix ci
* fix cpu kernel
* soft warning
* remove lars in test_recognize_digits.py
* move to another op
* add file
* update api.spec test=develop
* update test=develop
* fix api.spec test=develop
* wip
* wip, finish grad merge ops
* wip, finish graph build
* wip test running
* work on 1 gpu
* workable version
* update
* fix tests
* fuse broadcast op
* fix compile failed
* refine
* add batch merge test mnist
* fix CI test=develop
* fix build
* use independent bn params for batch merge test=develop
* update api.spec
* follow comments and for test
* wip
* refine tests test=develop
* follow comments test=develop
* remove startup bn modify test=develop
* follow comments test=develop
* fix merge test=develop
6 years ago
sneaxiy
a314a80cdb
merge develop
6 years ago
seiriosPlus
06de824ba8
fix shape in floats
6 years ago
tangwei12
755927d2b0
shape type to int64_t, test=develop
6 years ago
tangwei12
8b7f45a889
add longs in framework
6 years ago
Yu Yang
461f71a90b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
sneaxiy
2002e71da8
fix pinned allocator
6 years ago
Qiao Longfei
681226e97c
Merge pull request #13864 from jacquesqiao/py-reader-add-test-mode
...
reader block queue add test mode
6 years ago
Qiao Longfei
60030e8678
change the use of FLAGS_reader_queue_speed_test_mode
...
test=develop
6 years ago
Tao Luo
323d67cfc1
Merge pull request #13879 from panyx0718/doc
...
fix doc
6 years ago
chengduo
2562eb92b8
Add strategy doc ( #13849 )
...
* add strategy doc
test=develop
* fix doc
test=develop
* add ParallelExecutor arg doc
test=develop
6 years ago
Xin Pan
b11372a0af
fix doc
...
test=develop
6 years ago
Qiao Longfei
8686f7c68e
add reader_queue_speed_test_mode flag for speed test
6 years ago
sneaxiy
d87569134c
test=develop
6 years ago
Xin Pan
63b2e98f3d
Explain LoD and a few other concepts
...
test=develop
6 years ago
chengduo
e1904ac2c8
Add doc ( #13765 )
...
test=develop
6 years ago
sneaxiy
4c672ab1a2
Merge reyoung:rewrite_allocation
6 years ago
Yu Yang
15076c325e
Add comments and polish code style
6 years ago
Yu Yang
29f66c2408
Polish code
6 years ago
Yu Yang
6ca37448ac
Refine prelu_op
6 years ago
Yu Yang
186b2b138d
Revert "Merge pull request #13201 from reyoung/revert_callstack" ( #13697 )
...
This reverts commit 21bb9e91fc
, reversing
changes made to 3fa68dc101
.
test=develop
6 years ago
Yu Yang
ae9378f640
Refine PyBind
6 years ago
Yu Yang
a1a01899c8
Refine
6 years ago
Yu Yang
31270e58d0
Add communication attr
6 years ago
Xin Pan
ddd60581b7
clean up channel
...
test=develop
6 years ago
Xin Pan
7cd2761736
Merge pull request #13416 from panyx0718/ir
...
PassBuilder
6 years ago
Wu Yi
16e73e0d28
hide operator API ( #12543 )
...
* hide operator API
* update
* update api.spec
* fix merge
* fix test
6 years ago
Xin Pan
cbdf9833e3
hide create_passes_from_strategy for now
6 years ago
Xin Pan
f117feab0c
modify comments
6 years ago
Yu Yang
7119d6c3cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_callstack
6 years ago
Xin Pan
b43e49fa31
fix
6 years ago
Xin Pan
36c2a9af27
pass builder allow cutomize pass in python.
6 years ago
Wu Yi
29c63d180f
[Feature] dist op role and lr op role, to support memory optimize with dist training ( #13220 )
...
* wip
* clean up
* should fix running with memopt
* add ut
* mark lr schedule op role
* hide lr_schedule_guard
* use op_role_var instead of ufind
* unify dist test name
* wip for py3 support
* fix var deref
* fix python3 mem_opt order
* remove comments
6 years ago
chengduo
d402234ba8
Feature/op_fuse_pass ( #12440 )
...
* Add Preface
* Add demo code
* Save file
* Refine code
* seems can work
* use elementwise strategy
* Use ElementwiseComputeEx
* Add comments
* extract functions from operator
* Refine code
* Follow comment
* code refine
* add op_fuse pass
* add backward
* code refine
* use TopologySortOperations
* follow comments
* refine IsFusible
* code enhance
* fix op_fusion_pass
* refine code
* refine fuse_elemwise_act_op
* adjust the input and output
* refine logic
* add intermediate_edge
* disable inplace
* follow comments
* refine logic
* follow comments
* Remove the removable IntermediateOut
* change strategy
* code refine
* enable fuse backward
* code refine
* code refine
* rename unit test
* follow comments
6 years ago
Xin Pan
5ce77889f4
clean unused inference_optimize c++ implementation
7 years ago
Yancey1989
5ce1a960a5
move bcast op into pass
7 years ago
Xin Pan
0904f07d46
polish
7 years ago
Xin Pan
56a977d436
add test
7 years ago
Xin Pan
926e1077ca
version
7 years ago
Yu Yang
cda7842e26
Revert "Revert "Add Python Callstacks when Op::Run error ( #12759 )""
...
This reverts commit 1f270275a6
.
7 years ago
Xin Pan
51ef0ad766
allow to use name_scope for debugging and visiualization
7 years ago
qingqing01
1f09bc320c
Support data type int8_t . ( #12841 )
...
* Support int8 type.
7 years ago
dzhwinter
eca4563e5d
operators module ( #12938 )
7 years ago
guochaorong
1f270275a6
Revert "Add Python Callstacks when Op::Run error ( #12759 )"
...
This reverts commit b2df17003f
.
7 years ago
Wu Yi
b8da70c370
Resovle multi gpu async deps ( #12828 )
...
* dist transpiler add control dependency var between send and recv
* fix async deps
* follow comments and refine
* fix deps connect for rpc ops
7 years ago
Yu Yang
b2df17003f
Add Python Callstacks when Op::Run error ( #12759 )
...
* Add Python Callstacks when Op::Run error
* Skip op with sub-block
* refactor: refine callstack info's format
* Reshape only support matrix
* Polish Python code
* Fix UT
* Fix Py3
7 years ago
dzhwinter
e23ddf6ae4
status ( #12764 )
7 years ago
Yu Yang
c6af7201e9
Merge pull request #12692 from reyoung/feature/fast_executor
...
Feature/fast executor
7 years ago
Qiao Longfei
653fad08f8
Optimize selected rows for dist lookup table with pthread rwlock ( #12635 )
...
Optimize selected rows for dist lookup table with rwlock
7 years ago
yuyang18
d1d825ee02
Hide unnecessary API
7 years ago
yuyang18
265302edea
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fast_executor
7 years ago
minqiyang
77f12e000f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
gongweibao
842fb021b3
Fix clone() bug. ( #12583 )
7 years ago
minqiyang
e0d5f8a820
Move compat module to python/paddle
7 years ago
minqiyang
1800fef142
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
yuyang18
05cadf1b24
Add FastExecutor
7 years ago
Wu Yi
8b77448d5f
hide misc APIs ( #12540 )
...
* hide misc APIs
* update
* fix transformer test
* update API.spec
7 years ago
minqiyang
6abe819f07
Fix pybind11 problem
...
Fix str and bytes problem
Fix sorted problem
Fix math problem
Fix CI problem
7 years ago
minqiyang
559d36328c
Apply 2to3 to current paddle main python code
7 years ago
Yancey1989
d4f51218ef
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
Yancey1989
efd5a84986
update executor interface
7 years ago
Tao Luo
3694fd5c4a
Merge pull request #12109 from emailweixu/cpu_only
...
Fixed unittests for WITH_GPU=OFF and WITH_DISTRIBUTE=OFF build
7 years ago
Yancey1989
7570d8e77c
add rpc complete interface
7 years ago
Wu Yi
db67d60e31
Remove block api ( #12107 )
...
* remove block api
* remove clone_variable
* hide block inner apis
* update
* fix tests
7 years ago
Wei Xu
264e8305b0
Fixed unittests for WITH_GPU=OFF and WITH_DISTRIBUTE=OFF build
7 years ago
Yancey
0042ba93c8
Merge pull request #12127 from Yancey1989/enforce_rpc_timeout
...
Enforce rpc timeout
7 years ago
yuyang18
4ff1bde5fb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/hide_api_cont
7 years ago
Yu Yang
ebe3b5e78a
Merge pull request #11853 from sneaxiy/complete_py_reader_python
...
Add Python Reader Op (Python side and unittests)
7 years ago
Yancey1989
d14afcedeb
polish function name
7 years ago
Yancey1989
4a91a14549
enforce rpc client timeout
7 years ago
Yancey1989
9de096bd58
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into fix_cpu_pe
7 years ago
Yancey1989
1effba3312
fix pe with cpu place
7 years ago
sneaxiy
e40905daed
Merge branch 'complete_py_reader_python' of https://github.com/sneaxiy/Paddle into complete_py_reader_python
7 years ago
Yancey1989
bbd53131f0
update
7 years ago
sneaxiy
df7ecbf5a1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into complete_py_reader_python
7 years ago
yuyang18
b83ffda4aa
Try hide APIs
7 years ago
fengjiayi
5528f59900
Split ReInit() to Shutdown() and Start()
7 years ago
dzhwinter
4ed0b62476
Move fluid::framework::InitDevices into fluid::platform ( #11757 )
...
* move to platform
* "move init from framework to platform"
* "remove used init"
* "fix ci"
* "fix ci"
* "fix generic"
* "fix ci"
* "fix ci"
* "fix ci"
* "disable fragile test"
7 years ago
sneaxiy
739c330914
fix merge conflict
7 years ago
fengjiayi
ce16b40b04
Merge pull request #11891 from JiayiFeng/dev_eof_exp
...
Add EOFException to represent EOF in C++ reader
7 years ago
Yancey
f7fd711e3f
Merge pull request #11868 from Yancey1989/dist_pass_barrier
...
add dist pass barrier
7 years ago
fengjiayi
3fab4f65a4
Add EOFException to represent EOF in C++ reader
7 years ago
sneaxiy
1b59220d50
complete python reader op python side
7 years ago
fengjiayi
ff4317cee9
follow comments
7 years ago
Yancey1989
1366832a41
add dist pass barrier
7 years ago
sneaxiy
d4d946db5a
update blocking queue
7 years ago
sneaxiy
502faf62a9
complete_py_reader_cpp
7 years ago
sneaxiy
697ba4b13d
Add Python array reader op
7 years ago
Yancey
0d4b376fc0
Merge pull request #11585 from Yancey1989/fix_pserver_sub_blocks
...
fix pserver sub-blocks
7 years ago
Yancey1989
8cb494f79c
add blocks attr type in proto
7 years ago
Yancey1989
56a903d3ac
use optimize block list instead of first optimize block
7 years ago
Xin Pan
32bfebfe38
disable the LODTensor warning for now
7 years ago
fengjiayi
6e1c48d1fd
Merge pull request #11576 from JiayiFeng/dev_refine_bilinear_interp
...
Add bilinear interp supporting for uint8
7 years ago
gongweibao
80f63642e6
Add comments to `set_lod`. ( #11588 )
7 years ago
fengjiayi
47c02b5c32
Add unit tests
7 years ago
Kexin Zhao
417fcf4f43
Modify Pybind LoDTensor API according to length-based LoD ( #11106 )
...
* add lod_tensor util and modify pybind
* refind pybind LoDTensor API and modify LoDTensor and DataFeeder test
* fix test error
* fix detection map op test
* fix reorder_lod_tensor test
* fix seq_concat_op
* fix chunk evel op test
* fix target assign op
* fix warp ctc op
* address comments step 1: reverse reset_lod op
* step 2: modify op test
* add warning message
* remove has_valid_lod
* add back has_valid_lod
* address comments
* add exception catching trial
7 years ago
Wu Yi
34865f2de3
Trainer send term signal ( #11220 )
...
* wip
* use executor.complete to end trainer
* fix build
* fix build with distribute off
* fix typo
* fix cmake typo
* fix build
7 years ago
chengduoZH
aadaadf735
replace use_event with use_cuda, because use_event means the program running with CUDA, so use_cuda maybe more intuitive.
7 years ago
yuyang18
d9af153232
SSA Graph Builder Factory
...
* Use Builder Chain to decorate new builders. It is easy to extend
builders.
* Make graphviz path as a build strategy, not a FLAGS.
7 years ago
yuyang18
86a61c177f
Add ScopeBufferedSSAGraphExecutor
7 years ago
Xin Pan
35d5b1b9e2
Merge pull request #11036 from panyx0718/dist_timeline
...
better profiler and benchmark
7 years ago
Xin Pan
f14e579cc3
clean up
7 years ago
fengjiayi
c95cd4742f
Merge pull request #10975 from JiayiFeng/fix_bug_in_uint8_support
...
Correct uint8 support
7 years ago
Yancey1989
20c24c05aa
singleton rpc_client
7 years ago
fengjiayi
9f81f1ce57
Add uint8 support in pybind.cc
7 years ago
yuyang18
0e5a278c23
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/support_op_role
7 years ago
dzhwinter
5828101c23
make uint8 support in data_type transform and memory optimize ( #10715 )
...
* "a piece of job."
* "fix typeo"
* "fix ci"
7 years ago
yuyang18
3923d40992
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/support_op_role
7 years ago
yuyang18
7c777dd549
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exec_strategy
7 years ago
yuyang18
017bba1664
Add op role
7 years ago
yuyang18
08295f9877
Add build strategy
7 years ago
yuyang18
e5281b3c2d
Clean code & add execution strategy
7 years ago
typhoonzero
928418a9ac
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
chengduoZH
97cb5479ae
change PE strategy
7 years ago
typhoonzero
d9320dcd94
complete code
7 years ago
fengjiayi
d11b8e56e5
fix
7 years ago
yangyaming
f456cd8079
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-10219
7 years ago
yangyaming
82571deb89
Change `customize_loss_grad` to `use_default_grad_scale`.
7 years ago
Yu Yang
0c24b3f937
Clean memcpy async
7 years ago
Yu Yang
7a395881d4
Add customize_loss_grad option to PE
7 years ago
Yu Yang
f738691777
Merge pull request #9740 from dzhwinter/memory/activation
...
"polish activation"
7 years ago
Liu Yiqun
7ffbcbcaf0
Add flush of program desc to update the proto information.
7 years ago
dzhwinter
1fc9fe773d
merge develop
7 years ago
Yiqun Liu
598035f985
Fix a bug in save_inference_model and prune when the program is initailized by load_inference_model ( #10011 )
...
* Fix bug in save_inference_model and prune when the program is initialized by load_inference_program.
* Save the transpiled program instead.
7 years ago
dzhwinter
425a1e76a2
Merge remote-tracking branch 'origin/develop' into memory/activation_memory
7 years ago
Yu Yang
b4aaa00a8a
Polish logic of ParallelExecutor
7 years ago
Yu Yang
2ab12ca248
Add comments and clean code
7 years ago
Yu Yang
a822f8ddbb
Add wait
7 years ago