dzhwinter
381f2015a5
Merge pull request #15665 from dzhwinter/experiment/refactor_memory
...
refactor optimize pass.
6 years ago
xuezhong
eeaa2066e5
add device info to tensor
...
test=develop
6 years ago
dzhwinter
04e9776aef
add details. test=develop
6 years ago
Qiao Longfei
16af1dbc7b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
peizhilin
3a4110f960
fix ci broken randomly and disable some warnings
...
test=develop
6 years ago
dzhwinter
4f01de6378
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
liuwei1031
6e84eb131f
expose peak gpu memory API to python test=develop ( #15529 )
...
* expose peak gpu memory API to python test=develop
* add unittest for peak gpu memory monitoring test=develop
* add pybind change test=develop
* add mutex to gpu mem usage monitor test=develop
* update benchmark flag definition file test=develop
* tweak unittest for memory monitoring test=develop
6 years ago
WangZhen
2175292634
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes
...
test=develop
6 years ago
dzhwinter
06f2448848
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
Yan Chunwei
655179089f
AnalysisConfig remove contrib namespace ( #15540 )
6 years ago
Qiao Longfei
d6c0dcaa16
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
WangZhen
c67b29c178
fix some bugs of graph.to_program and get_pass.
6 years ago
dzhwinter
ee3aae56cd
merge develop branch. test=develop
6 years ago
Zhaolong Xing
97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
...
add trt int8 support
6 years ago
WangZhen
c8095eeb82
add freeze pass, and UT is passed.
6 years ago
Qiao Longfei
ada43e89c3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
乔龙飞 Qiao Longfei
c58555067e
Merge pull request #14731 from jacquesqiao/optimize-cpp-reader
...
Optimize cpp reader
6 years ago
nhzlx
36abc964df
fix pybind problem: add an enum to AnalysisConfig
...
test=develop
6 years ago
Zeng Jinle
2480a3df7d
Merge pull request #15496 from sneaxiy/lazy_allocator2
...
Fix bug when user set CUDA_VISIBLE_DEVICES be empty and run CPU-only models
6 years ago
WangZhen
dde19a0ff8
add quantization freeze pass.
6 years ago
Zeng Jinle
dec89bd7ed
Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock
...
Turn on remove_unnecessary_lock by default
6 years ago
Xin Pan
58cb18d9d9
Merge pull request #15322 from velconia/imperative_resnet
...
Imperative Resnet
6 years ago
sneaxiy
51227bd447
lazy_allocator
...
test=develop
6 years ago
minqiyang
c8965dc1ab
Polish code
...
test=develop
6 years ago
sneaxiy
ef788603d4
merge develop
...
test=develop
6 years ago
Zhen Wang
58727e8e6d
Merge pull request #15455 from wzzju/graph_quantization
...
Graph quantization pass. TODO(Add public API comments.)
6 years ago
Tao Luo
fef3fd6d62
Merge pull request #15452 from luotao1/legacy_option
...
remove legacy compiler option
6 years ago
Paddle CI
289aba750a
Polish code
...
test=develop
6 years ago
WangZhen
b913463e83
Update according to the reviewers' suggestion. test=develop
6 years ago
sneaxiy
d8568acd19
turn on remove_unnecessary_lock
...
test=develop
6 years ago
WangZhen
3ce6172052
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
WangZhen
59e5cc51d6
Add quantization transform pass and UT.
6 years ago
flame
d60751fb71
add python inference api ( #15248 )
...
add python inference api
6 years ago
dzhwinter
8f3b252392
squash commits. test=develop
6 years ago
Tao Luo
cf29ea1592
remove legacy ANDROID option
6 years ago
Qiao Longfei
45578c1b48
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
minqiyang
8ce198b2e1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
...
test=develop
6 years ago
minqiyang
31a1cd8ce5
Align the first batch of gpu resnet
6 years ago
Dun
9f8f0fc2d3
Memory optimization of depthwise conv op and group norm op ( #15313 )
...
* mem opt
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* refine code test=develop
* refine code test=develop
* refine code test=develop
* refine code test=develop
* refine with cub test=develop
* fix mkldnn test && remove comments && test=develop
* polish code && test=develop
* add only_forward test && test=develop
6 years ago
Xin Pan
9a9c690e71
Merge pull request #15343 from panyx0718/imperative3
...
add a GAN model in imperative mode
6 years ago
WangZhen
e2ff300b02
add UT for quantization.
6 years ago
WangZhen
451896fce4
init quantization.
6 years ago
Qiao Longfei
88d71fa2f9
support num_iteration_per_run
6 years ago
Qiao Longfei
ea66979684
can run
6 years ago
minqiyang
dbd4d058af
Add static implementation and fix fc layer
6 years ago
Xin Pan
e395f2c6a3
polish codes
...
test=develop
6 years ago
minqiyang
315b133e67
Add single GPU support to imperative
6 years ago
Xin Pan
9a4314f025
imperative gan
...
test=develop
6 years ago
Qiao Longfei
cd31b90a46
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
...
test=develop
6 years ago
peizhilin
9adb158e5b
Merge remote-tracking branch 'upstream/develop' into debug/support
6 years ago
minqiyang
ddfb9f1123
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_shared_ptr
...
test=develop
6 years ago
Xin Pan
d1220f23bb
Merge pull request #15229 from panyx0718/imperative
...
support python codes in the imperative model
6 years ago
minqiyang
d0b640dca1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_shared_ptr
...
test=develop
6 years ago
minqiyang
687171d22b
Move from shared_ptr to raw pointer
...
test=develop
6 years ago
peizhilin
439691f5bd
adjust the shlwapi on windows
...
test=develop
6 years ago
flame
fb63cd89d4
Add python ir graph API ( #14917 )
6 years ago
minqiyang
08e2a5d611
Polish tracer code
...
test=develop
6 years ago
minqiyang
cded24768c
Remove shared_ptr holder for VarBase
...
test=develop
6 years ago
minqiyang
c8d1a8e909
Change var_ and grad_ to shared_ptr
6 years ago
minqiyang
7aab39af15
Change grads to VarBase
6 years ago
Qiao Longfei
edad60e612
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
Xin Pan
7b73fc9e1a
Merge pull request #15089 from panyx0718/api
...
try unify Executor and ParallelExecutor
6 years ago
Xin Pan
9597fd05e9
polish
...
test=develop
6 years ago
Xin Pan
7aad6afd49
forward and backward
...
test=develop
6 years ago
Xin Pan
2349acea48
checkpoint
...
test=develop
6 years ago
Xin Pan
11d4d39cd7
forward working
...
test=develop
6 years ago
Xin Pan
b629133375
checkpoint runnable PyLayer
...
test=develop
6 years ago
peizhilin
a6f5ceee74
add the python callstack for debug support test=develop
6 years ago
Xin Pan
0d0bc61248
update api
...
test=develop
6 years ago
chengduo
eabb2105fa
Refactor MultiDevSSAGraphBuilder ( #15090 )
...
* Refactor ParallelExecutor
test=develop
* extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder
test=develop
* Refactor MultiDevSSAGraphBuilder
test=developt
* Remove enable_data_balance
test=develop
* code refine
test=develop
* remove data balance
test=develop
* refine ScaleLossGradOp
test=develop
* remove uncessary file
test=develop
* code refine
test=develop
* modify function name
test=develop
* follow comments
test=develop
* add is_distribution field
test=develop
* set is_distribution
test=develop
* fix DistSSAGraphBuilder
test=develop
6 years ago
Xin Pan
5e928e579a
try unify Executor and ParallelExecutor
...
test=develop
6 years ago
minqiyang
2547f9d1b8
Polish code
...
test=develop
6 years ago
minqiyang
09e2e66236
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
6 years ago
gongweibao
ce70229ba6
Add max_body_size flags to brpc ( #15084 )
6 years ago
minqiyang
6a5f604607
Support stop_gradients var in imperative backward
...
test=develop
6 years ago
minqiyang
9e3155e01d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
336160e651
Complete imperative optimizer implementation
...
test=develop
6 years ago
minqiyang
d4b9928c5a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
5822f7f1d8
Polish code
...
test=develop
6 years ago
Xin Pan
7b6bf9ddf2
make fill_constant kernel-based
...
test=develop
6 years ago
Xin Pan
61491ce250
clean
...
test=develop
6 years ago
Xin Pan
ce7e503cbe
refactor to avoid scope.
...
test=develop
6 years ago
Xin Pan
71a4a8e981
Merge pull request #15071 from wopeizl/revert/15035
...
Revert "cherry-pick the #12759"
6 years ago
minqiyang
68e9b841ab
Add support for optimizer
6 years ago
Zeng Jinle
9c6a0203e2
Merge pull request #15073 from sneaxiy/add_scope_pool
...
Add scope_pool
6 years ago
sneaxiy
3e917a934a
add scope_pool
...
add module cleanup
test=develop
6 years ago
peizhilin
2388d0e7d6
Revert "cherry-pick the #12759"
...
test=develop
This reverts commit 7f6d8acecb
.
6 years ago
peizhilin
7f6d8acecb
cherry-pick the #12759
...
test=develop
6 years ago
minqiyang
3de0f612e8
Polish code
...
test=develop
6 years ago
Zeng Jinle
95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
...
Featue/py_func op
6 years ago
minqiyang
1c116462cc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
29697c2e25
Add stop_gradient to VarBase to support loss function
...
test=develop
6 years ago
chengduo
550e7e410b
Code Clean parallel_executor.py ( #14849 )
...
* refine parallel_executor
* remove uncessary code
test=develop
6 years ago
minqiyang
3cd10a7c4f
Add Conv2D forward
...
test=develop
6 years ago
minqiyang
8d88c5a87d
Shameless copy
6 years ago
Xin Pan
6324032602
MLP forward backward
...
test=develop
6 years ago
colourful-tree
44ad2f4479
Merge pull request #14873 from colourful-tree/develop
...
add pslib(pserver) to paddle, an industrial scale high performance parameter server library
6 years ago
dzhwinter
7cd24b1318
add ir memory optimize. ( #14530 )
...
* follow comments. test=develop
* Fix typo
* fix compile error. test=develop
* merge develop branch. test=develop
* Remove set_equal
* Polish code
* Delete unused functions
test=develop
* polish code. test=develop
* follow comment
* polish code.
* fix windows compile error. test=develop
* fix op handle.
* rerun ci. test=develop
* rerun ci. test=develop
* rerun macci. test=develop
* polish code. test=develop
* rewrite sort code. test=develop
* remove unused code. test=develop
* fix tests. test=develop
* fix conflict. test=develop
* follow comment. test=develop
* merge develop branch. test=develop
* fix tests. test=develop
* remove ToTypeIndex. test=develop
* rerun ci. test=develop
6 years ago
guru4elephant
a79a3ea2f0
Merge branch 'develop' into develop
6 years ago
wopeizl
0f085f0a5a
Merge pull request #14892 from wopeizl/windows/port3
...
fix script issue
6 years ago
Yu Yang
bacf1d2399
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
gongweibao
0b1c7d838c
Add brpc serialization support. ( #11430 )
6 years ago
peizhilin
23dec78772
fix script issue
...
test=develop
6 years ago
dongdaxiang
e52bb816e3
add copyright to __init__.py in distributed folder
6 years ago
sneaxiy
f0df62f136
add more unittest case
...
test=develop
6 years ago
Yu Yang
4ecdb6f486
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
...
test=develop
6 years ago
sneaxiy
f6741df462
merge develop
...
fix bug
test=develop
6 years ago
heqiaozhi
f81957a753
refine cmake for pslib & pre_define
6 years ago
sneaxiy
deb0d41cea
fix cmake
...
fix cmake again
test=develop
6 years ago
sneaxiy
e7c5c9d2de
remove unnecesary code
...
test=develop
6 years ago
sneaxiy
db2daefe50
merge develop
...
test=develop
6 years ago
sneaxiy
ca84c2ca8f
merge develop
...
test=develop
6 years ago
sneaxiy
e240ba2918
implement backward
...
test=develop
6 years ago
Yu Yang
81520a24cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
6 years ago
Yu Yang
9bd70a1e04
Change tensor uses proto::VarType::type
...
test=develop
6 years ago
Qiao Longfei
1213e2838f
Merge pull request #14820 from jacquesqiao/fix-split-selected-rows
...
split selected rows op should always init output selected rows
6 years ago
sneaxiy
7923042365
merge develop
...
test=develop
6 years ago
Qiao Longfei
0a7c7e97af
test zero output of split_selected_rows_op
...
test=develop
6 years ago
Tao Luo
3243b45bb2
Merge pull request #14651 from luotao1/has_attr
...
add Opdesc's HasProtoAttr
6 years ago
heqiaozhi
016a068775
stop server
6 years ago
sneaxiy
8760d23c7d
featue/py_func
6 years ago
Tao Luo
067ed70f2d
add HasProtoAttr function in op_desc.h, clean node.h
...
test=develop
6 years ago
Xin Pan
748549b2e3
Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"
...
This reverts commit b1d3a1c8b4
, reversing
changes made to f1fb64b17f
.
6 years ago
Xin Pan
c049fa7cf7
Revert "Revert "Revert "Imperative"""
6 years ago
gongweibao
f1fb64b17f
Add reduce sparse tensor feature. ( #14757 )
6 years ago
heqiaozhi
575ae7c6c3
refine pslib inferface & fix some bugs
6 years ago
Xin Pan
2538ef64f1
Revert "Revert "Imperative""
6 years ago
Xin Pan
6217f42ab7
Revert "Imperative"
6 years ago
Qiao Longfei
daba57f752
complete ctr_reader
6 years ago
sneaxiy
64ad051b9a
merge develop
...
test=develop
6 years ago
sneaxiy
c47c451a00
fix bug
6 years ago
heqiaozhi
a77fa67bbd
async_thread_trainer & libmct & pslib.cmake
6 years ago
Xin Pan
b80fe8264a
polish
...
test=develop
6 years ago
Xin Pan
e5d64fd4d1
initial imperative
...
test=develop
6 years ago
Xin Pan
4d0df1fea7
add fields for autograd
...
test=develop
6 years ago
Xin Pan
8138391631
add OpBase and unify with VarBase
...
test=develop
6 years ago
Xin Pan
f6f0692451
clean up
...
test=develop
6 years ago
Xin Pan
aeb74af54c
allow operator to run imperatively
6 years ago
Xin Pan
b1f6fda5e5
run forward
6 years ago
Xin Pan
a6d23083f0
some tracing
...
test=develop
6 years ago
Xin Pan
dac92e560c
initial commit
6 years ago
Qiao Longfei
7f07dfa1a4
clean code
6 years ago
Wang Guibao
41e19eb431
AsyncExecutor ( #14627 )
...
* AsyncExecutor: C++ side
* Google naming conventions
* Rename MultiExecutor to AsyncExecutor
* pybind with async_executor
* Naming convention
* remove some flags and unused code
* add refactored file of async_executor and data_feed
* clear async executor interface and add data feed factory
* split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
* Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
* refine async_executor_refactor.cc
* add some files about datafeed
* Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
* Interface rework
* add MultiSlotDataFeed
* Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
* update data_feed for add MultiSlotDataFeed
* update datafeed and async_executor to run bow_net demo
* fix bug that finish_set_filelist failed in multithread
* delete finish_binding_memory_(flag), because it can not be marked under the current interface
* Fix bug
* update async_executor.py for support set_use_slots
* update async_executor.py for support set_use_slots and set set_dense_slots
* fix bug that when the number of files is less than the number of threads, it will fetch nan
* remove redundant code, and make executor exit when set a illegal queue size
* add batch_size check
* add MultiSlotDesc
* Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
* add some checkpoint in DataFeedDesc
* add CheckFile function in MultiSlotDataFeed
* update something error info
* fix deaded lock bug
* Fix fetch variable
* Merge error
* fix code style in async_executor
* using one lock blocking queue replace two lock blocking queue because of some bugs
* update code style
* add utest for data_feed
* Fix fetch var
* update utest for data_feed for multithread
* update SetFileList info
* fix bug in utest of data_feed
* Add comments for python
* Add comments for python code
* Fix pybind.cc with new pybind11 version
* add note for DataFeedDesc's set_use_slots function
* Add save_model
* update data_feed_test for multi-type
* add comment for executor_thread_worker
* Remove unused code
* update data_feed_test for generate test data file
* removed unnecessary interfaces and add comments
* c++ style check
* update data_feed.cc
* AsyncExecutor: C++ side
Google naming conventions
Rename MultiExecutor to AsyncExecutor
pybind with async_executor
Naming convention
remove some flags and unused code
add refactored file of async_executor and data_feed
clear async executor interface and add data feed factory
split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
refine async_executor_refactor.cc
add some files about datafeed
Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
add MultiSlotDataFeed
Interface rework
Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
update datafeed and async_executor to run bow_net demo
update async_executor.py for support set_use_slots
Fix bug
update async_executor.py for support set_use_slots and set set_dense_slots
fix bug that when the number of files is less than the number of threads, it will fetch nan
remove redundant code, and make executor exit when set a illegal queue size
add MultiSlotDesc
Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
add some checkpoint in DataFeedDesc
Fix fetch variable
fix code style in async_executor
Fix fetch var
add utest for data_feed
Add comments for python
update utest for data_feed for multithread
fix bug in utest of data_feed
Add comments for python code
Fix pybind.cc with new pybind11 version
add note for DataFeedDesc's set_use_slots function
update data_feed_test for multi-type
Add save_model
update data_feed_test for generate test data file
removed unnecessary interfaces and add comments
add comment for executor_thread_worker
Remove unused code
update data_feed.cc
c++ style check
* commit for code style
* commit for code style
* commit for code style
* commit for code style
* Comment away __init__ in async_executor.py
* clang-format fix test=develop
* use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
* commit for update code style
* commit for update code style
* Add async_executor demo; Remove some methods
test=develop
* commit for update code style
* commit for update code style
* commit for update code style
* update API.spec
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* Fix API.spec
test=develop
* Fix API.spec
test=develop
* Fix windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix code style
test=develop
* Fix code style
test=develop
* update datafeed
* Fix code style
test=develop
* update data_feed_test for test Tensor test=develop
* Fix code style
test=develop
* Fix windows build failure
test=develop
* Fix code style and windows build failure
test=develop
* Fix PYTHON3.5 build failure
test=develop
* AsyncExecutor API
test=develop
6 years ago
Qiao Longfei
75eba6108d
Add scope doc ( #14582 )
...
* add doc for scope
* update doc for force_init_on_cpu
test=develop
* follow comment test=develop
* update format test=develop
6 years ago
minqiyang
a02ce58f2c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
...
test=develop
6 years ago
wopeizl
db9284ecde
Merge pull request #14617 from wopeizl/windows/online
...
Windows/online
6 years ago
gongweibao
867c312bc4
Fix allreduce dependency order. ( #14586 )
6 years ago
peizhilin
511cc9024a
fix for build issue
6 years ago
minqiyang
be04d99fe4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
...
test=develop
6 years ago
minqiyang
53433d7f2e
Revert the changes of VLOG
...
test=develop
6 years ago
minqiyang
d2045260a5
Change visibilities of variant_visitor of pybind11
...
test=develop
6 years ago
minqiyang
b67229187e
Change to PYBIND11_MODULE because the deprecation of PYBIND11_PLUGIN
...
test=develop
6 years ago
minqiyang
81994e84e0
Change the include files because the version changes of pybind11
...
test=develop
6 years ago
sabreshao
61c5f13fcf
Fix cmake for AMDGPU platform ( #13801 )
...
* HIP cmake.
Enable whole archieve build for pybind library.
Disable two warning.
Rollback to C++11.
Link RCCL to WA gpu kernel loading issue.
Update eigen to fix build failure.
Add more include directories.
Fix O3 build failure.
Update eigen.
fix tensor_util_test segment fault issue
add more macro check in hip.cmake.
we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future.
Fix rocRAND load.
Update eigen to fix gru_unit_op and reduce_op.
Add HIP support to testing.
Update eigen to support int16 and int8 in arg min and arg max.
* add rocprim as cub library used by nv implementation
* Reduce build time in rocprim.
* Add rocprim introduction, remove useless cmake code.
* Remove useless flags and format cmake file.
6 years ago
Jiabin Yang
de2db11735
Merge pull request #14537 from reyoung/feature/fix_macos_ut
...
fix(Cpu): fix cpu compile and unittest
6 years ago
wopeizl
d9a1f3e58e
Windows/online ( #14474 )
...
* add recordio support
* disable the openblas multi-thread on windows since no support
adjust the python script
* code style
* code style
test=develop
* add create_recordio_file_reader back
* fix code style
test=develop
* fix the gtest.cmake on windows
* fix cc_test on windows
* fix the win build
test=develop
* remove fused compile support on windows
test=develop
* add the jit support
test=develop
* add the jit support, test=develop
* add the jit support, test=develop
* add the jit back
fix compile error on windows
* rollback test=develop
* test case fix
* disable DSO by default on windows
* exclude warpctc_op on windows
* exclude the dynload_warpctc out on windows
test=develop
* fix the scripts error
test=develop
* disable avx on windows by default
test=develop
* re-organize the cmake file
* disable mkl on windows by default
* add warp_ctc back
* fix the dependency
* fix the dependency
* fix the build issue on windows
* remove unsupported flag on windows
* code style
* code style
test=develop
* fix issue
* add profiler, parallel_executor back
* clean up the pre-definitions on windows
* fix build issue
* test=develop
6 years ago
Yu Yang
533c5d5803
fix(Cpu): fix cpu compile and unittest
...
test=develop
6 years ago
Yu Yang
a685f305f8
Merge pull request #14479 from reyoung/feature/fix_macos_ut
...
fix(Mac): fix unittest of macos
6 years ago
Yu Yang
98bbfc17be
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
Yu Yang
7486b0ddec
fix(Mac): fix unittest of macos
...
test=develop
6 years ago
Wu Yi
a2d9b34417
Refine operator cmake ( #14413 )
...
* wip simplify operator framework
* wip
* wip
* done test=develop
* clean test=develop
* fix test=develop
* fix deps test=develop
* fix cpu build test=develop
* fix tensorrt build test=develop
* fix tests test=develop
* fix test=develop
* fix cpu build test=develop
6 years ago
Yu Yang
c8f6e70ab4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
dzhwinter
d3aed98d86
Merge pull request #14320 from wopeizl/windows/online
...
Windows/online
6 years ago
peizhilin
1a9008c420
code style fix
...
test=develop
6 years ago
Xin Pan
bae3659714
more test
...
test=develop
6 years ago
Xin Pan
759ffca423
some improvements
...
test=develop
6 years ago
Xin Pan
99dffb91d6
allow to repeatedly share and update BuildStrategy
...
test=develop
6 years ago
Yu Yang
02631965c8
Refine
6 years ago
sneaxiy
d231e55065
merge develop
...
test=develop
6 years ago
peizhilin
ca60e1d34d
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
52f7644f53
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yu Yang
fdc689142c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
minqiyang
87450b9ad4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
6 years ago
chengduo
c5b6573a5a
Fix input<tensor> ( #14208 )
...
* fix input<tensor>
test=develop
* fix split_ids
test=develop
* ElementwiseMul should not support SelectedRows
* fix scale op
test=develop
* change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar()
* fix operator
* refine MultiOutput
* fix MultiOutput
test=develop
* disable test_dist_save_load
test=develop
* fix elementwise_op
test=develop
* add get_sparse_as_op
test=develop
* add info for check
test=develop
* rename get_sparse_as_op with extract_rows_as_op.
test=develop
* elementwise doesn't support selected_rows
* fix regularizer
* remove extract_rows_as
test=develop
* fix ci
test=develop
* add test for sum_op
* fix regularizer
test=develop
* test=develop
* fix pserver weight decay multi inputs test=develop
6 years ago
minqiyang
0c3227a523
Change the origin VLOG level to 10 times
...
Fix code to support cpplint syntax check
test=develop
6 years ago
peizhilin
869487a2b7
Merge remote-tracking branch 'origin/develop' into windows/build
6 years ago
Yu Yang
057a682ee9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Yu Yang
c28beb8a3c
test(Pe): add dry run tests for pe ( #14254 )
...
Dry run tests will skip `Op.Run` and just perform job scheduling. It helps to analysis dead lock in PE.
test=develop
6 years ago
peizhilin
1f12ba6192
gpu support, fix build issue:
...
1. Non utf-8 characters within comments of OPs may lead to protobuf fail to parse_from_string
2. comment out some ops which not supported on windows
3. cuda libs may not be correctly linked to target on windows
6 years ago
sneaxiy
faac8a76ce
remove unnecessary codes
...
test=develop
6 years ago
peizhilin
71d7980f69
fix build issue 1
6 years ago
peizhilin
9d67c1fb69
cpu build support
6 years ago
sneaxiy
2494ca83ab
test=develop
6 years ago
sneaxiy
cef8cc81db
merge develop
6 years ago
Yu Yang
c01696f8c2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
Qiao Longfei
d26ff8cb2d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu-for-1.1-merge-with-shape
6 years ago
Wu Yi
26200f2e42
[1.1] [project] train imagenet using large batch size ( #13766 )
...
* fix nccl2 lars dist support
* put lars in momentum op
* add tests lars
* fix ci
* fix cpu kernel
* soft warning
* remove lars in test_recognize_digits.py
* move to another op
* add file
* update api.spec test=develop
* update test=develop
* fix api.spec test=develop
* wip
* wip, finish grad merge ops
* wip, finish graph build
* wip test running
* work on 1 gpu
* workable version
* update
* fix tests
* fuse broadcast op
* fix compile failed
* refine
* add batch merge test mnist
* fix CI test=develop
* fix build
* use independent bn params for batch merge test=develop
* update api.spec
* follow comments and for test
* wip
* refine tests test=develop
* follow comments test=develop
* remove startup bn modify test=develop
* follow comments test=develop
* fix merge test=develop
6 years ago
sneaxiy
a314a80cdb
merge develop
6 years ago
seiriosPlus
06de824ba8
fix shape in floats
6 years ago
tangwei12
755927d2b0
shape type to int64_t, test=develop
6 years ago
tangwei12
8b7f45a889
add longs in framework
6 years ago
Yu Yang
461f71a90b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
sneaxiy
2002e71da8
fix pinned allocator
6 years ago
Qiao Longfei
681226e97c
Merge pull request #13864 from jacquesqiao/py-reader-add-test-mode
...
reader block queue add test mode
6 years ago
Qiao Longfei
60030e8678
change the use of FLAGS_reader_queue_speed_test_mode
...
test=develop
6 years ago
Tao Luo
323d67cfc1
Merge pull request #13879 from panyx0718/doc
...
fix doc
6 years ago
chengduo
2562eb92b8
Add strategy doc ( #13849 )
...
* add strategy doc
test=develop
* fix doc
test=develop
* add ParallelExecutor arg doc
test=develop
6 years ago
Xin Pan
b11372a0af
fix doc
...
test=develop
6 years ago
Qiao Longfei
8686f7c68e
add reader_queue_speed_test_mode flag for speed test
6 years ago
sneaxiy
d87569134c
test=develop
6 years ago
Xin Pan
63b2e98f3d
Explain LoD and a few other concepts
...
test=develop
6 years ago
chengduo
e1904ac2c8
Add doc ( #13765 )
...
test=develop
6 years ago
sneaxiy
4c672ab1a2
Merge reyoung:rewrite_allocation
6 years ago
Yu Yang
15076c325e
Add comments and polish code style
6 years ago
Yu Yang
29f66c2408
Polish code
6 years ago
Yu Yang
6ca37448ac
Refine prelu_op
6 years ago
Yu Yang
186b2b138d
Revert "Merge pull request #13201 from reyoung/revert_callstack" ( #13697 )
...
This reverts commit 21bb9e91fc
, reversing
changes made to 3fa68dc101
.
test=develop
6 years ago
Yu Yang
ae9378f640
Refine PyBind
6 years ago
Yu Yang
a1a01899c8
Refine
6 years ago
Yu Yang
31270e58d0
Add communication attr
6 years ago
Xin Pan
ddd60581b7
clean up channel
...
test=develop
6 years ago
Xin Pan
7cd2761736
Merge pull request #13416 from panyx0718/ir
...
PassBuilder
6 years ago
Wu Yi
16e73e0d28
hide operator API ( #12543 )
...
* hide operator API
* update
* update api.spec
* fix merge
* fix test
6 years ago
Xin Pan
cbdf9833e3
hide create_passes_from_strategy for now
6 years ago
Xin Pan
f117feab0c
modify comments
6 years ago
Yu Yang
7119d6c3cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_callstack
6 years ago
Xin Pan
b43e49fa31
fix
6 years ago
Xin Pan
36c2a9af27
pass builder allow cutomize pass in python.
6 years ago
Wu Yi
29c63d180f
[Feature] dist op role and lr op role, to support memory optimize with dist training ( #13220 )
...
* wip
* clean up
* should fix running with memopt
* add ut
* mark lr schedule op role
* hide lr_schedule_guard
* use op_role_var instead of ufind
* unify dist test name
* wip for py3 support
* fix var deref
* fix python3 mem_opt order
* remove comments
6 years ago
chengduo
d402234ba8
Feature/op_fuse_pass ( #12440 )
...
* Add Preface
* Add demo code
* Save file
* Refine code
* seems can work
* use elementwise strategy
* Use ElementwiseComputeEx
* Add comments
* extract functions from operator
* Refine code
* Follow comment
* code refine
* add op_fuse pass
* add backward
* code refine
* use TopologySortOperations
* follow comments
* refine IsFusible
* code enhance
* fix op_fusion_pass
* refine code
* refine fuse_elemwise_act_op
* adjust the input and output
* refine logic
* add intermediate_edge
* disable inplace
* follow comments
* refine logic
* follow comments
* Remove the removable IntermediateOut
* change strategy
* code refine
* enable fuse backward
* code refine
* code refine
* rename unit test
* follow comments
6 years ago
Xin Pan
5ce77889f4
clean unused inference_optimize c++ implementation
6 years ago
Yancey1989
5ce1a960a5
move bcast op into pass
7 years ago
Xin Pan
0904f07d46
polish
7 years ago
Xin Pan
56a977d436
add test
7 years ago
Xin Pan
926e1077ca
version
7 years ago
Yu Yang
cda7842e26
Revert "Revert "Add Python Callstacks when Op::Run error ( #12759 )""
...
This reverts commit 1f270275a6
.
7 years ago
Xin Pan
51ef0ad766
allow to use name_scope for debugging and visiualization
7 years ago
qingqing01
1f09bc320c
Support data type int8_t . ( #12841 )
...
* Support int8 type.
7 years ago
dzhwinter
eca4563e5d
operators module ( #12938 )
7 years ago
guochaorong
1f270275a6
Revert "Add Python Callstacks when Op::Run error ( #12759 )"
...
This reverts commit b2df17003f
.
7 years ago
Wu Yi
b8da70c370
Resovle multi gpu async deps ( #12828 )
...
* dist transpiler add control dependency var between send and recv
* fix async deps
* follow comments and refine
* fix deps connect for rpc ops
7 years ago
Yu Yang
b2df17003f
Add Python Callstacks when Op::Run error ( #12759 )
...
* Add Python Callstacks when Op::Run error
* Skip op with sub-block
* refactor: refine callstack info's format
* Reshape only support matrix
* Polish Python code
* Fix UT
* Fix Py3
7 years ago
dzhwinter
e23ddf6ae4
status ( #12764 )
7 years ago
Yu Yang
c6af7201e9
Merge pull request #12692 from reyoung/feature/fast_executor
...
Feature/fast executor
7 years ago
Qiao Longfei
653fad08f8
Optimize selected rows for dist lookup table with pthread rwlock ( #12635 )
...
Optimize selected rows for dist lookup table with rwlock
7 years ago
yuyang18
d1d825ee02
Hide unnecessary API
7 years ago
yuyang18
265302edea
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fast_executor
7 years ago
minqiyang
77f12e000f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
gongweibao
842fb021b3
Fix clone() bug. ( #12583 )
7 years ago
minqiyang
e0d5f8a820
Move compat module to python/paddle
7 years ago
minqiyang
1800fef142
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
yuyang18
05cadf1b24
Add FastExecutor
7 years ago
Wu Yi
8b77448d5f
hide misc APIs ( #12540 )
...
* hide misc APIs
* update
* fix transformer test
* update API.spec
7 years ago
minqiyang
6abe819f07
Fix pybind11 problem
...
Fix str and bytes problem
Fix sorted problem
Fix math problem
Fix CI problem
7 years ago
minqiyang
559d36328c
Apply 2to3 to current paddle main python code
7 years ago
Yancey1989
d4f51218ef
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
Yancey1989
efd5a84986
update executor interface
7 years ago
Tao Luo
3694fd5c4a
Merge pull request #12109 from emailweixu/cpu_only
...
Fixed unittests for WITH_GPU=OFF and WITH_DISTRIBUTE=OFF build
7 years ago
Yancey1989
7570d8e77c
add rpc complete interface
7 years ago
Wu Yi
db67d60e31
Remove block api ( #12107 )
...
* remove block api
* remove clone_variable
* hide block inner apis
* update
* fix tests
7 years ago