fengjiayi
335e4b9d52
fix a dead lock bug
7 years ago
sneaxiy
f85e16f1de
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into complete_py_reader_python
7 years ago
yuyang18
c822d0309b
Refine code
7 years ago
yuyang18
7d6afee5c1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exception_safe_pe
7 years ago
yuyang18
0a445da631
Make scope_buffered_ssa_graph_executor Exception safe
7 years ago
fengjiayi
26ae6111d1
Merge pull request #12051 from JiayiFeng/dev_reader_ResetAll
...
[WIP] Dev reader reset all
7 years ago
fengjiayi
d55919c656
Impl ResetAll and fix errors
7 years ago
gongweibao
091ab63331
Fix singleton. ( #11835 )
7 years ago
fengjiayi
6d6f49cd56
Merge remote-tracking branch 'yuyang/feature/decorated_reader_chain' into dev_reader_ResetAll
7 years ago
fengjiayi
611716e9bc
Merge branch 'dev_reader_shutdown_start' of https://github.com/JiayiFeng/Paddle into dev_reader_shutdown_start
7 years ago
fengjiayi
0e9f1e2790
Make ReaderBase thread safe and remove ThreadedReader
7 years ago
yuyang18
e8ee9dc7f8
Several Polish
7 years ago
yuyang18
0d2ccfbd3c
Remove atomic
7 years ago
fengjiayi
b4f0e57956
fix errors
7 years ago
yuyang18
62c1133f42
Add mutex for decorated_chain
7 years ago
Tao Luo
436bb4500b
Merge pull request #11699 from pzelazko-intel/pzelazko/workaround-for-missing-mklnn-kernels
...
workaround for no MKLDNN kernel
7 years ago
fengjiayi
5528f59900
Split ReInit() to Shutdown() and Start()
7 years ago
fengjiayi
de9a411f1c
adjust readers' inheritance relationships
...
1. Make PyReader and RandomDataGenerator inherited from FileReader.
2. Remove the member variable 'dims_' and related checks in FileReader.
7 years ago
yuyang18
c48c586aca
Use weak_ptr to implement DecoratedReaderChain
7 years ago
yuyang18
2bbe5f77e7
Add GetEndPoints of Reader.
...
We can get endpoints of a reader chain.
7 years ago
Paweł Żelazko
05ffc1331e
log for fallback added
7 years ago
Yu Yang
de456e74ae
Merge pull request #11913 from reyoung/feature/remove_clone_method
...
Remove Op::Clone method
7 years ago
yuyang18
8e86721fe7
Fix data balance on single GPU
7 years ago
dzhwinter
4ed0b62476
Move fluid::framework::InitDevices into fluid::platform ( #11757 )
...
* move to platform
* "move init from framework to platform"
* "remove used init"
* "fix ci"
* "fix ci"
* "fix generic"
* "fix ci"
* "fix ci"
* "fix ci"
* "disable fragile test"
7 years ago
fengjiayi
ce16b40b04
Merge pull request #11891 from JiayiFeng/dev_eof_exp
...
Add EOFException to represent EOF in C++ reader
7 years ago
yuyang18
4e4438a8aa
Remove Op::Clone method
...
It is used by NetOp before.
7 years ago
yuyang18
2d0e5592b5
Use std::map for Place <--> DeviceContext
7 years ago
Yancey
f7fd711e3f
Merge pull request #11868 from Yancey1989/dist_pass_barrier
...
add dist pass barrier
7 years ago
Yu Yang
9401b64d61
Merge pull request #11877 from reyoung/feature/fix_reshape_op_size
...
User can register a standard C++ functor as Kernel
7 years ago
fengjiayi
3fab4f65a4
Add EOFException to represent EOF in C++ reader
7 years ago
Paweł Żelazko
ac323343a0
typos fix
7 years ago
Paweł Żelazko
bf0090a1ef
workaround for missing MKLDNN kernel
7 years ago
yuyang18
82866d4a18
Add register kernel functor and shrink reshape op
...
* Shrink reshape_op library size
* User can register a standard C++ functor as a op kernel
7 years ago
fengjiayi
58560622bc
Merge pull request #11854 from JiayiFeng/dev_data_balance
...
Data balance for the ParallelExecutor
7 years ago
yuyang18
75ae426aa5
Merge branch 'feature/change_op_kernel_to_func' into feature/fix_reshape_op_size
7 years ago
yuyang18
3b00ed81a9
Make Kernel registered as a function
7 years ago
yuyang18
1ce478f100
Polish reshape op
7 years ago
Yancey1989
37410a0c75
update by comment
7 years ago
sneaxiy
3f9292c6e6
fix merge conflict
7 years ago
sneaxiy
dd70fb4393
fix type comparison bugs
7 years ago
fengjiayi
ff4317cee9
follow comments
7 years ago
Yancey1989
c1ab215e26
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into dist_pass_barrier
7 years ago
Yancey1989
1366832a41
add dist pass barrier
7 years ago
gongweibao
66c91911cf
Improve brpccmake ( #11842 )
7 years ago
Yan Chunwei
28172bbb8e
add debug to replacing enforce with GLOG for debug ( #11244 )
7 years ago
guochaorong
c318aa5ffa
Merge pull request #11850 from guochaorong/revert_11496
...
Revert "Extend fill_zeros_like_op for zero-filling an LoDTensorArray …
7 years ago
fengjiayi
49a04d75ee
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_data_balance
7 years ago
fengjiayi
4b950951d3
Add unittests and fix a few bugs
7 years ago
chengduo
7b54f16855
Follow comment ( #11845 )
7 years ago
Tao Luo
b0378963d0
Merge pull request #11666 from mozga-intel/mozga-intel/Batch_norm_support_other_type
...
The mkldnn batch norm supports other data format
7 years ago
guochaorong
6a35899131
Revert "Extend fill_zeros_like_op for zero-filling an LoDTensorArray ( #11496 )"
...
This reverts commit bc28cf613f.
7 years ago
gongweibao
c2165ffa7b
Fix codestyle ( #11836 )
7 years ago
chengduo
8d76cf397d
Fix TensorCopy bug ( #11822 )
...
* Fix tensorcopy bug
* follow comment
* Refine TensorCopy
7 years ago
fengjiayi
ba5380129c
Merge branch 'fix_Mac_compile_errors' into dev_data_balance
7 years ago
fengjiayi
077434c26c
fix CPU compile error
7 years ago
fengjiayi
47388020a2
fix bugs
7 years ago
yuyang18
5e23a5ec18
Rename TransferData -> TransformData
7 years ago
yuyang18
5999201346
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refactor_op
7 years ago
chengduo
6711b7b5f1
fix FeedAndSplitTensorIntoLocalScopes ( #11817 )
7 years ago
fengjiayi
2e320079d3
fix bugs
7 years ago
fengjiayi
5b4f283069
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_data_balance
7 years ago
fengjiayi
b6dc3a59f1
Add DataBalanceOpHandle to MultiDeviceSSAGragh
7 years ago
mozga-intel
61c54dbbe7
Remove additional function of the code
7 years ago
Haichao Zhang
bc28cf613f
Extend fill_zeros_like_op for zero-filling an LoDTensorArray ( #11496 )
...
* Add fill_zeros_array op. This op is used for zero-filling an LoDTensorArray.
* merge fill_zeros_array_op with fill_zeros_like_op
* add unit_test for fill_zeros_like for array
7 years ago
yi.wu
8d04d0e2a3
update
7 years ago
yi.wu
6f0107126a
fix broadcast bug
7 years ago
yuyang18
9faf5a39c5
Refactor Operator.cc, and clean code
7 years ago
yi.wu
88cb47bd86
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_paraexe_bcast
7 years ago
yi.wu
8e48c77b54
wip
7 years ago
yi.wu
3d69a82b83
fix dist train broadcasting bug
7 years ago
chengduo
bb29800aaa
small refine ( #11460 )
7 years ago
Tomasz Patejko
e26f51ce74
MKLDNN elementwise_add with default broadcast operations ( #11544 )
...
* elementwise_add with bcast: Brian's implementation by Brian added, with default bcasts
* elementwise_add with bcast: GetExpectedKernelType added to elementwise_op
* elementwise_add with bcast: use_mkldnn attribute added
* elementwise_add with bcast: changes after review and some formatting
* elementwise_add with bcast: changes after style check
* elementwise_add with bcast: changes after style check cont.
* elementwise_add with bcast: MKLDNN unittests added
* elementwise_add with bcast: original unittests with use_mkldnn flag
* elementwise_add with bcast: handling of MKLDNN format corrected
* elementwise_add with bcast: setting MKLDNN format turned into lambda
* elementwise_add with bcast: MKLDNN format setting turned into separate function
* elementwise_add with bcast: condition for choosing MKLDNN simplified
* elementwise_add with bcast: fix for MKLDNN format set incorrectly in bcasts
* elementwise_add with bcast: changes in unittests for broadcasts
* elementwise_add with bcast: fixes in unittests regarding dimensions
* elementwise_add with bcast: bring back correct format setting in mklml grad path
* elementwise_add with bcast: fixed compilation error
7 years ago
yi.wu
254154a9bf
fix sparse paraexe dist train
7 years ago
Yancey
0d4b376fc0
Merge pull request #11585 from Yancey1989/fix_pserver_sub_blocks
...
fix pserver sub-blocks
7 years ago
Yancey1989
8cb494f79c
add blocks attr type in proto
7 years ago
Yancey1989
56a903d3ac
use optimize block list instead of first optimize block
7 years ago
fengjiayi
e45a55523b
Merge pull request #11639 from JiayiFeng/fix_mac_compile
...
fix mac compile
7 years ago
chengduo
da556ed6d4
enhance ParallelExecutor stable ( #11637 )
7 years ago
Kexin Zhao
073af6237a
add print lod_tensor int64 option ( #11644 )
7 years ago
fengjiayi
964f515e9a
fix mac compile
7 years ago
Yu Yang
9b3f48d7e6
Merge pull request #11616 from chengduoZH/fix_parallel_exe
...
Enhance Parallel Executor stable
7 years ago
chengduoZH
c99fca5f90
Add No Mutex
7 years ago
chengduoZH
13de72388d
Fix broadcast
7 years ago
Xin Pan
0151e4eb3d
Merge pull request #11608 from panyx0718/doc
...
small thread-safety fix and doc improvements.
7 years ago
chengduoZH
28a86aebc3
Fix Parallel Exe (VarHandle's version)
7 years ago
Yancey
c413c46915
Merge pull request #11603 from Yancey1989/rename_details_folder
...
move dist codes from operators/detail to operators/distributed
7 years ago
Yancey
9cc1eb434e
Merge pull request #11221 from Yancey1989/overlap_memcpy_with_dist
...
overlap rpc op memcpy in distributed training
7 years ago
Xin Pan
df31926fcf
small thread-safety fix and doc improvements.
7 years ago
Yancey1989
1ef6cdb60e
move dist codes from operators/detail to operators/distributed
7 years ago
Qiao Longfei
05a92779f4
Merge pull request #11524 from jacquesqiao/add-keep_kids-for-executor
...
add keep_kids flag for executor
7 years ago
Yancey1989
7e6518e8ca
fix compile warning
7 years ago
Yancey1989
7d1b146939
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Qiyang Min
a29cb4be2a
Fix decay bug ( #11520 )
...
* Add sub_blocks of lr_decay_op to pserver_prog after distribute_transpiler
* Remove unused logs and logics
* 1. Add ops to new block (considering the nested block condition)
2. Follow the original hierarchy of blocks
3. Change the function's name and remove debug lines
7 years ago
qiaolongfei
2b1ecdf55a
update comment
7 years ago
qiaolongfei
daa0fbd5f4
add keep_kids flag for executor
7 years ago
tensor-tang
609dccfb55
Merge pull request #11395 from tensor-tang/fix
...
remove mkldnn flag from gtest strdup for cpu
7 years ago
Kexin Zhao
417fcf4f43
Modify Pybind LoDTensor API according to length-based LoD ( #11106 )
...
* add lod_tensor util and modify pybind
* refine pybind LoDTensor API and modify LoDTensor and DataFeeder test
* fix test error
* fix detection map op test
* fix reorder_lod_tensor test
* fix seq_concat_op
* fix chunk eval op test
* fix target assign op
* fix warp ctc op
* address comments step 1: reverse reset_lod op
* step 2: modify op test
* add warning message
* remove has_valid_lod
* add back has_valid_lod
* address comments
* add exception catching trial
7 years ago
tensor-tang
0ddc5d8631
Merge pull request #11258 from tensor-tang/refine
...
Refine test and scope lock
7 years ago
tensor-tang
6c1cf60950
Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang
c453573286
Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang
3e58df20df
initial with only 1 mkl/openblas threads for each pthreads
7 years ago
Qiyang Min
046bb5c8cb
Fix NCCLBcast hang up bug in Parallel Executor ( #11377 )
...
* 1. Create buddy allocator in each places before NcclBcast the variables
2. Check the memory usage of ALL gpus rather than the first one
* 1. Make NCCLGroupGuard guard only the ncclBcast part, which prevents ncclGroupEnd from blocking exception throwing
2. NOTE the usage of NCCLGroupGuard
* Remove the memory usage check of gpus
* Fix code style
7 years ago
tensor-tang
85c3bfc1f9
Merge remote-tracking branch 'ups/develop' into fix
7 years ago
Yancey1989
3d875b693f
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Yang Yang(Tony)
d827c6e87a
Dynamic Graph first prototype ( #11415 )
7 years ago
qiaolongfei
7ebef493d5
add row_size for selected rows in DebugStringEx
7 years ago
qiaolongfei
cdd55dbcaa
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei
82416f1844
fix concurrency_test build error on mac
7 years ago
qiaolongfei
9ebbfa6bbc
fix build on mac
7 years ago
Yancey1989
f52d78d189
update by comment
7 years ago
fengjiayi
88fa9c2ec7
Merge pull request #11267 from JiayiFeng/fix_reader_bug
...
Fix a multi-thread bug in readers
7 years ago
tensor-tang
6602db5b3e
throw warning if try to use mkldnn while not compiled
7 years ago
Yancey1989
6d752bafd8
use get_appropriate_dev to schedule rpc op
7 years ago
qiaolongfei
f031555cfb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
Qiao Longfei
431491a2a8
Merge pull request #11366 from jacquesqiao/refine-prefetch
...
Refine prefetch
7 years ago
Wu Yi
34865f2de3
Trainer send term signal ( #11220 )
...
* wip
* use executor.complete to end trainer
* fix build
* fix build with distribute off
* fix typo
* fix cmake typo
* fix build
7 years ago
Yancey1989
4444e79e46
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
qiaolongfei
6dd3f3cf27
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei
16658f7b59
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
chengduo
183377f410
Merge pull request #11306 from chengduoZH/enable_cpu_on_pe
...
Enable CPU on Parallel executor
7 years ago
qiaolongfei
83a577e8ce
fix build problem
7 years ago
dzhwinter
bfa3fd6f15
add inplace attribute to op_proto_maker ( #10665 )
...
* "add inplace attribute"
* "register inplace attribute"
* "change se-next model for memory-reuse"
* "fix typo"
* repick
* fix merge conflict
* "fix stupid error"
7 years ago
gongweibao
9087c6687f
polish ( #11363 )
7 years ago
chengduoZH
173d72b481
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enable_cpu_on_pe
7 years ago
chengduoZH
aadaadf735
replace use_event with use_cuda: use_event means the program is running with CUDA, so use_cuda may be more intuitive.
7 years ago
gongweibao
627d7a64f8
Clean `sendop` `recv` operator. ( #11309 )
7 years ago
chengduo
fa29ef0b0d
Merge pull request #11277 from chengduoZH/check_ssa_graph
...
Check SSA Graph
7 years ago
chengduoZH
961fbce8e2
follow comments
7 years ago
chengduoZH
7b723839ef
Add cpu test for parallel_executor_crf executor_fetch_feed, and enable these tests
7 years ago
chengduoZH
d24e046c1e
fix allReduce bug
7 years ago
chengduoZH
a57e8a4338
add cpu test
7 years ago
qiaolongfei
0485405b3d
add more debug string
7 years ago
Luo Tao
79d555b9f2
Merge branch 'develop' into mkldnn
7 years ago
gongweibao
062d5a56b4
Add comments to a singleton. ( #11333 )
7 years ago
chengduoZH
1e731f5964
small fix
7 years ago
chengduoZH
495368c243
ADD CPU_NUM
7 years ago
chengduoZH
27073c284d
nccl_all_reduce_op_handle => all_reduce_op_handle
7 years ago
chengduoZH
2d94697a82
code refine
7 years ago
chengduoZH
5a3c8bf813
fix in c++ side
7 years ago
Luo Tao
c6d230e03e
add FLAGS_use_mkldnn to global control use_mkldnn
7 years ago
fengjiayi
d745840a6e
fix a small compile error on Mac
7 years ago
chengduoZH
0c851cab22
add SSA graph checker
7 years ago
Yancey1989
d5a88b9340
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
chengduoZH
1076e85135
refine logic
7 years ago
Yancey
0aa9546eed
fix dist train error ( #11281 )
...
* fix dist train error
* update by comment
7 years ago
tensor-tang
b8d315fb69
make scope thread safe
7 years ago
dzhwinter
d48172f22a
split reduce op into multiple libraries, accelerate the compiling ( #11029 )
...
* "split into multiple .ccl"
* "refine file structure"
* "refine files"
* "remove the cmakelist"
* "fix typo"
* "fix typo"
* fix ci
7 years ago
fengjiayi
5803115720
Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext
...
fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
tensor-tang
bfd42683ca
Merge remote-tracking branch 'ups/develop' into refine
7 years ago
dzhwinter
f7c96f079b
Big data op_test benchmark, for checking output consistent in different runs. ( #10646 )
...
* "init benchmark ops"
* "untrack outputs"
* "delete some unused code"
* "benchmark"
* "fix ci"
* "fix op test"
* "fix uint16 missing"
* "fix ci"
* "follow comments"
* "fix ci"
* "follow comments"
* "conflicts. merge develop branch"
* repick
* "merge develop branch"
7 years ago
fengjiayi
dc8e0b494d
fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
mozga-intel
3ff9ba0e6b
Mkldnn layout ( #11040 )
...
* Add MKLDNN layout support in Paddle
Add MKLDNN layout in Paddle so that an MKLDNN-friendly memory layout
can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
was hard-coded in all MKLDNN op kernels. As a result, a
non-optimized execution path was selected in the MKLDNN primitive,
which gave worse performance.
Besides the framework change, three MKLDNN OP kernels were updated
to use the new MKLDNN layout: conv, pool2d, and batch_norm.
Other MKLDNN OP kernels also need to be updated in a similar way to
achieve the best performance.
* Add MKLDNN layout support in activation OP
* Don't populate layout from input to output when kMKLDNN in
* Refine pool mkldnn op kernel
* MKLDNN layout
* Remove the inheritance from tensor file
* MKLDNN layout: refactoring
* Remove additional #define to register new operator
* Prepare mkldnn tests to work with layout
7 years ago
chengduoZH
8291b916d6
replace graph_builder_factory with ssa_graph_builder_factory
7 years ago
chengduoZH
9ac785be39
check graph's validation
7 years ago
fengjiayi
a1e046bfc0
Merge pull request #11270 from JiayiFeng/fix_a_error_on_max
...
fix a compile error on Mac
7 years ago
fengjiayi
499dbe0536
fix a multi-thread bug in readers
7 years ago
fengjiayi
7344210070
Merge branch 'fix_a_error_on_max' into fix_reader_bug
7 years ago
fengjiayi
2f5e310167
fix a compile error
7 years ago
yuyang18
8149b0a9aa
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fuse_var_op_handle
7 years ago
Yancey1989
23433def4b
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
fengjiayi
5d45793936
hot fix
7 years ago
tensor-tang
0693b4148d
Merge remote-tracking branch 'ups/develop' into refine
7 years ago
Yu Yang
ff9b1a0f95
Merge pull request #11234 from reyoung/feature/refine_code
...
SSA Graph Builder Factory
7 years ago
Yu Yang
08823146ec
Merge pull request #11232 from reyoung/feature/extract_tensor
...
Extract method from tensor_impl.h to tensor.cc
7 years ago
tensor-tang
f326b0117e
refine scope lock
7 years ago
Yancey1989
15913d92c5
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
tensor-tang
4b7b17a84f
fix conflicts
...
Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
yuyang18
d9af153232
SSA Graph Builder Factory
...
* Use Builder Chain to decorate new builders. It is easy to extend
builders.
* Make graphviz path as a build strategy, not a FLAGS.
7 years ago
yuyang18
b6c8701e45
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/extract_tensor
7 years ago
chengduoZH
a584bc86dd
add fuse var op handle
7 years ago
Xin Pan
106ee9d1cc
Merge pull request #11243 from panyx0718/scope
...
small clean up and document pointer ownership.
7 years ago
tensor-tang
64323b1caf
Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
Xin Pan
73aa5d230b
small clean up and document pointer ownership.
7 years ago
Yancey1989
e533a4b4ab
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
tensor-tang
4ae935e2cf
refine the lock in scope
7 years ago
Yancey1989
cb3861538d
fix compile failed with CPU
7 years ago
Yu Yang
c36dd3b338
Merge pull request #11114 from reyoung/feature/yep
...
Try to speed up parallel executor
7 years ago
tensor-tang
9b34f8dabd
fix abort issue in cpu multi-threads
7 years ago
yuyang18
fc9f2d282f
Extract method from tensor_impl.h to tensor.cc
7 years ago
Yancey1989
82d741c4b9
fix op name typo
7 years ago
Yancey1989
6d69ae0c6e
code cleanup
7 years ago
Yancey1989
93401c98e1
overlap rpc op memcpy in distributed training
7 years ago
Qiao Longfei
238124909e
fix protobuf memory leak ( #11177 )
...
fix protobuf memory leak
7 years ago
sneaxiy
02cc80b30d
Fix dangling pointer bug
7 years ago
yuyang18
86a61c177f
Add ScopeBufferedSSAGraphExecutor
7 years ago
fengjiayi
d6997e5bc8
Merge pull request #11083 from JiayiFeng/dev_refine_programdesc_copy
...
Refine ProgramDesc copy
7 years ago
chengduo
01bbe532da
Merge pull request #11079 from chengduoZH/balance_parameter_update
...
Balance parameter opt
7 years ago
fengjiayi
a3aca2a3cf
fix bugs
7 years ago
Tao Luo
47f3cae2af
Merge pull request #11038 from sneaxiy/zjl-dev
...
Fix bug in "paddle/fluid/framework/tensor_impl.h" and inefficient code in "paddle/fluid/framework/reader.cc"
7 years ago
fengjiayi
2a3c58d3fe
refine programdesc copy
7 years ago
chengduoZH
e330cd032e
balance parameter update
7 years ago
Yu Yang
14905516aa
Merge pull request #10970 from JiayiFeng/dev_add_random_crop_op
...
Add random crop op
7 years ago
fengjiayi
3bce3dbce1
fix a bug
7 years ago
Qiao Longfei
654f5d3c91
Merge pull request #11012 from jacquesqiao/add-auto_grown_mutex
...
add auto_grown_mutex for selected rows
7 years ago
sneaxiy
46f13237dc
Fix bugs in framework/tensor_impl.h and polish framework/reader.cc
7 years ago
qiaolongfei
fa2079b71a
add mutex header
7 years ago
qiaolongfei
167312484b
use unique ptr to hold auto_grown_mutex_ for default copy and move constructor
7 years ago
qiaolongfei
add9ed3388
change order of member
7 years ago
Qiao Longfei
ab953baec6
Merge pull request #10973 from jacquesqiao/fix-prefetch
...
Fix and optimize async distribute lookup table
7 years ago
qiaolongfei
e3c041d319
add auto_grown_mutex for selected rows
7 years ago
chengduo
8c54f1fb9c
Merge pull request #10906 from chengduoZH/fix_data_trans
...
Fix DataTransFunc
7 years ago
chengduo
7d1332f696
Merge pull request #11006 from chengduoZH/fix_add_check_nan_inf_in_operator
...
Move check_nan_inf to operator
7 years ago
qiaolongfei
74701b2692
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-prefetch
7 years ago
chengduoZH
cb1c657ce4
move check_nan_inf to operator
7 years ago
chengduoZH
17a076d8ed
replace TensorCopy with TensorCopySync
7 years ago
Yancey1989
6b91d407de
revert variable mutex
7 years ago
Yancey1989
20c24c05aa
singleton rpc_client
7 years ago
Yancey1989
28596a3386
add gflag ssa_graph_path
7 years ago
qiaolongfei
25f47fc0af
fix prefetch bugs, optimize code
7 years ago
Yancey1989
ad6c0142c4
clean up codes
7 years ago
Yancey1989
268e9dc1c6
polish code
7 years ago
Yancey1989
ceefbf3259
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
chengduoZH
fa613206e4
update
7 years ago
Yancey1989
b348e159f2
fix rename var
7 years ago
fengjiayi
cd8700f172
Merge pull request #10872 from JiayiFeng/dev_CustomReader
...
CustomReader
7 years ago
Yancey1989
0aa6f9e934
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
chengduoZH
4bfadcd136
fix data trans
7 years ago
Yu Yang
d406c76a30
Merge pull request #10744 from reyoung/feature/refine_parallel_executor
...
Disable and fix tests on multi devices.
7 years ago
Yancey1989
540b45350d
use req_count as atomic type
7 years ago
yuyang18
ad6a8715ff
Disable unstable unittests
7 years ago
Yancey1989
6debbcd9f9
connect fetch barrier and concat op
7 years ago
yuyang18
3b04f0099c
Fix bug
7 years ago
yuyang18
39a44e3834
Add Debug log
7 years ago
yuyang18
fb370f4411
Refine code
7 years ago
yuyang18
03e4da6d04
Fix bug
7 years ago
Yancey1989
147d54ba62
update
7 years ago
Yancey1989
e9abc66910
fix pe
7 years ago
yuyang18
c978259027
Add default value of op_role
7 years ago
yuyang18
0e5a278c23
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/support_op_role
7 years ago
fengjiayi
983c9a2a30
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CustomReader
7 years ago
Yancey1989
952fa04009
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yancey1989
62af10d440
support multiple devices
7 years ago
dzhwinter
5828101c23
make uint8 support in data_type transform and memory optimize ( #10715 )
...
* "a piece of job."
* "fix typo"
* "fix ci"
7 years ago
Yancey
7ae03ec08b
fix lack split_byref op in parallel executor ( #10770 )
7 years ago
Wu Yi
ebc7303990
listen_and_serv use local scope ( #10663 )
...
* listen_and_serv use localscope
* fix ut
7 years ago
chengduo
93c470034a
fix DataTransFunc ( #10752 )
7 years ago
Yu Yang
14248a64d7
Fix hang when input is duplicated ( #10709 )
7 years ago
Yu Yang
e528862714
Merge pull request #10683 from reyoung/feature/tensor_support_uint8
...
Make tensor support uint8
7 years ago
Yu Yang
8b1b7564cf
Merge pull request #10671 from chengduoZH/fix_fetch_op_handle
...
Refine fetch op handle
7 years ago
yuyang18
9b7cd7f906
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_support_uint8
7 years ago
qiaolongfei
3b4124a778
code clean
7 years ago
yuyang18
3923d40992
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/support_op_role
7 years ago
yuyang18
fd2b4b478e
Make tensor support uint8
7 years ago
yuyang18
624caee56b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_fetch_op_handle
7 years ago
chengduoZH
e383ea20dc
fix fetch op handle
7 years ago
yuyang18
999d0fdbef
By default is all reduce
7 years ago
yuyang18
5895989a4f
Add ref
7 years ago
yuyang18
a77d1bc65e
Add debug code
7 years ago
yuyang18
9e736215e8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exec_strategy
7 years ago
yuyang18
7e052a5105
Fix compile
7 years ago
yuyang18
c70ddb0a57
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/polish_visit_data_type
7 years ago
yuyang18
741401e999
Update data_type
7 years ago
yuyang18
44c52a8c1a
Polish op_proto_maker
7 years ago
yuyang18
7c777dd549
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exec_strategy
7 years ago
yuyang18
017bba1664
Add op role
7 years ago
Yiqun Liu
b7026f79a9
Fix a bug related to dispensable inputs and refine the inference unittest ( #10527 )
...
* Fix a bug related to dispensable inputs and refine the inference unittest.
* Fix the use of dispensable inputs in reshape_op.
* Polish the enforce statements.
* Fix an English writing typo.
7 years ago
yuyang18
66e82b9869
Change implementation to fit sphinx model
7 years ago
yuyang18
08295f9877
Add build strategy
7 years ago
yuyang18
715c933d88
Change deps
7 years ago
typhoonzero
7b0c0273f4
update by comments
7 years ago
yuyang18
b9cc896545
Add includes
7 years ago
yuyang18
64305b3ffc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/polish_visit_data_type
7 years ago
yuyang18
e5281b3c2d
Clean code & add execution strategy
7 years ago
typhoonzero
928418a9ac
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
typhoonzero
0ae726f060
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
typhoonzero
f5840d8925
follow comments
7 years ago
typhoonzero
04bde96e4c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
yuyang18
54cbf799d6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/change_pe_strategy
7 years ago
yuyang18
9f705a4bb9
Use int instead of VarType as unordered_map key
7 years ago
chengduoZH
97cb5479ae
change PE strategy
7 years ago
chengduo
18f527bfa3
Merge pull request #10586 from jacquesqiao/fix-compile
...
Fix compile on mac
7 years ago
yuyang18
c4d6daac58
Polish SizeOfType
7 years ago
Yu Yang
046405e091
Merge pull request #10486 from reyoung/feature/clean_op_maker
...
Clean OpProtoAndCheckerMaker
7 years ago
yuyang18
711d86bb56
Polish data_type.h
7 years ago
qiaolongfei
3c2a6109ec
fix build computation_op_handle
7 years ago
qiaolongfei
07d94bdd82
fix compile fetch_op_handle on mac
7 years ago
chengduo
ce72c3ff26
Merge pull request #10476 from chengduoZH/refine_parallel_exe
...
Clean Parallel exe
7 years ago
chengduoZH
a89cd46700
Wait VarDummyHandle generated
7 years ago
fengjiayi
e15d616e29
Complete the C++ core of 'CustomReader'
7 years ago
reyoung
a0c413833d
Polish Dctor
7 years ago
reyoung
b0ca371f11
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_op_maker
7 years ago
chengduoZH
830532213a
extract method from broadcast::RunImpl
7 years ago
chengduoZH
9eec2c7509
refine pe
7 years ago
Yao Cheng
9ca8124f99
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
...
yes
7 years ago
chengduoZH
f4851f14b8
clean code
7 years ago
chengduoZH
a459764de7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_fetchop
7 years ago
Yu Yang
0e78cb69fb
Clean OpProtoAndCheckerMaker
...
Do not use ctor
* Reduce lines of code.
* We can use virtual function for Maker now.
* The implementation does not care what maker holds, it is easier to
refactor later.
7 years ago
Yao Cheng
8cbb49ce63
rm space
7 years ago
Yao Cheng
e97645c959
rm block_desc.h whitespace
7 years ago
gmcather
be1f058f52
Merge branch 'develop' into develop
7 years ago
Yao Cheng
55218dafbc
add prependAllocatedop
7 years ago
chengduoZH
676dfd1875
follow comments
7 years ago
chengduoZH
aff8a26d71
check generated_op_
7 years ago
chengduoZH
2e5d44f102
fix fetch op
7 years ago
typhoonzero
a529d790b6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
Yiqun Liu
fd1971caa0
Add the call of DropKids at the end of executor.Run to delete the local scopes created in operators ( #10403 )
...
* Add the call of DeleteScope to delete the memory of scope created by NewScope.
* Call DropKids at the end of executor.Run to delete all local scopes created in operators.
7 years ago
chengduo
99acf1da4c
Merge pull request #10351 from chengduoZH/feature/update_sparse_parameter
...
Feature/update sparse parameter
7 years ago
chengduoZH
881e063ee2
follow comments
7 years ago
chengduoZH
ff599b9218
use Reduce and Broadcast
7 years ago
chengduoZH
0441c2cc45
fix ci
7 years ago
Yancey
2d98a418d7
fix remove op ( #10410 )
...
* fix remove op
* update
7 years ago
chengduoZH
f9c680c43e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/update_sparse_parameter
7 years ago
typhoonzero
d9320dcd94
complete code
7 years ago
chengduoZH
7722baa8e3
follow comments and clean code
7 years ago
Yiqun Liu
6084af47ef
Fix the bug when an input variable of op is dispensable. ( #10268 )
...
* Fix the bug when an input variable of op is dispensable.
* Add HasInputs/Outputs interfaces to OperatorBase.
* Remove the unreferenced header file.
7 years ago
chengduoZH
c891189568
update sparse gradient parameter with reduce and broadcast
7 years ago
chengduoZH
5ff1ef36ee
update sparse parameter
7 years ago
Yancey
ff99d94197
Merge pull request #10164 from Yancey1989/lookup_sparse_table_op
...
add lookup_sparse_table_op
7 years ago
Yang yaming
9a8be9daca
Merge pull request #10223 from pkuyym/fix-10219
...
Change `customize_loss_grad` to `use_default_grad_scale`.
7 years ago
yangyaming
13fac4232a
Fix to pass CI.
7 years ago
Yu Yang
c0ac0cd6b3
Complete rename
7 years ago
chengduoZH
9fda5c92cd
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_FLAGS_use_deterministic_algo
7 years ago
yangyaming
f456cd8079
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-10219
7 years ago
Yu Yang
25779c982d
Merge pull request #10265 from reyoung/feature/polish_code
...
Add comments and polish code
7 years ago
Yu Yang
9612c7e599
Add comments and polish code
7 years ago
fengjiayi
71fa3ca9c4
Merge pull request #10232 from JiayiFeng/fix_unittests
...
Fix unittests
7 years ago
whs
2f9fa9b721
Merge pull request #10167 from wanghaoshuang/fluid_init
...
Add init interface for customize devices.
7 years ago
fengjiayi
330fa95cbd
Follow comments
7 years ago
fengjiayi
b88721213f
fix broadcast_op_test and reduce_op_test
7 years ago
yangyaming
82571deb89
Change `customize_loss_grad` to `use_default_grad_scale`.
7 years ago
Abhinav Arora
4c8ff72615
Fix CPPLint errors with executor ( #10212 )
7 years ago
Yancey1989
e393c86c4a
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into lookup_sparse_table_op
7 years ago
Tao Luo
8b2391858f
Merge pull request #10181 from abhinavarora/cpplint_advanced
...
Fix CPPLint errors with framework/op_desc
7 years ago
Wu Yi
3fdfa940be
Merge pull request #10135 from typhoonzero/unify_blocking_queue
...
Unify fluid blocking queue
7 years ago
wanghaoshuang
ad3f6f4ad5
Fix devices 'not undefined' error.
7 years ago
Abhinav Arora
edd3587e50
Fix CPPLint errors with op_desc
7 years ago
Yang Yang(Tony)
81dfc0cf0e
Clean up unused code in operator class ( #10035 )
...
* delete unused IsNetOp() and Rename()
* rm OperatorBase::Rename implementation
* delete Operator::InputVars()
* remove unused OperatorBase::ShareLoD; ShareLoD has been implemented in infershape
* organize operatorbase; remove unused set_type
* add comments
* fix comment
7 years ago
Abhinav Arora
f09aed0475
Fix CPPLint issues in framework/data_transform framework/prune.cc ( #10178 )
...
* Fix CPPLint issues with data_transform
* Fix CPPLint issues with prune.cc
7 years ago
wanghaoshuang
3d96b3811a
Fix InitGflags.
7 years ago
wanghaoshuang
a4b452a2d6
Remove initP2P(bool) and init function in framework.
7 years ago
Yu Yang
4ecc9b7bae
Merge pull request #10166 from reyoung/feature/train_and_test_recordio
...
Add demo for recordio train/test and parallel executor
7 years ago
wanghaoshuang
a0b258278e
Reuse 'initP2P(bool, std::vector)' in 'initP2P(bool)'
7 years ago
wanghaoshuang
f31bb1476c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fluid_init
7 years ago
wanghaoshuang
48b7b54321
Refine code.
7 years ago
Abhinav Arora
5ce57555ee
Fix CPPLint issues in init.cc, init.h and library_type.h ( #10148 )
...
* Fix CPPLint issues in init
* Fix compilation
* Fix typo in init.cc
* Fix CPPLint issues in library_type.h
* Fix compilation in init.h
7 years ago
wanghaoshuang
1bdea0a8d2
Add init interface for customize devices.
7 years ago
Yu Yang
2c8fe4e544
Merge pull request #10143 from typhoonzero/fix_multiGPU_dist_train
...
Fix multi gpu dist train
7 years ago
Yu Yang
54ada9449e
Add demo for recordio train/test and parallel executor
7 years ago
Yancey1989
e8d802159e
add lookup_sparse_table_op
7 years ago
fengjiayi
00b54d5803
fix fetch_op_handle
7 years ago
fengjiayi
dd5946528a
Merge pull request #10142 from JiayiFeng/Add_TensorCopySync
...
Add synchronous TensorCopy
7 years ago
Abhinav Arora
504e60a881
Fix Cpplint issues in framework/data_type.h and framework/feed_fetch_type.h ( #10146 )
...
* Fix CPPLint issues with data_type.h
* Fix CPPLint issues with feed_fetch_type.h
7 years ago
typhoonzero
f03415207a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_multiGPU_dist_train
7 years ago
typhoonzero
1fba0c578a
fix multi gpu dist train
7 years ago
fengjiayi
c5e178f4fd
fix errors
7 years ago
fengjiayi
9f11da5931
Add synchronous TensorCopy and use it in double buffer
7 years ago
Yang yaming
f3e4e42def
Merge pull request #10130 from reyoung/feature/skip_loss
...
Add customize_loss_grad option to PE
7 years ago
Yu Yang
55feba9b5a
Fix CPU compile
7 years ago
Wu Yi
3863c6a9a8
Merge pull request #10126 from typhoonzero/fix_para_exe_dist_train
...
Fix para exe dist train
7 years ago
typhoonzero
251e4a8ee5
unify fluid blocking queue
7 years ago
Yu Yang
7a395881d4
Add customize_loss_grad option to PE
7 years ago
typhoonzero
2b06b4b4e9
updates follow up para exe
7 years ago
typhoonzero
8ee837255e
fix send op handle local scope
7 years ago
chengduoZH
f5b4ac6e50
enable delay op feature
7 years ago
Tao Luo
58fd4c0994
Merge pull request #10111 from abhinavarora/cpplint_tensor_util_test
...
Fix CPPLint issues in tensor_util_test
7 years ago
Abhinav Arora
1c70600d09
Fix compile error
7 years ago
Abhinav Arora
6402b59a7c
Fix CPPLint issues in some tests in fluid/framework ( #10068 )
...
* Fix CPPLint in data_device_transform_test
* Fix compilation error
* Fix compilation error
* Fix CPPLint errors in data_layout_transform_test
* Fix CPPLint errors in data_type_transform_test
* Fix CPPLint errors in data_type_transform_test.cu
* Fix compilation error
* Fix CPPLint issues in threadpool_test
* Fix CPPLint issues in op_registry_test
* Fix CPPLint issues in operator_test
* Fix compilation error
* test
7 years ago
Abhinav Arora
c08752ca43
Fix compile error
7 years ago
Abhinav Arora
feaf168d95
Fix compile error
7 years ago
Abhinav Arora
b6ec45d8da
Fix CPPLint warnings in tensor_util_test
7 years ago
Abhinav Arora
e66f0c73af
Fix CPPLint errors in framework/details ( #10104 )
7 years ago
chengduo
eb8e14c9fd
Merge pull request #10081 from chengduoZH/refine/gather_broadcast
...
Fix scope of gather and broadcast, and code clean
7 years ago
Qiao Longfei
acd7309ea1
Merge pull request #10079 from jacquesqiao/fix-reduce_op_handle_test
...
fix reduce_op_handle_test
7 years ago
chengduoZH
9a4ae4df79
fix scope of gather broadcast
7 years ago
Tao Luo
8e00540778
Merge pull request #10058 from Xreki/core_fix_flush
...
Add flush of program desc to update the proto information.
7 years ago
qiaolongfei
9a0ad10ff1
fix reduce_op_handle_test
7 years ago
chengduo
cec4e6ed0d
Merge pull request #9946 from chengduoZH/feature/add_reduce_op_handle
...
Feature/add reduce op handle
7 years ago
chengduo
23a21c86b1
Merge pull request #9922 from chengduoZH/feature/refine_gather_reduce
...
Refine gather and broadcast
7 years ago
Yu Yang
92b5f49ee4
Eager destruct local scope
7 years ago
qiaolongfei
0ffd33d30e
VariableResponse support deserialize var into local scope
7 years ago
Liu Yiqun
7ffbcbcaf0
Add flush of program desc to update the proto information.
7 years ago
Yancey
0b8630b9b4
Merge pull request #9897 from Yancey1989/auto_grwon_sparse_table
...
Auto-grown sparse table
7 years ago
chengduo
88f8183576
Merge branch 'develop' into feature/refine_gather_reduce
7 years ago
chengduoZH
035712822c
fix VisitVariable
7 years ago
Abhinav Arora
cbbf08aee9
Fix CPPLint errors in some framework files
7 years ago
Yiqun Liu
598035f985
Fix a bug in save_inference_model and prune when the program is initialized by load_inference_model ( #10011 )
...
* Fix bug in save_inference_model and prune when the program is initialized by load_inference_program.
* Save the transpiled program instead.
7 years ago
Xin Pan
98c12b1a08
Clean up C++ codes. ( #10022 )
...
* Privatize OpHandleBase
* Clean up a few private members
7 years ago
Tao Luo
d84cdb7b59
Merge pull request #9911 from tonyyang-svail/unify_op_registry
...
Unify REGISTER_OP and REGISTER_OPERATOR
7 years ago
Yancey1989
f12b3f3617
use memcpy
7 years ago
Yu Yang
60e0d1aabd
Merge pull request #10023 from reyoung/feature/DtorOfPE
...
Correctly implement destructor of ParallelExecutor
7 years ago
Yu Yang
28013ef9e2
Merge pull request #10019 from reyoung/feature/feed_fetch_tensor_on_cpu
...
Fix FetchTensor on CPU
7 years ago
Yancey1989
13e7194ebd
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into auto_grwon_sparse_table
7 years ago
Yancey1989
70bf732f82
refine get interface
7 years ago
chengduoZH
fbb75c6bb6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_gather_reduce
7 years ago
Yu Yang
5305c5f845
Correctly implement destructor of ParallelExecutor
7 years ago
Yu Yang
38d75c912c
Follow comments
7 years ago
chengduoZH
4760ac44f1
check the generate_op is null or not and add DEPS of broadcast_op_handle and gather_op_handle
7 years ago
Yu Yang
6de5fd9e8a
Fix FetchTensor on CPU
7 years ago
Yu Yang
35ebe2eca7
Clean MultiDevicesGraphBuilder
7 years ago
Yu Yang
d24ef931b5
Clean Code
7 years ago
Yancey1989
b920b51686
rename parameter
7 years ago
Yancey1989
2917a75b7a
add some comments for sparse table
7 years ago
fengjiayi
fbe562478d
Merge pull request #9994 from reyoung/feature/debug
...
Fix bugs in local_scopes
7 years ago
chengduoZH
e63013a86f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_reduce_op_handle
7 years ago
chengduoZH
1eeb2e0042
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_reduce_op_handle
7 years ago
Abhinav Arora
1b5c1bcb57
Disabling Channel Test to fix CI
7 years ago
Yang Yang
68d96385e4
remove REGISTER_OP and REGISTER_OP_EX
7 years ago
Yu Yang
9b60d0decb
Merge pull request #9975 from reyoung/feature/VarHandleCtor
...
Using constructor for VarHandle
7 years ago
Yu Yang
06fb055a2f
New group
7 years ago
Yancey1989
ca327508cc
update
7 years ago