Commit Graph

838 Commits (7231ef6b68334ef095b643b565a8b2e52806c150)

Author SHA1 Message Date
chengduoZH 13de72388d Fix broadcast
7 years ago
Xin Pan 0151e4eb3d
Merge pull request #11608 from panyx0718/doc
7 years ago
chengduoZH 28a86aebc3 Fix Parallel Exe(VarHandel's version)
7 years ago
Yancey c413c46915
Merge pull request #11603 from Yancey1989/rename_details_folder
7 years ago
Yancey 9cc1eb434e
Merge pull request #11221 from Yancey1989/overlap_memcpy_with_dist
7 years ago
Xin Pan df31926fcf small thread-safety fix and doc improvements.
7 years ago
Yancey1989 1ef6cdb60e move dist codes from operaotrs/detail to operators/distributed
7 years ago
Qiao Longfei 05a92779f4
Merge pull request #11524 from jacquesqiao/add-keep_kids-for-executor
7 years ago
Yancey1989 7e6518e8ca fix compile warning
7 years ago
Yancey1989 7d1b146939 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Qiyang Min a29cb4be2a Fix decay bug (#11520)
7 years ago
qiaolongfei 2b1ecdf55a update comment
7 years ago
qiaolongfei daa0fbd5f4 add keep_kids flag for executor
7 years ago
tensor-tang 609dccfb55
Merge pull request #11395 from tensor-tang/fix
7 years ago
Kexin Zhao 417fcf4f43
Modify Pybind LoDTensor API according to length-based LoD (#11106)
7 years ago
tensor-tang 0ddc5d8631
Merge pull request #11258 from tensor-tang/refine
7 years ago
tensor-tang 6c1cf60950 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang c453573286 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang 3e58df20df initial with only 1 mkl/openblas threads for each pthreads
7 years ago
Qiyang Min 046bb5c8cb Fix NCCLBcast hang up bug in Parallel Executor (#11377)
7 years ago
tensor-tang 85c3bfc1f9 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
Yancey1989 3d875b693f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Yang Yang(Tony) d827c6e87a
Dynamic Graph first prototype (#11415)
7 years ago
qiaolongfei 7ebef493d5 add row_size for selected rows in DebugStringEx
7 years ago
qiaolongfei cdd55dbcaa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei 82416f1844 fix concurrency_test build error on mac
7 years ago
qiaolongfei 9ebbfa6bbc fix build on mac
7 years ago
Yancey1989 f52d78d189 update by comment
7 years ago
fengjiayi 88fa9c2ec7
Merge pull request #11267 from JiayiFeng/fix_reader_bug
7 years ago
tensor-tang 6602db5b3e throw warning if try to use mkldnn while not compiled
7 years ago
Yancey1989 6d752bafd8 use get_appropriate_dev to schedule rpc op
7 years ago
qiaolongfei f031555cfb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
Qiao Longfei 431491a2a8
Merge pull request #11366 from jacquesqiao/refine-prefetch
7 years ago
Wu Yi 34865f2de3
Trainer send term signal (#11220)
7 years ago
Yancey1989 4444e79e46 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
qiaolongfei 6dd3f3cf27 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei 16658f7b59 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
chengduo 183377f410
Merge pull request #11306 from chengduoZH/enable_cpu_on_pe
7 years ago
qiaolongfei 83a577e8ce fix build problem
7 years ago
dzhwinter bfa3fd6f15
add inplace attribute to op_proto_maker (#10665)
7 years ago
gongweibao 9087c6687f
polish (#11363)
7 years ago
chengduoZH 173d72b481 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enable_cpu_on_pe
7 years ago
chengduoZH aadaadf735 replace use_event with use_cuda, because use_event means the program running with CUDA, so use_cuda maybe more intuitive.
7 years ago
gongweibao 627d7a64f8
Clean `sendop` `recv` operator. (#11309)
7 years ago
chengduo fa29ef0b0d
Merge pull request #11277 from chengduoZH/check_ssa_graph
7 years ago
chengduoZH 961fbce8e2 follow comments
7 years ago
chengduoZH 7b723839ef Add cpu test for parallel_executor_crf executor_fetch_feed, and enable these tests
7 years ago
chengduoZH d24e046c1e fix allReduce bug
7 years ago
chengduoZH a57e8a4338 add cpu test
7 years ago
qiaolongfei 0485405b3d add more debug string
7 years ago
Luo Tao 79d555b9f2 Merge branch 'develop' into mkldnn
7 years ago
gongweibao 062d5a56b4
Add comments to a singleton. (#11333)
7 years ago
chengduoZH 1e731f5964 small fix
7 years ago
chengduoZH 495368c243 ADD CPU_NUM
7 years ago
chengduoZH 27073c284d nccl_all_reduce_op_handle => all_reduce_op_handle
7 years ago
chengduoZH 2d94697a82 code refine
7 years ago
chengduoZH 5a3c8bf813 fix in c++ side
7 years ago
Luo Tao c6d230e03e add FLAGS_use_mkldnn to global control use_mkldnn
7 years ago
fengjiayi d745840a6e fix a small compile error on Mac
7 years ago
chengduoZH 0c851cab22 add SSA graph checker
7 years ago
Yancey1989 d5a88b9340 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
chengduoZH 1076e85135 refine logic
7 years ago
Yancey 0aa9546eed
fix dist train error (#11281)
7 years ago
tensor-tang b8d315fb69 make scope thread safe
7 years ago
dzhwinter d48172f22a
split reduce op into multiple libraries, accelerate the compiling (#11029)
7 years ago
fengjiayi 5803115720
Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext
7 years ago
tensor-tang bfd42683ca Merge remote-tracking branch 'ups/develop' into refine
7 years ago
dzhwinter f7c96f079b
Big data op_test benchmark, for checking output consistent in different runs. (#10646)
7 years ago
fengjiayi dc8e0b494d fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
mozga-intel 3ff9ba0e6b Mkldnn layout (#11040)
7 years ago
chengduoZH 8291b916d6 replace graph_builder_factory with ssa_graph_builder_factory
7 years ago
chengduoZH 9ac785be39 check graph's validation
7 years ago
fengjiayi a1e046bfc0
Merge pull request #11270 from JiayiFeng/fix_a_error_on_max
7 years ago
fengjiayi 499dbe0536 fix a multi-thread bug in readers
7 years ago
fengjiayi 7344210070 Merge branch 'fix_a_error_on_max' into fix_reader_bug
7 years ago
fengjiayi 2f5e310167 fix a compile error
7 years ago
yuyang18 8149b0a9aa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fuse_var_op_handle
7 years ago
Yancey1989 23433def4b Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
fengjiayi 5d45793936 hot fix
7 years ago
tensor-tang 0693b4148d Merge remote-tracking branch 'ups/develop' into refine
7 years ago
Yu Yang ff9b1a0f95
Merge pull request #11234 from reyoung/feature/refine_code
7 years ago
Yu Yang 08823146ec
Merge pull request #11232 from reyoung/feature/extract_tensor
7 years ago
tensor-tang f326b0117e refine scope lock
7 years ago
Yancey1989 15913d92c5 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
tensor-tang 4b7b17a84f fix conflcts
7 years ago
yuyang18 d9af153232 SSA Graph Builder Factory
7 years ago
yuyang18 b6c8701e45 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/extract_tensor
7 years ago
chengduoZH a584bc86dd add fuse var op handle
7 years ago
Xin Pan 106ee9d1cc
Merge pull request #11243 from panyx0718/scope
7 years ago
tensor-tang 64323b1caf Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
Xin Pan 73aa5d230b small clean up and document pointer ownership.
7 years ago
Yancey1989 e533a4b4ab Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
tensor-tang 4ae935e2cf refine the lock in scope
7 years ago
Yancey1989 cb3861538d fix compile failed with CPU
7 years ago
Yu Yang c36dd3b338
Merge pull request #11114 from reyoung/feature/yep
7 years ago
tensor-tang 9b34f8dabd fix abort issue in cpu multi-threads
7 years ago
yuyang18 fc9f2d282f Extract method from tensor_impl.h to tensor.cc
7 years ago
Yancey1989 82d741c4b9 fix op name typo
7 years ago
Yancey1989 6d69ae0c6e code cleanup
7 years ago
Yancey1989 93401c98e1 overlap rpc op memcpy in distributed training
7 years ago
Qiao Longfei 238124909e
fix protobuf memory leak (#11177)
7 years ago
sneaxiy 02cc80b30d Fix dangling pointer bug
7 years ago
yuyang18 86a61c177f Add ScopeBufferedSSAGraphExecutor
7 years ago
fengjiayi d6997e5bc8
Merge pull request #11083 from JiayiFeng/dev_refine_programdesc_copy
7 years ago
chengduo 01bbe532da
Merge pull request #11079 from chengduoZH/balance_parameter_update
7 years ago
fengjiayi a3aca2a3cf fix bugs
7 years ago
Tao Luo 47f3cae2af
Merge pull request #11038 from sneaxiy/zjl-dev
7 years ago
fengjiayi 2a3c58d3fe refine programdesc copy
7 years ago
chengduoZH e330cd032e balance parameter update
7 years ago
Yu Yang 14905516aa
Merge pull request #10970 from JiayiFeng/dev_add_random_crop_op
7 years ago
fengjiayi 3bce3dbce1 fix a bug
7 years ago
Qiao Longfei 654f5d3c91
Merge pull request #11012 from jacquesqiao/add-auto_grown_mutex
7 years ago
sneaxiy 46f13237dc Fix bugs in framework/tensor_impl.h and polish framework/reader.cc
7 years ago
qiaolongfei fa2079b71a add mutex header
7 years ago
qiaolongfei 167312484b use unique ptr to hold auto_grown_mutex_ for default copy and move constructor
7 years ago
qiaolongfei add9ed3388 change order of member
7 years ago
Qiao Longfei ab953baec6
Merge pull request #10973 from jacquesqiao/fix-prefetch
7 years ago
qiaolongfei e3c041d319 add auto_grown_mutex for selected rows
7 years ago
chengduo 8c54f1fb9c
Merge pull request #10906 from chengduoZH/fix_data_trans
7 years ago
chengduo 7d1332f696
Merge pull request #11006 from chengduoZH/fix_add_check_nan_inf_in_operator
7 years ago
qiaolongfei 74701b2692 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-prefetch
7 years ago
chengduoZH cb1c657ce4 move check_nan_inf to operator
7 years ago
chengduoZH 17a076d8ed replace TensorCopy with TensorCopySync
7 years ago
Yancey1989 6b91d407de revert variable mutex
7 years ago
Yancey1989 20c24c05aa singleton rpc_client
7 years ago
Yancey1989 28596a3386 add gflag ssa_graph_path
7 years ago
qiaolongfei 25f47fc0af fix prefetch bugs, optimize code
7 years ago
Yancey1989 ad6c0142c4 clean up codes
7 years ago
Yancey1989 268e9dc1c6 polish code
7 years ago
Yancey1989 ceefbf3259 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
chengduoZH fa613206e4 update
7 years ago
Yancey1989 b348e159f2 fix rename var
7 years ago
fengjiayi cd8700f172
Merge pull request #10872 from JiayiFeng/dev_CustomReader
7 years ago
Yancey1989 0aa6f9e934 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
chengduoZH 4bfadcd136 fix data trans
7 years ago
Yu Yang d406c76a30
Merge pull request #10744 from reyoung/feature/refine_parallel_executor
7 years ago
Yancey1989 540b45350d use req_count as atomic type
7 years ago
yuyang18 ad6a8715ff Disable unstable unittests
7 years ago
Yancey1989 6debbcd9f9 connect fetch barrier and concat op
7 years ago
yuyang18 3b04f0099c Fix bug
7 years ago
yuyang18 39a44e3834 Add Debug log
7 years ago
yuyang18 fb370f4411 Refine code
7 years ago
yuyang18 03e4da6d04 Fix bug
7 years ago
Yancey1989 147d54ba62 update
7 years ago
Yancey1989 e9abc66910 fix pe
7 years ago
yuyang18 c978259027 Add default value of op_role
7 years ago
yuyang18 0e5a278c23 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/support_op_role
7 years ago
fengjiayi 983c9a2a30 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CustomReader
7 years ago
Yancey1989 952fa04009 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yancey1989 62af10d440 support multiple devices
7 years ago