Commit Graph

932 Commits (55d7f55c63a33aa2fc8f8a4d1c5c2024ec8a137d)

Author SHA1 Message Date
sneaxiy 3f9292c6e6 fix merge conflict
7 years ago
sneaxiy dd70fb4393 fix type comparation bugs
7 years ago
fengjiayi ff4317cee9 follow comments
7 years ago
Yancey1989 c1ab215e26 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into dist_pass_barrier
7 years ago
Yancey1989 1366832a41 add dist pass barrier
7 years ago
gongweibao 66c91911cf
Improve brpccmake (#11842)
7 years ago
Yan Chunwei 28172bbb8e
add debug to replacing enforce with GLOG for debug (#11244)
7 years ago
guochaorong c318aa5ffa
Merge pull request #11850 from guochaorong/revert_11496
7 years ago
fengjiayi 49a04d75ee Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_data_balance
7 years ago
fengjiayi 4b950951d3 Add unittests and fix a few bugs
7 years ago
chengduo 7b54f16855
Follow comment (#11845)
7 years ago
Tao Luo b0378963d0
Merge pull request #11666 from mozga-intel/mozga-intel/Batch_norm_support_other_type
7 years ago
guochaorong 6a35899131 Revert "Extend fill_zeros_like_op for zero-filling an LoDTensorArray (#11496)"
7 years ago
gongweibao c2165ffa7b Fix codesytle (#11836)
7 years ago
chengduo 8d76cf397d
Fix TensorCopy bug (#11822)
7 years ago
fengjiayi ba5380129c Merge branch 'fix_Mac_compile_errors' into dev_data_balance
7 years ago
fengjiayi 077434c26c fix CPU compile error
7 years ago
fengjiayi 47388020a2 fix bugs
7 years ago
yuyang18 5e23a5ec18
Rename TransferData -> TransformData
7 years ago
yuyang18 5999201346
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refactor_op
7 years ago
chengduo 6711b7b5f1
fix FeedAndSplitTensorIntoLocalScopes (#11817)
7 years ago
fengjiayi 2e320079d3 fix bugs
7 years ago
fengjiayi 5b4f283069 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_data_balance
7 years ago
fengjiayi b6dc3a59f1 Add DataBalanceOpHandle to MultiDeviceSSAGragh
7 years ago
mozga-intel 61c54dbbe7 Remove additional function of the code
7 years ago
Haichao Zhang bc28cf613f Extend fill_zeros_like_op for zero-filling an LoDTensorArray (#11496)
7 years ago
yi.wu 8d04d0e2a3 update
7 years ago
yi.wu 6f0107126a fix broadcast bug
7 years ago
yuyang18 9faf5a39c5
Refactor Operator.cc, and clean code
7 years ago
yi.wu 88cb47bd86 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_paraexe_bcast
7 years ago
yi.wu 8e48c77b54 wip
7 years ago
yi.wu 3d69a82b83 fix dist train broadcasting bug
7 years ago
chengduo bb29800aaa
small refine (#11460)
7 years ago
Tomasz Patejko e26f51ce74 MKLDNN elementwis_add with default broadcast operations (#11544)
7 years ago
yi.wu 254154a9bf fix sparse paraexe dist train
7 years ago
Yancey 0d4b376fc0
Merge pull request #11585 from Yancey1989/fix_pserver_sub_blocks
7 years ago
Yancey1989 8cb494f79c add blocks attr type in proto
7 years ago
Yancey1989 56a903d3ac use optimize block list instead of first optimize block
7 years ago
fengjiayi e45a55523b
Merge pull request #11639 from JiayiFeng/fix_mac_compile
7 years ago
chengduo da556ed6d4
enhance ParallelExecutor stable (#11637)
7 years ago
Kexin Zhao 073af6237a
add print lod_tensor int64 option (#11644)
7 years ago
fengjiayi 964f515e9a fix mac compile
7 years ago
Yu Yang 9b3f48d7e6
Merge pull request #11616 from chengduoZH/fix_parallel_exe
7 years ago
chengduoZH c99fca5f90 Add No Mutex
7 years ago
chengduoZH 13de72388d Fix broadcast
7 years ago
Xin Pan 0151e4eb3d
Merge pull request #11608 from panyx0718/doc
7 years ago
chengduoZH 28a86aebc3 Fix Parallel Exe(VarHandel's version)
7 years ago
Yancey c413c46915
Merge pull request #11603 from Yancey1989/rename_details_folder
7 years ago
Yancey 9cc1eb434e
Merge pull request #11221 from Yancey1989/overlap_memcpy_with_dist
7 years ago
Xin Pan df31926fcf small thread-safety fix and doc improvements.
7 years ago
Yancey1989 1ef6cdb60e move dist codes from operaotrs/detail to operators/distributed
7 years ago
Qiao Longfei 05a92779f4
Merge pull request #11524 from jacquesqiao/add-keep_kids-for-executor
7 years ago
Yancey1989 7e6518e8ca fix compile warning
7 years ago
Yancey1989 7d1b146939 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Qiyang Min a29cb4be2a Fix decay bug (#11520)
7 years ago
qiaolongfei 2b1ecdf55a update comment
7 years ago
qiaolongfei daa0fbd5f4 add keep_kids flag for executor
7 years ago
tensor-tang 609dccfb55
Merge pull request #11395 from tensor-tang/fix
7 years ago
Kexin Zhao 417fcf4f43
Modify Pybind LoDTensor API according to length-based LoD (#11106)
7 years ago
tensor-tang 0ddc5d8631
Merge pull request #11258 from tensor-tang/refine
7 years ago
tensor-tang 6c1cf60950 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang c453573286 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang 3e58df20df initial with only 1 mkl/openblas threads for each pthreads
7 years ago
Qiyang Min 046bb5c8cb Fix NCCLBcast hang up bug in Parallel Executor (#11377)
7 years ago
tensor-tang 85c3bfc1f9 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
Yancey1989 3d875b693f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Yang Yang(Tony) d827c6e87a
Dynamic Graph first prototype (#11415)
7 years ago
qiaolongfei 7ebef493d5 add row_size for selected rows in DebugStringEx
7 years ago
qiaolongfei cdd55dbcaa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei 82416f1844 fix concurrency_test build error on mac
7 years ago
qiaolongfei 9ebbfa6bbc fix build on mac
7 years ago
Yancey1989 f52d78d189 update by comment
7 years ago
fengjiayi 88fa9c2ec7
Merge pull request #11267 from JiayiFeng/fix_reader_bug
7 years ago
tensor-tang 6602db5b3e throw warning if try to use mkldnn while not compiled
7 years ago
Yancey1989 6d752bafd8 use get_appropriate_dev to schedule rpc op
7 years ago
qiaolongfei f031555cfb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
Qiao Longfei 431491a2a8
Merge pull request #11366 from jacquesqiao/refine-prefetch
7 years ago
Wu Yi 34865f2de3
Trainer send term signal (#11220)
7 years ago
Yancey1989 4444e79e46 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
qiaolongfei 6dd3f3cf27 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei 16658f7b59 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
chengduo 183377f410
Merge pull request #11306 from chengduoZH/enable_cpu_on_pe
7 years ago
qiaolongfei 83a577e8ce fix build problem
7 years ago
dzhwinter bfa3fd6f15
add inplace attribute to op_proto_maker (#10665)
7 years ago
gongweibao 9087c6687f
polish (#11363)
7 years ago
chengduoZH 173d72b481 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enable_cpu_on_pe
7 years ago
chengduoZH aadaadf735 replace use_event with use_cuda, because use_event means the program running with CUDA, so use_cuda maybe more intuitive.
7 years ago
gongweibao 627d7a64f8
Clean `sendop` `recv` operator. (#11309)
7 years ago
chengduo fa29ef0b0d
Merge pull request #11277 from chengduoZH/check_ssa_graph
7 years ago
chengduoZH 961fbce8e2 follow comments
7 years ago
chengduoZH 7b723839ef Add cpu test for parallel_executor_crf executor_fetch_feed, and enable these tests
7 years ago
chengduoZH d24e046c1e fix allReduce bug
7 years ago
chengduoZH a57e8a4338 add cpu test
7 years ago
qiaolongfei 0485405b3d add more debug string
7 years ago
Luo Tao 79d555b9f2 Merge branch 'develop' into mkldnn
7 years ago
gongweibao 062d5a56b4
Add comments to a singleton. (#11333)
7 years ago
chengduoZH 1e731f5964 small fix
7 years ago
chengduoZH 495368c243 ADD CPU_NUM
7 years ago
chengduoZH 27073c284d nccl_all_reduce_op_handle => all_reduce_op_handle
7 years ago
chengduoZH 2d94697a82 code refine
7 years ago
chengduoZH 5a3c8bf813 fix in c++ side
7 years ago
Luo Tao c6d230e03e add FLAGS_use_mkldnn to global control use_mkldnn
7 years ago
fengjiayi d745840a6e fix a small compile error on Mac
7 years ago
chengduoZH 0c851cab22 add SSA graph checker
7 years ago
Yancey1989 d5a88b9340 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
chengduoZH 1076e85135 refine logic
7 years ago
Yancey 0aa9546eed
fix dist train error (#11281)
7 years ago
tensor-tang b8d315fb69 make scope thread safe
7 years ago
dzhwinter d48172f22a
split reduce op into multiple libraries, accelerate the compiling (#11029)
7 years ago
fengjiayi 5803115720
Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext
7 years ago
tensor-tang bfd42683ca Merge remote-tracking branch 'ups/develop' into refine
7 years ago
dzhwinter f7c96f079b
Big data op_test benchmark, for checking output consistent in different runs. (#10646)
7 years ago
fengjiayi dc8e0b494d fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
mozga-intel 3ff9ba0e6b Mkldnn layout (#11040)
7 years ago
chengduoZH 8291b916d6 replace graph_builder_factory with ssa_graph_builder_factory
7 years ago
chengduoZH 9ac785be39 check graph's validation
7 years ago
fengjiayi a1e046bfc0
Merge pull request #11270 from JiayiFeng/fix_a_error_on_max
7 years ago
fengjiayi 499dbe0536 fix a multi-thread bug in readers
7 years ago
fengjiayi 7344210070 Merge branch 'fix_a_error_on_max' into fix_reader_bug
7 years ago
fengjiayi 2f5e310167 fix a compile error
7 years ago
yuyang18 8149b0a9aa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fuse_var_op_handle
7 years ago
Yancey1989 23433def4b Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
fengjiayi 5d45793936 hot fix
7 years ago
tensor-tang 0693b4148d Merge remote-tracking branch 'ups/develop' into refine
7 years ago
Yu Yang ff9b1a0f95
Merge pull request #11234 from reyoung/feature/refine_code
7 years ago
Yu Yang 08823146ec
Merge pull request #11232 from reyoung/feature/extract_tensor
7 years ago
tensor-tang f326b0117e refine scope lock
7 years ago
Yancey1989 15913d92c5 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
tensor-tang 4b7b17a84f fix conflcts
7 years ago
yuyang18 d9af153232 SSA Graph Builder Factory
7 years ago
yuyang18 b6c8701e45 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/extract_tensor
7 years ago
chengduoZH a584bc86dd add fuse var op handle
7 years ago
Xin Pan 106ee9d1cc
Merge pull request #11243 from panyx0718/scope
7 years ago
tensor-tang 64323b1caf Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
Xin Pan 73aa5d230b small clean up and document pointer ownership.
7 years ago
Yancey1989 e533a4b4ab Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
tensor-tang 4ae935e2cf refine the lock in scope
7 years ago
Yancey1989 cb3861538d fix compile failed with CPU
7 years ago
Yu Yang c36dd3b338
Merge pull request #11114 from reyoung/feature/yep
7 years ago
tensor-tang 9b34f8dabd fix abort issue in cpu multi-threads
7 years ago
yuyang18 fc9f2d282f Extract method from tensor_impl.h to tensor.cc
7 years ago
Yancey1989 82d741c4b9 fix op name typo
7 years ago
Yancey1989 6d69ae0c6e code cleanup
7 years ago
Yancey1989 93401c98e1 overlap rpc op memcpy in distributed training
7 years ago
Qiao Longfei 238124909e
fix protobuf memory leak (#11177)
7 years ago
sneaxiy 02cc80b30d Fix dangling pointer bug
7 years ago
yuyang18 86a61c177f Add ScopeBufferedSSAGraphExecutor
7 years ago
fengjiayi d6997e5bc8
Merge pull request #11083 from JiayiFeng/dev_refine_programdesc_copy
7 years ago
chengduo 01bbe532da
Merge pull request #11079 from chengduoZH/balance_parameter_update
7 years ago
fengjiayi a3aca2a3cf fix bugs
7 years ago
Tao Luo 47f3cae2af
Merge pull request #11038 from sneaxiy/zjl-dev
7 years ago
fengjiayi 2a3c58d3fe refine programdesc copy
7 years ago
chengduoZH e330cd032e balance parameter update
7 years ago
Yu Yang 14905516aa
Merge pull request #10970 from JiayiFeng/dev_add_random_crop_op
7 years ago
fengjiayi 3bce3dbce1 fix a bug
7 years ago
Qiao Longfei 654f5d3c91
Merge pull request #11012 from jacquesqiao/add-auto_grown_mutex
7 years ago
sneaxiy 46f13237dc Fix bugs in framework/tensor_impl.h and polish framework/reader.cc
7 years ago
qiaolongfei fa2079b71a add mutex header
7 years ago
qiaolongfei 167312484b use unique ptr to hold auto_grown_mutex_ for default copy and move constructor
7 years ago
qiaolongfei add9ed3388 change order of member
7 years ago
Qiao Longfei ab953baec6
Merge pull request #10973 from jacquesqiao/fix-prefetch
7 years ago
qiaolongfei e3c041d319 add auto_grown_mutex for selected rows
7 years ago
chengduo 8c54f1fb9c
Merge pull request #10906 from chengduoZH/fix_data_trans
7 years ago
chengduo 7d1332f696
Merge pull request #11006 from chengduoZH/fix_add_check_nan_inf_in_operator
7 years ago
qiaolongfei 74701b2692 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-prefetch
7 years ago
chengduoZH cb1c657ce4 move check_nan_inf to operator
7 years ago
chengduoZH 17a076d8ed replace TensorCopy with TensorCopySync
7 years ago
Yancey1989 6b91d407de revert variable mutex
7 years ago
Yancey1989 20c24c05aa singleton rpc_client
7 years ago
Yancey1989 28596a3386 add gflag ssa_graph_path
7 years ago
qiaolongfei 25f47fc0af fix prefetch bugs, optimize code
7 years ago
Yancey1989 ad6c0142c4 clean up codes
7 years ago
Yancey1989 268e9dc1c6 polish code
7 years ago
Yancey1989 ceefbf3259 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
chengduoZH fa613206e4 update
7 years ago
Yancey1989 b348e159f2 fix rename var
7 years ago
fengjiayi cd8700f172
Merge pull request #10872 from JiayiFeng/dev_CustomReader
7 years ago
Yancey1989 0aa6f9e934 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
chengduoZH 4bfadcd136 fix data trans
7 years ago
Yu Yang d406c76a30
Merge pull request #10744 from reyoung/feature/refine_parallel_executor
7 years ago
Yancey1989 540b45350d use req_count as atomic type
7 years ago
yuyang18 ad6a8715ff Disable unstable unittests
7 years ago
Yancey1989 6debbcd9f9 connect fetch barrier and concat op
7 years ago
yuyang18 3b04f0099c Fix bug
7 years ago
yuyang18 39a44e3834 Add Debug log
7 years ago
yuyang18 fb370f4411 Refine code
7 years ago
yuyang18 03e4da6d04 Fix bug
7 years ago
Yancey1989 147d54ba62 update
7 years ago
Yancey1989 e9abc66910 fix pe
7 years ago
yuyang18 c978259027 Add default value of op_role
7 years ago
yuyang18 0e5a278c23 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/support_op_role
7 years ago
fengjiayi 983c9a2a30 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CustomReader
7 years ago
Yancey1989 952fa04009 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yancey1989 62af10d440 support multiple devices
7 years ago
dzhwinter 5828101c23
make uint8 support in data_type transform and memory optimize (#10715)
7 years ago
Yancey 7ae03ec08b
fix lack split_byref op in parallel executor (#10770)
7 years ago
Wu Yi ebc7303990
listen_and_serv use local scope (#10663)
7 years ago
chengduo 93c470034a fix DataTransFunc (#10752)
7 years ago
Yu Yang 14248a64d7 Fix hang when input is duplicated (#10709)
7 years ago
Yu Yang e528862714
Merge pull request #10683 from reyoung/feature/tensor_support_uint8
7 years ago