Commit Graph

838 Commits (7231ef6b68334ef095b643b565a8b2e52806c150)

Author SHA1 Message Date
yuyang18 82866d4a18 Add register kernel functor and shrink reshape op
7 years ago
fengjiayi 58560622bc Merge pull request #11854 from JiayiFeng/dev_data_balance
7 years ago
yuyang18 75ae426aa5 Merge branch 'feature/change_op_kernel_to_func' into feature/fix_reshape_op_size
7 years ago
yuyang18 3b00ed81a9 Make Kernel registed as a function
7 years ago
yuyang18 1ce478f100 Polish reshape op
7 years ago
Yancey1989 37410a0c75 update by comment
7 years ago
sneaxiy 3f9292c6e6 fix merge conflict
7 years ago
sneaxiy dd70fb4393 fix type comparation bugs
7 years ago
fengjiayi ff4317cee9 follow comments
7 years ago
Yancey1989 c1ab215e26 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into dist_pass_barrier
7 years ago
Yancey1989 1366832a41 add dist pass barrier
7 years ago
gongweibao 66c91911cf Improve brpccmake (#11842)
7 years ago
Yan Chunwei 28172bbb8e add debug to replacing enforce with GLOG for debug (#11244)
7 years ago
guochaorong c318aa5ffa Merge pull request #11850 from guochaorong/revert_11496
7 years ago
fengjiayi 49a04d75ee Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_data_balance
7 years ago
fengjiayi 4b950951d3 Add unittests and fix a few bugs
7 years ago
chengduo 7b54f16855 Follow comment (#11845)
7 years ago
Tao Luo b0378963d0 Merge pull request #11666 from mozga-intel/mozga-intel/Batch_norm_support_other_type
7 years ago
guochaorong 6a35899131 Revert "Extend fill_zeros_like_op for zero-filling an LoDTensorArray (#11496)"
7 years ago
gongweibao c2165ffa7b Fix codesytle (#11836)
7 years ago
chengduo 8d76cf397d Fix TensorCopy bug (#11822)
7 years ago
fengjiayi ba5380129c Merge branch 'fix_Mac_compile_errors' into dev_data_balance
7 years ago
fengjiayi 077434c26c fix CPU compile error
7 years ago
fengjiayi 47388020a2 fix bugs
7 years ago
yuyang18 5e23a5ec18 Rename TransferData -> TransformData
7 years ago
yuyang18 5999201346 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refactor_op
7 years ago
chengduo 6711b7b5f1 fix FeedAndSplitTensorIntoLocalScopes (#11817)
7 years ago
fengjiayi 2e320079d3 fix bugs
7 years ago
fengjiayi 5b4f283069 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_data_balance
7 years ago
fengjiayi b6dc3a59f1 Add DataBalanceOpHandle to MultiDeviceSSAGragh
7 years ago
mozga-intel 61c54dbbe7 Remove additional function of the code
7 years ago
Haichao Zhang bc28cf613f Extend fill_zeros_like_op for zero-filling an LoDTensorArray (#11496)
7 years ago
yi.wu 8d04d0e2a3 update
7 years ago
yi.wu 6f0107126a fix broadcast bug
7 years ago
yuyang18 9faf5a39c5 Refactor Operator.cc, and clean code
7 years ago
yi.wu 88cb47bd86 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_paraexe_bcast
7 years ago
yi.wu 8e48c77b54 wip
7 years ago
yi.wu 3d69a82b83 fix dist train broadcasting bug
7 years ago
chengduo bb29800aaa small refine (#11460)
7 years ago
Tomasz Patejko e26f51ce74 MKLDNN elementwis_add with default broadcast operations (#11544)
7 years ago
yi.wu 254154a9bf fix sparse paraexe dist train
7 years ago
Yancey 0d4b376fc0 Merge pull request #11585 from Yancey1989/fix_pserver_sub_blocks
7 years ago
Yancey1989 8cb494f79c add blocks attr type in proto
7 years ago
Yancey1989 56a903d3ac use optimize block list instead of first optimize block
7 years ago
fengjiayi e45a55523b Merge pull request #11639 from JiayiFeng/fix_mac_compile
7 years ago
chengduo da556ed6d4 enhance ParallelExecutor stable (#11637)
7 years ago
Kexin Zhao 073af6237a add print lod_tensor int64 option (#11644)
7 years ago
fengjiayi 964f515e9a fix mac compile
7 years ago
Yu Yang 9b3f48d7e6 Merge pull request #11616 from chengduoZH/fix_parallel_exe
7 years ago
chengduoZH c99fca5f90 Add No Mutex
7 years ago
chengduoZH 13de72388d Fix broadcast
7 years ago
Xin Pan 0151e4eb3d Merge pull request #11608 from panyx0718/doc
7 years ago
chengduoZH 28a86aebc3 Fix Parallel Exe(VarHandel's version)
7 years ago
Yancey c413c46915 Merge pull request #11603 from Yancey1989/rename_details_folder
7 years ago
Yancey 9cc1eb434e Merge pull request #11221 from Yancey1989/overlap_memcpy_with_dist
7 years ago
Xin Pan df31926fcf small thread-safety fix and doc improvements.
7 years ago
Yancey1989 1ef6cdb60e move dist codes from operaotrs/detail to operators/distributed
7 years ago
Qiao Longfei 05a92779f4 Merge pull request #11524 from jacquesqiao/add-keep_kids-for-executor
7 years ago
Yancey1989 7e6518e8ca fix compile warning
7 years ago
Yancey1989 7d1b146939 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Qiyang Min a29cb4be2a Fix decay bug (#11520)
7 years ago
qiaolongfei 2b1ecdf55a update comment
7 years ago
qiaolongfei daa0fbd5f4 add keep_kids flag for executor
7 years ago
tensor-tang 609dccfb55 Merge pull request #11395 from tensor-tang/fix
7 years ago
Kexin Zhao 417fcf4f43 Modify Pybind LoDTensor API according to length-based LoD (#11106)
7 years ago
tensor-tang 0ddc5d8631 Merge pull request #11258 from tensor-tang/refine
7 years ago
tensor-tang 6c1cf60950 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang c453573286 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang 3e58df20df initial with only 1 mkl/openblas threads for each pthreads
7 years ago
Qiyang Min 046bb5c8cb Fix NCCLBcast hang up bug in Parallel Executor (#11377)
7 years ago
tensor-tang 85c3bfc1f9 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
Yancey1989 3d875b693f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Yang Yang(Tony) d827c6e87a Dynamic Graph first prototype (#11415)
7 years ago
qiaolongfei 7ebef493d5 add row_size for selected rows in DebugStringEx
7 years ago
qiaolongfei cdd55dbcaa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei 82416f1844 fix concurrency_test build error on mac
7 years ago
qiaolongfei 9ebbfa6bbc fix build on mac
7 years ago
Yancey1989 f52d78d189 update by comment
7 years ago
fengjiayi 88fa9c2ec7 Merge pull request #11267 from JiayiFeng/fix_reader_bug
7 years ago
tensor-tang 6602db5b3e throw warning if try to use mkldnn while not compiled
7 years ago
Yancey1989 6d752bafd8 use get_appropriate_dev to schedule rpc op
7 years ago
qiaolongfei f031555cfb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
Qiao Longfei 431491a2a8 Merge pull request #11366 from jacquesqiao/refine-prefetch
7 years ago
Wu Yi 34865f2de3 Trainer send term signal (#11220)
7 years ago
Yancey1989 4444e79e46 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
qiaolongfei 6dd3f3cf27 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei 16658f7b59 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
chengduo 183377f410 Merge pull request #11306 from chengduoZH/enable_cpu_on_pe
7 years ago
qiaolongfei 83a577e8ce fix build problem
7 years ago
dzhwinter bfa3fd6f15 add inplace attribute to op_proto_maker (#10665)
7 years ago
gongweibao 9087c6687f polish (#11363)
7 years ago
chengduoZH 173d72b481 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enable_cpu_on_pe
7 years ago
chengduoZH aadaadf735 replace use_event with use_cuda, because use_event means the program running with CUDA, so use_cuda maybe more intuitive.
7 years ago
gongweibao 627d7a64f8 Clean `sendop` `recv` operator. (#11309)
7 years ago
chengduo fa29ef0b0d Merge pull request #11277 from chengduoZH/check_ssa_graph
7 years ago
chengduoZH 961fbce8e2 follow comments
7 years ago
chengduoZH 7b723839ef Add cpu test for parallel_executor_crf executor_fetch_feed, and enable these tests
7 years ago
chengduoZH d24e046c1e fix allReduce bug
7 years ago
chengduoZH a57e8a4338 add cpu test
7 years ago
qiaolongfei 0485405b3d add more debug string
7 years ago