Commit Graph

1214 Commits (b854d959a543ee83e89a77d0627fb375bf0f9ba1)

Author SHA1 Message Date
Xin Pan a9217031ba small fix
7 years ago
Michal Gallus 4a7f0698e0 Add consts to new MKLDNN integration
7 years ago
Michal Gallus 6588d0e039 Update MKLDNN to 0.15, fix conv integration
7 years ago
Qiao Longfei 03d4c7efd3
add rw lock test (#12752)
7 years ago
Qiao Longfei 653fad08f8
Optimize selected rows for dist lookup table with pthread rwlock (#12635)
7 years ago
Qiao Longfei 64d48f4d6a
fix mac compile (#12751)
7 years ago
Qiyang Min 340a104c58
Merge pull request #12658 from velconia/port_pybind11
7 years ago
Chen Weihang d4d8f83137
Merge pull request #12633 from chenwhql/demangle_type_name
7 years ago
gongweibao d06849305a
parameter dispather. (#12666)
7 years ago
minqiyang a32ce8c444 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
Yan Chunwei 5d2834fcf7
fea/ir support fuse, based on graph pattern detection helper (#12636)
7 years ago
minqiyang 0d7047ca79 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
yuyang18 265302edea Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fast_executor
7 years ago
fengjiayi f276006f0c
Merge pull request #12694 from JiayiFeng/dev_op_tensor_support
7 years ago
Yu Yang a197737c02
Merge pull request #12690 from reyoung/feature/better_exception_holder
7 years ago
Xin Pan d96ee24f0b
Merge pull request #12697 from panyx0718/ir2
7 years ago
minqiyang 77f12e000f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
fengjiayi d6b5302bd6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support
7 years ago
gongweibao 842fb021b3
Fix clone() bug. (#12583)
7 years ago
chenweihang 7797e55f42 use paddle::platform::demangle
7 years ago
Xin Pan 891c3c0f9a test and doc IR Graph
7 years ago
minqiyang 7e0f66e99a Polish code
7 years ago
minqiyang 5338417b47 Polish code style
7 years ago
minqiyang ae39709e59 Polish code
7 years ago
minqiyang 55d7f55c63 Revert the changes to attribute.h
7 years ago
fengjiayi 5e7aa8c7e5 code clean
7 years ago
chenweihang 21d5b94228 error message refine: add demangle api to attribute type
7 years ago
yuyang18 05cadf1b24 Add FastExecutor
7 years ago
yuyang18 c6eb7a89ff Merge branch 'feature/better_exception_holder' into feature/fast_executor
7 years ago
yuyang18 aac80ef4cc Polish API of exception holder
7 years ago
yuyang18 d49763a87d Stash
7 years ago
minqiyang 68b221401d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
Xin Pan 6b45c5a134
Merge pull request #12605 from panyx0718/ir
7 years ago
minqiyang e4057d071b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
chengduo 64824ac73f
Add write after write dependence (#12632)
7 years ago
minqiyang 09103084d3 Polish compat.py and add unittest for it
7 years ago
Xin Pan 626abfc33a code clean up and renaming
7 years ago
Qiao Longfei c1446342ff
Merge pull request #12577 from jacquesqiao/optimize-vlog-before-and-after-op-run
7 years ago
minqiyang c3fdf3aee4 Fix divide problem in CI
7 years ago
fengjiayi 24d51de022 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support
7 years ago
fengjiayi 27df3a9f2b make cross_entropy_op supporting tensors
7 years ago
Yu Yang c4f8afa258 Fix bug when memopt optimize loss.grad and use ParallelExecutor
7 years ago
qiaolongfei b4d48531e4 optimize vlog before and after op run, move into op.run
7 years ago
minqiyang 6abe819f07 Fix pybind11 problem
7 years ago
chengduo 97a77512b4
Fix the order of sum (#12562)
7 years ago
qiaolongfei 1623f1ba4f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-profiler
7 years ago
qiaolongfei 903b2c0162 optimize code
7 years ago
Xin Pan 99a77cfc62
Merge pull request #12468 from panyx0718/improve_profiler2
7 years ago
qiaolongfei a3f9d6a38c optimize profiler
7 years ago
minqiyang f9ef0ee8a9 Polish code
7 years ago
minqiyang c4d000a990 Make code more efficient
7 years ago
minqiyang 9812bb8b48 Fix pserver can NOT start with DebugString problem
7 years ago
dzhwinter 0c8fde7dce
"cherry picked cpp tests" (#12182)
7 years ago
dzhwinter 595a2c83ae
explicit gradient of elementwise_add/elementwise_sub (#11970)
7 years ago
nhzlx a2749adf5d fix warning
7 years ago
Xin Pan caf10b474f make profiler use thread_id from g_thread_id
7 years ago
fengjiayi bc1b7b96ec Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_exception_holder
7 years ago
qiaolongfei c8c8c01a23 fix mac build of graph_executor
7 years ago
fengjiayi 3e4083ed1f Make exception handling of threaded_ssa_graph_executor an independent class
7 years ago
Xin Pan 4b8ae523c4
Merge pull request #12367 from panyx0718/ir_pass
7 years ago
qiaolongfei 3033841b4a CheckTensorNANOrInf support checking SelectedRows
7 years ago
Xin Pan 02c31458bb
Merge pull request #12417 from panyx0718/add_dist_deps
7 years ago
Xin Pan 25706d0868 properly set up dep of concat and fetch_bar
7 years ago
minqiyang e96fef2cf7 Fix inference api impl deps
7 years ago
Xin Pan 398cfb47b1 disable dist_se_resnext since it's not stable yet.
7 years ago
Xin Pan 5fff8d7a55 add distributed training deps.
7 years ago
fengjiayi ec4c6e1f7c
Merge pull request #12384 from JiayiFeng/dev_update_save_inference_model
7 years ago
Xin Pan 99c0c20468 add pass test
7 years ago
fengjiayi 604bd85a45 update inference_optimize()
7 years ago
Xin Pan ab72d28a5e clean up and correctness check
7 years ago
Xin Pan aa1085ddc5 all passes
7 years ago
Xin Pan e4d7d7ae8f pass refactoring
7 years ago
Xin Pan 142e832d21 pass registration
7 years ago
Xin Pan 5b183557f3 graph viz pass
7 years ago
qiaolongfei 64e7902530 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into parallel-executor-support-prefetch
7 years ago
Xin Pan d7e08c53c2
Merge pull request #12169 from panyx0718/ir_graph_sort
7 years ago
qiaolongfei e7eeb19f90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into parallel-executor-support-prefetch
7 years ago
Qiao Longfei 2d21aa76c7
Merge pull request #12331 from jacquesqiao/fix-mixed-tensor
7 years ago
qiaolongfei 754e96a30c distribute lookup table work with parallel executor
7 years ago
qiaolongfei 65e5aebd43 fix mixed_vector_test
7 years ago
qiaolongfei da035fc674 remove explicit for compile problem
7 years ago
qiaolongfei 5022b14de8 fix mixed tensor compile and add cpu unit test
7 years ago
Yancey1989 fb06ed7bdc Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
Yan Chunwei 02cf54d331
bugfix lod cpu performance (#12297)
7 years ago
Xin Pan 5173a53c8a fix reorder issue.
7 years ago
Xin Pan 21a45420f0 polish and test
7 years ago
Xin Pan 2782e71a11 fix graph_test
7 years ago
Xin Pan 93355cc0d2 fix control deps
7 years ago
Xin Pan f6d99d1f73 polish
7 years ago
Xin Pan c3f6e0e8a2 add namespace to Graph
7 years ago
Xin Pan 0b3465d215 better
7 years ago
Xin Pan dcaf183daa builder SSA graph at the beginning.
7 years ago
Yancey1989 d4f51218ef Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
Xin Pan 1a3b84369c disable warning
7 years ago
Yu Yang b06309381b
Merge pull request #12149 from reyoung/feature/combine_open_files_and_double_buffer
7 years ago
chengduo a3ac54b672
Fix Reduce functor (#12262)
7 years ago
Yu Yang 6c981e7d3c
Merge pull request #12259 from reyoung/feature/fix_serialize_deserialize_bug
7 years ago
Tao Luo 6fd75edae5 Fix var_handle CMake dependence (#12261)
7 years ago
yuyang18 47ad8d4909
Fix deserialize bug
7 years ago
Yancey1989 be772741cf compile with cpu
7 years ago
Dang 6f72b3b486 Fix ir CMake dependence.
7 years ago
Qiao Longfei 37713f2282
Merge pull request #12244 from jacquesqiao/fix-mac-build
7 years ago
yuyang18 8c3cd4204e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/combine_open_files_and_double_buffer
7 years ago
qiaolongfei 0e30c9d6fb fix mac build
7 years ago
Yancey1989 efd5a84986 update executor interface
7 years ago
yuyang18 7cd5f906b7
Correct IR dependencies
7 years ago
yuyang18 7268760fbc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/combine_open_files_and_double_buffer
7 years ago
Yancey1989 7570d8e77c add rpc complete interface
7 years ago
Xin Pan 950585f419 follow comments
7 years ago
Xin Pan da5efa735a fix
7 years ago
Xin Pan 5daad16218 polish
7 years ago
Xin Pan ff5a7b67ed polish
7 years ago
Xin Pan 62e2aa115d add a graph_test
7 years ago
Xin Pan 9c9e28b57b fix program to graph
7 years ago
Xin Pan 64eaa4c829 clean
7 years ago
Xin Pan 10786a243e polish graph
7 years ago
Xin Pan 2fa8df1caf separate graph building pass and graph-based pe builder
7 years ago
Xin Pan 37e514432b op compose node and update nodes.
7 years ago
Xin Pan 9605fcd124 all graphs
7 years ago
Xin Pan af79b19207 add a simple program to graph
7 years ago
Xin Pan 7231ef6b68 tmp
7 years ago
Xin Pan 68aa500451 polish attrs
7 years ago
Xin Pan 9b9603306c graph attrs
7 years ago
Xin Pan 2eeaa8d5cf Graph in ParallelExecutor Builder
7 years ago
Xin Pan 7781297c70 variants
7 years ago
Xin Pan fcda23a3e4 simple node
7 years ago
Xin Pan 08acc03522 start
7 years ago
yuyang18 be528f9815
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/combine_open_files_and_double_buffer
7 years ago
yuyang18 72b78154b2
Polish reader speed
7 years ago
Yancey 0042ba93c8
Merge pull request #12127 from Yancey1989/enforce_rpc_timeout
7 years ago
chengduo 325fbc4f1b
Add learning rate decay test (#12124)
7 years ago
yuyang18 b8975d6842
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/combine_open_files_and_double_buffer
7 years ago
yuyang18 d36e13efd8
Merge branch 'feature/add_pyreader_demo' into feature/combine_open_files_and_double_buffer
7 years ago
yuyang18 c680bc1d7f
Rewrite DoubleBuffer
7 years ago
chengduo 86b0a72576
Refine multi thread cpu parallel exe (#11406)
7 years ago
yuyang18 8c2eba71e2
Refine demo
7 years ago
Yu Yang ebe3b5e78a
Merge pull request #11853 from sneaxiy/complete_py_reader_python
7 years ago
Yancey1989 d14afcedeb polish function name
7 years ago
Yancey1989 9de096bd58 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into fix_cpu_pe
7 years ago
Yancey1989 1effba3312 fix pe with cpu place
7 years ago
sneaxiy 9153223ab3 Merge branch 'complete_py_reader_python' of https://github.com/sneaxiy/Paddle into complete_py_reader_python
7 years ago
sneaxiy df7ecbf5a1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into complete_py_reader_python
7 years ago
sneaxiy 31c7f6b968
Merge branch 'develop' into complete_py_reader_python
7 years ago
sneaxiy de638f8112 Merge branch 'complete_py_reader_python' of https://github.com/sneaxiy/Paddle into complete_py_reader_python
7 years ago
fengjiayi 335e4b9d52 fix a dead lock bug
7 years ago
sneaxiy f85e16f1de Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into complete_py_reader_python
7 years ago
yuyang18 c822d0309b
Refine code
7 years ago
yuyang18 7d6afee5c1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exception_safe_pe
7 years ago
yuyang18 0a445da631
Make scope_buffered_ssa_graph_executor Exception safe
7 years ago
fengjiayi 26ae6111d1
Merge pull request #12051 from JiayiFeng/dev_reader_ResetAll
7 years ago
fengjiayi d55919c656 Impl ResetAll and fix errors
7 years ago
gongweibao 091ab63331
Fix singleton. (#11835)
7 years ago
fengjiayi 6d6f49cd56 Merge remote-tracking branch 'yuyang/feature/decorated_reader_chain' into dev_reader_ResetAll
7 years ago
fengjiayi 611716e9bc Merge branch 'dev_reader_shutdown_start' of https://github.com/JiayiFeng/Paddle into dev_reader_shutdown_start
7 years ago
fengjiayi 0e9f1e2790 Make ReaderBase thread safe and remove ThreadedReader
7 years ago
yuyang18 e8ee9dc7f8
Several Polish
7 years ago
yuyang18 0d2ccfbd3c
Remove atomic
7 years ago
fengjiayi b4f0e57956 fix errors
7 years ago
yuyang18 62c1133f42
Add mutex for decorated_chain
7 years ago
Tao Luo 436bb4500b
Merge pull request #11699 from pzelazko-intel/pzelazko/workaround-for-missing-mklnn-kernels
7 years ago
fengjiayi 5528f59900 Split ReInit() to Shutdown() and Start()
7 years ago
fengjiayi de9a411f1c adjust readers' inheritance relationships
7 years ago
yuyang18 c48c586aca
Use weak_ptr to implement DecoratedReaderChain
7 years ago
yuyang18 2bbe5f77e7
Add GetEndPoints of Reader.
7 years ago
Paweł Żelazko 05ffc1331e log for fallback added
7 years ago
Yu Yang de456e74ae
Merge pull request #11913 from reyoung/feature/remove_clone_method
7 years ago
yuyang18 8e86721fe7
Fix data balance on single GPU
7 years ago
dzhwinter 4ed0b62476
Move fluid::framework::InitDevices into fluid::platform (#11757)
7 years ago
fengjiayi ce16b40b04
Merge pull request #11891 from JiayiFeng/dev_eof_exp
7 years ago
yuyang18 4e4438a8aa
Remove Op::Clone method
7 years ago
yuyang18 2d0e5592b5
Use std::map for Place <--> DeviceContext
7 years ago
Yancey f7fd711e3f
Merge pull request #11868 from Yancey1989/dist_pass_barrier
7 years ago
Yu Yang 9401b64d61
Merge pull request #11877 from reyoung/feature/fix_reshape_op_size
7 years ago
fengjiayi 3fab4f65a4 Add EOFException to represent EOF in C++ reader
7 years ago
Paweł Żelazko ac323343a0 typos fix
7 years ago
Paweł Żelazko bf0090a1ef workaround for missing MKLDNN kernel
7 years ago
yuyang18 82866d4a18
Add register kernel functor and shrink reshape op
7 years ago
fengjiayi 58560622bc
Merge pull request #11854 from JiayiFeng/dev_data_balance
7 years ago
yuyang18 75ae426aa5
Merge branch 'feature/change_op_kernel_to_func' into feature/fix_reshape_op_size
7 years ago
yuyang18 3b00ed81a9
Make Kernel registed as a function
7 years ago
yuyang18 1ce478f100
Polish reshape op
7 years ago
Yancey1989 37410a0c75 update by comment
7 years ago
sneaxiy 3f9292c6e6 fix merge conflict
7 years ago
sneaxiy dd70fb4393 fix type comparation bugs
7 years ago
fengjiayi ff4317cee9 follow comments
7 years ago
Yancey1989 c1ab215e26 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into dist_pass_barrier
7 years ago
Yancey1989 1366832a41 add dist pass barrier
7 years ago
gongweibao 66c91911cf
Improve brpccmake (#11842)
7 years ago
Yan Chunwei 28172bbb8e
add debug to replacing enforce with GLOG for debug (#11244)
7 years ago
guochaorong c318aa5ffa
Merge pull request #11850 from guochaorong/revert_11496
7 years ago
fengjiayi 49a04d75ee Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_data_balance
7 years ago
fengjiayi 4b950951d3 Add unittests and fix a few bugs
7 years ago
chengduo 7b54f16855
Follow comment (#11845)
7 years ago
Tao Luo b0378963d0
Merge pull request #11666 from mozga-intel/mozga-intel/Batch_norm_support_other_type
7 years ago
guochaorong 6a35899131 Revert "Extend fill_zeros_like_op for zero-filling an LoDTensorArray (#11496)"
7 years ago
gongweibao c2165ffa7b Fix codesytle (#11836)
7 years ago
chengduo 8d76cf397d
Fix TensorCopy bug (#11822)
7 years ago
fengjiayi ba5380129c Merge branch 'fix_Mac_compile_errors' into dev_data_balance
7 years ago
fengjiayi 077434c26c fix CPU compile error
7 years ago
fengjiayi 47388020a2 fix bugs
7 years ago
yuyang18 5e23a5ec18
Rename TransferData -> TransformData
7 years ago
yuyang18 5999201346
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refactor_op
7 years ago
chengduo 6711b7b5f1
fix FeedAndSplitTensorIntoLocalScopes (#11817)
7 years ago
fengjiayi 2e320079d3 fix bugs
7 years ago
fengjiayi 5b4f283069 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_data_balance
7 years ago
fengjiayi b6dc3a59f1 Add DataBalanceOpHandle to MultiDeviceSSAGragh
7 years ago
mozga-intel 61c54dbbe7 Remove additional function of the code
7 years ago
Haichao Zhang bc28cf613f Extend fill_zeros_like_op for zero-filling an LoDTensorArray (#11496)
7 years ago
yi.wu 8d04d0e2a3 update
7 years ago
yi.wu 6f0107126a fix broadcast bug
7 years ago
yuyang18 9faf5a39c5
Refactor Operator.cc, and clean code
7 years ago
yi.wu 88cb47bd86 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_paraexe_bcast
7 years ago
yi.wu 8e48c77b54 wip
7 years ago
yi.wu 3d69a82b83 fix dist train broadcasting bug
7 years ago
chengduo bb29800aaa
small refine (#11460)
7 years ago
Tomasz Patejko e26f51ce74 MKLDNN elementwis_add with default broadcast operations (#11544)
7 years ago
yi.wu 254154a9bf fix sparse paraexe dist train
7 years ago
Yancey 0d4b376fc0
Merge pull request #11585 from Yancey1989/fix_pserver_sub_blocks
7 years ago
Yancey1989 8cb494f79c add blocks attr type in proto
7 years ago
Yancey1989 56a903d3ac use optimize block list instead of first optimize block
7 years ago
fengjiayi e45a55523b
Merge pull request #11639 from JiayiFeng/fix_mac_compile
7 years ago
chengduo da556ed6d4
enhance ParallelExecutor stable (#11637)
7 years ago
Kexin Zhao 073af6237a
add print lod_tensor int64 option (#11644)
7 years ago
fengjiayi 964f515e9a fix mac compile
7 years ago
Yu Yang 9b3f48d7e6
Merge pull request #11616 from chengduoZH/fix_parallel_exe
7 years ago
chengduoZH c99fca5f90 Add No Mutex
7 years ago
chengduoZH 13de72388d Fix broadcast
7 years ago
Xin Pan 0151e4eb3d
Merge pull request #11608 from panyx0718/doc
7 years ago
chengduoZH 28a86aebc3 Fix Parallel Exe(VarHandel's version)
7 years ago
Yancey c413c46915
Merge pull request #11603 from Yancey1989/rename_details_folder
7 years ago
Yancey 9cc1eb434e
Merge pull request #11221 from Yancey1989/overlap_memcpy_with_dist
7 years ago
Xin Pan df31926fcf small thread-safety fix and doc improvements.
7 years ago
Yancey1989 1ef6cdb60e move dist codes from operaotrs/detail to operators/distributed
7 years ago
Qiao Longfei 05a92779f4
Merge pull request #11524 from jacquesqiao/add-keep_kids-for-executor
7 years ago
Yancey1989 7e6518e8ca fix compile warning
7 years ago
Yancey1989 7d1b146939 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Qiyang Min a29cb4be2a Fix decay bug (#11520)
7 years ago
qiaolongfei 2b1ecdf55a update comment
7 years ago
qiaolongfei daa0fbd5f4 add keep_kids flag for executor
7 years ago
tensor-tang 609dccfb55
Merge pull request #11395 from tensor-tang/fix
7 years ago
Kexin Zhao 417fcf4f43
Modify Pybind LoDTensor API according to length-based LoD (#11106)
7 years ago
tensor-tang 0ddc5d8631
Merge pull request #11258 from tensor-tang/refine
7 years ago
tensor-tang 6c1cf60950 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang c453573286 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang 3e58df20df initial with only 1 mkl/openblas threads for each pthreads
7 years ago
Qiyang Min 046bb5c8cb Fix NCCLBcast hang up bug in Parallel Executor (#11377)
7 years ago
tensor-tang 85c3bfc1f9 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
Yancey1989 3d875b693f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Yang Yang(Tony) d827c6e87a
Dynamic Graph first prototype (#11415)
7 years ago
qiaolongfei 7ebef493d5 add row_size for selected rows in DebugStringEx
7 years ago
qiaolongfei cdd55dbcaa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei 82416f1844 fix concurrency_test build error on mac
7 years ago
qiaolongfei 9ebbfa6bbc fix build on mac
7 years ago
Yancey1989 f52d78d189 update by comment
7 years ago
fengjiayi 88fa9c2ec7
Merge pull request #11267 from JiayiFeng/fix_reader_bug
7 years ago
tensor-tang 6602db5b3e throw warning if try to use mkldnn while not compiled
7 years ago
Yancey1989 6d752bafd8 use get_appropriate_dev to schedule rpc op
7 years ago
qiaolongfei f031555cfb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
Qiao Longfei 431491a2a8
Merge pull request #11366 from jacquesqiao/refine-prefetch
7 years ago
Wu Yi 34865f2de3
Trainer send term signal (#11220)
7 years ago
Yancey1989 4444e79e46 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
qiaolongfei 6dd3f3cf27 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei 16658f7b59 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
chengduo 183377f410
Merge pull request #11306 from chengduoZH/enable_cpu_on_pe
7 years ago
qiaolongfei 83a577e8ce fix build problem
7 years ago
dzhwinter bfa3fd6f15
add inplace attribute to op_proto_maker (#10665)
7 years ago
gongweibao 9087c6687f
polish (#11363)
7 years ago
chengduoZH 173d72b481 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enable_cpu_on_pe
7 years ago
chengduoZH aadaadf735 replace use_event with use_cuda, because use_event means the program running with CUDA, so use_cuda maybe more intuitive.
7 years ago
gongweibao 627d7a64f8
Clean `sendop` `recv` operator. (#11309)
7 years ago
chengduo fa29ef0b0d
Merge pull request #11277 from chengduoZH/check_ssa_graph
7 years ago
chengduoZH 961fbce8e2 follow comments
7 years ago
chengduoZH 7b723839ef Add cpu test for parallel_executor_crf executor_fetch_feed, and enable these tests
7 years ago
chengduoZH d24e046c1e fix allReduce bug
7 years ago
chengduoZH a57e8a4338 add cpu test
7 years ago
qiaolongfei 0485405b3d add more debug string
7 years ago
Luo Tao 79d555b9f2 Merge branch 'develop' into mkldnn
7 years ago
gongweibao 062d5a56b4
Add comments to a singleton. (#11333)
7 years ago
chengduoZH 1e731f5964 small fix
7 years ago
chengduoZH 495368c243 ADD CPU_NUM
7 years ago
chengduoZH 27073c284d nccl_all_reduce_op_handle => all_reduce_op_handle
7 years ago
chengduoZH 2d94697a82 code refine
7 years ago
chengduoZH 5a3c8bf813 fix in c++ side
7 years ago
Luo Tao c6d230e03e add FLAGS_use_mkldnn to global control use_mkldnn
7 years ago
fengjiayi d745840a6e fix a small compile error on Mac
7 years ago
chengduoZH 0c851cab22 add SSA graph checker
7 years ago
Yancey1989 d5a88b9340 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
chengduoZH 1076e85135 refine logic
7 years ago
Yancey 0aa9546eed
fix dist train error (#11281)
7 years ago
tensor-tang b8d315fb69 make scope thread safe
7 years ago
dzhwinter d48172f22a
split reduce op into multiple libraries, accelerate the compiling (#11029)
7 years ago
fengjiayi 5803115720
Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext
7 years ago
tensor-tang bfd42683ca Merge remote-tracking branch 'ups/develop' into refine
7 years ago
dzhwinter f7c96f079b
Big data op_test benchmark, for checking output consistent in different runs. (#10646)
7 years ago
fengjiayi dc8e0b494d fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
mozga-intel 3ff9ba0e6b Mkldnn layout (#11040)
7 years ago
chengduoZH 8291b916d6 replace graph_builder_factory with ssa_graph_builder_factory
7 years ago
chengduoZH 9ac785be39 check graph's validation
7 years ago
fengjiayi a1e046bfc0
Merge pull request #11270 from JiayiFeng/fix_a_error_on_max
7 years ago
fengjiayi 499dbe0536 fix a multi-thread bug in readers
7 years ago