Commit Graph

28 Commits (8ebffc78c9f999759a35921c71b83226200d8561)

Author SHA1 Message Date
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
Zeng Jinle b754700fb5
fix reduce and broadcast to avoid multi-stream, test=develop (#19889)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
tensor-tang e1c707fe9c
fix warnings (#15790)
6 years ago
gongweibao 7cd4dd7ce4
Hide varhandle members. (#15382)
6 years ago
Yu Yang c00e07cda0 Fix distribute compile
6 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
6 years ago
gongweibao f1fb64b17f
Add reduce sparse tensor feature. (#14757)
6 years ago
peizhilin 7c8c9dc9bf fix unit test cases
6 years ago
chengduo ed087f8232
refine op_handle (#14178)
6 years ago
Yancey1989 bad4ea192e update by comment
7 years ago
Yancey1989 5ce1a960a5 move bcast op into pass
7 years ago
chengduo 97a77512b4
Fix the order of sum (#12562)
7 years ago
Xin Pan caf10b474f make profiler use thread_id from g_thread_id
7 years ago
chengduo da556ed6d4
enhance ParallelExecutor stable (#11637)
7 years ago
chengduoZH c99fca5f90 Add No Mutex
7 years ago
chengduoZH 830532213a extract method from broadcast::RunImpl
7 years ago
chengduoZH 9eec2c7509 refine pe
7 years ago
chengduoZH 881e063ee2 follow comments
7 years ago
chengduoZH 7722baa8e3 follow comments and clean code
7 years ago
chengduoZH 5ff1ef36ee update sparse parameter
7 years ago
chengduoZH 9a4ae4df79 fix scope of gather broadcast
7 years ago
chengduoZH e63013a86f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_reduce_op_handle
7 years ago
chengduoZH e4de957f19 code refine
7 years ago
chengduoZH 3c5bbf42c4 make unit test to work
7 years ago
chengduoZH e39adc8600 add reduce op handle
7 years ago