Commit Graph

22 Commits (7a3350dd31a27e6c1277ede4104cd2eb1adc8320)

Author SHA1 Message Date
yuyang18 d49763a87d Stash
7 years ago
Xin Pan 64eaa4c829 clean
7 years ago
Xin Pan 37e514432b op compose node and update nodes.
7 years ago
yuyang18 2d0e5592b5
Use std::map for Place <--> DeviceContext
7 years ago
chengduo da556ed6d4
enhance ParallelExecutor stable (#11637)
7 years ago
chengduoZH c99fca5f90 Add No Mutex
7 years ago
chengduoZH aadaadf735 replace use_event with use_cuda, because use_event means the program running with CUDA, so use_cuda maybe more intuitive.
7 years ago
chengduoZH a584bc86dd add fuse var op handle
7 years ago
Yu Yang 14248a64d7 Fix hang when input is duplicated (#10709)
7 years ago
chengduoZH 830532213a extract method from broadcast::RunImpl
7 years ago
chengduoZH 9eec2c7509 refine pe
7 years ago
Xin Pan 98c12b1a08 Clean up C++ codes. (#10022)
7 years ago
Yu Yang 79be06045c Support CPU/GPU mixture for ParallelExecutor
7 years ago
Yu Yang 6b20b35589 Fix Transformer Hang Problem
7 years ago
Yu Yang ad73b331c7 Eagerly drop local scope in iteration (#9838)
7 years ago
Xin Pan be1373dcf9 Polish
7 years ago
Xin Pan d0ac92531d Improve ParallelExecutor performance
7 years ago
Yu Yang 084cdd1f4f Rename code
7 years ago
Yu Yang 9af870854e Use heap variables
7 years ago
Yu Yang 5c7a523326 Add Graphviz output
7 years ago
Yu Yang 15f5f10ed5 AddInput/AddOutput for OpHandle
7 years ago
Yu Yang fe7ed285d1 Extract NCCLCtxMap
7 years ago