Commit Graph

6 Commits (569951c418fb3c9f82cbdde9fda3910cc7033bff)

Author SHA1 Message Date
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
Yiqun Liu 7e463c84a6
Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979)
6 years ago
Yiqun Liu 5782dddad0
Optimize the concat and split kernel for specical cases when the number of inputs/outputs is 2 (#17415)
6 years ago
chengduo b9fb03cf54
Move GetTensor to tensor_util (#15011)
7 years ago
chengduo 79bd6dfa18
[Feature] Add Temporary Allocator (#14875)
7 years ago
chengduo a7497653d0
Refine Split op (#13967)
7 years ago