Commit Graph

10 Commits (eb37ee2a26e8ff1ceaf1058f1a332885ce239ab0)

Author SHA1 Message Date
Zhong Hui a85592bcbf
fix cpplint error for the autmic max/min
5 years ago
Zhong Hui 597345d17b
fix cuda atomic for ARCH<350 for the automic_max
5 years ago
Zhong Hui 4a9d21de49
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
5 years ago
dzhwinter 6d3da458a7
Fix/float16 style (#12446)
7 years ago
dzhwinter 39ac9e39c2
float16 type support enhance (#12181)
7 years ago
chengduoZH 0cc635497c
merge develop
8 years ago
chengduo 4fbde42cdf
Fix __shfl_down_sync_ of cross_entropy (#10345)
8 years ago
chengduoZH b8f7fa97b6
replace __shfl with __shfl_sync
8 years ago
chengduoZH 90d73c79c3
fix shfl_sync for CUDA8.0
8 years ago
dzhwinter eb6f9dd5de
Feature/cuda9 cudnn7 (#10140)
8 years ago