Commit Graph

11 Commits (f8da5536edaa004fd42988539508f6810a2fe958)

Author SHA1 Message Date
Chen Weihang 1a304e6c06
[Complex] Add support for complex grad accumulated (#29889)
4 years ago
Zhong Hui a85592bcbf
fix cpplint error for the autmic max/min
4 years ago
Zhong Hui 597345d17b
fix cuda atomic for ARCH<350 for the automic_max
4 years ago
Zhong Hui 4a9d21de49
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
4 years ago
dzhwinter 6d3da458a7
Fix/float16 style (#12446)
7 years ago
dzhwinter 39ac9e39c2
float16 type support enhance (#12181)
7 years ago
chengduoZH 0cc635497c merge develop
7 years ago
chengduo 4fbde42cdf Fix __shfl_down_sync_ of cross_entropy (#10345)
7 years ago
chengduoZH b8f7fa97b6 replace __shfl with __shfl_sync
7 years ago
chengduoZH 90d73c79c3 fix shfl_sync for CUDA8.0
7 years ago
dzhwinter eb6f9dd5de
Feature/cuda9 cudnn7 (#10140)
7 years ago