wilfChen
|
0a1195ddf5
|
broadcast kernel support unequal dims & half
|
5 years ago |
ZPaC
|
d3936b9f2a
|
GPU kernels adapt with special dimensions.
|
5 years ago |
wilfChen
|
1eb60df5d4
|
gpu support logsoftmax & logsoftmaxgrad kernel
|
5 years ago |
mindspore-ci-bot
|
f602970990
|
!323 Gpu Concat support 4 inputs
Merge pull request !323 from chenweifeng/concat
|
5 years ago |
mindspore-ci-bot
|
4e25fec769
|
!324 Gpu Slice kernel performance improve
Merge pull request !324 from chenweifeng/slice
|
5 years ago |
mindspore-ci-bot
|
378a7122a5
|
!372 Gpu support BatchMatMul kernel
Merge pull request !372 from chenweifeng/batchmatmul
|
5 years ago |
mindspore-ci-bot
|
97d21ba014
|
!502 Gpu Support Gelu & GeluGrad
Merge pull request !502 from chenweifeng/gelu
|
5 years ago |
mindspore-ci-bot
|
a97f30ba7d
|
!516 Gpu support Tanh & TanhGrad kernel
Merge pull request !516 from chenweifeng/tanh
|
5 years ago |
mindspore-ci-bot
|
38c56fd1a5
|
!945 gpu queue support Sqrt & Rsqrt kernel
Merge pull request !945 from chenweifeng/unary
|
5 years ago |
wilfChen
|
a304304c30
|
gpu support Gelu & GeluGrad kernels
|
5 years ago |
wilfChen
|
311bf41e6d
|
gpu support tanh & tanhgrad kernel
|
5 years ago |
wilfChen
|
67a0cc3bf1
|
gpu queue support unary
|
5 years ago |
wilfChen
|
16f0688230
|
gpu support broadcast kernels
|
5 years ago |
mindspore-ci-bot
|
0611c1a579
|
!849 [CT][MS] Wrong format of broadcast output, when multi-output
Merge pull request !849 from vlne-v1/I1FQ76-wrong-format-of-boardcast-output-when-multi-output
|
5 years ago |
Wei Luning
|
157710ca0f
|
bugfix
* fix bug in output tuple of tuple.
* check kRWWrite input no-variable
* input x of ScatterNdUpdate should be a parameter node
|
5 years ago |
mindspore-ci-bot
|
8c035a5171
|
!756 Gpu support LayerNorm kernel
Merge pull request !756 from chenweifeng/layer_norm
|
5 years ago |
dinghao
|
f77de54aa4
|
fix tensor dirty
|
5 years ago |
wilfChen
|
53b4529558
|
Gpu support LayerNorm kernel
|
5 years ago |
VectorSL
|
4740c70fc3
|
gpu add testcases
|
5 years ago |
mindspore-ci-bot
|
728801301c
|
!313 GPU add akg kernel float_status
Merge pull request !313 from VectorSL/float_status
|
5 years ago |
VectorSL
|
c000fb2f34
|
gpu add float_status kernel
|
5 years ago |
wilfChen
|
9a7702b807
|
gpu support batchmatmul kernel
|
5 years ago |
mindspore-ci-bot
|
87be386581
|
!314 GPU add kernel assign
Merge pull request !314 from VectorSL/assign
|
5 years ago |
wilfChen
|
cc93646207
|
gpu concat kernel support 4 inputs
|
5 years ago |
wilfChen
|
5b7790a2a7
|
Gpu Slice kernel performance improvement
|
5 years ago |
VectorSL
|
9e372073e2
|
gpu add assign
|
5 years ago |
VectorSL
|
d248b05a98
|
gpu add kernel select
|
5 years ago |
mindspore-ci-bot
|
94589ce611
|
!226 extend conv stride and dilation to 2d
Merge pull request !226 from wangnan39/expend_conv_stride_to_2d
|
5 years ago |
chenzomi
|
652ab6c386
|
add test case for aware quantization
|
5 years ago |
wangnan39@huawei.com
|
2604acedcb
|
extend conv stride and dilation to 2d
|
5 years ago |
VectorSL
|
aea6b0c974
|
update tests/st/ops/gpu/test_tensoradd.py.
fix pytest.mark for testcase
|
5 years ago |
zhunaipan
|
930a1fb0a8
|
initial version
Signed-off-by: leonwanghui <leon.wanghui@huawei.com>
|
5 years ago |