Commit Graph

532 Commits (ac674a41b0f692dfbcf2411585f8c68167d1a9f7)

Author SHA1 Message Date
TFbunny 7f769568a6 Register SparseGatherV2 and add dynamic shape support
4 years ago
danishnxt 9dba65c424 TensorDot Conv: P -> C
4 years ago
mindspore-ci-bot 35dd47711d !7798 ROIAlign GPU kernel height/width plus1
4 years ago
mindspore-ci-bot 7622b44c21 !8170 Support Nd*2d for nn.Dense
4 years ago
HuangBingjian 265a6d61b6 reverse fix shape=0
4 years ago
mindspore-ci-bot c76e7107a5 !8220 Add supports to op unpack on gpu
4 years ago
mindspore-ci-bot 370e7ab95f !8204 add supports to op pack on gpu
4 years ago
wanyiming 3c7a3b6693 modify_dense
4 years ago
zhouyuanshen 5c4885821f add supports to op unpack on gpu
4 years ago
zhouyuanshen 00f1c0c7f3 add supports to op pack on gpu
4 years ago
HuangBingjian 4477fcfe19 fix shape bug.
4 years ago
jonwe 575280bb61 roi end mode
4 years ago
mindspore-ci-bot 4b4ca1a188 !8146 Improve performance for GPU-ScatterUpdate, add int32 support
4 years ago
mindspore-ci-bot 04142b74f6 !8144 Rename GPU-UniformSampler to GPU-UniformCandidateSampler
4 years ago
mindspore-ci-bot 4c2344ed35 !8141 Improve performance for GPU-ScatterAdd, add use_locking and add int32 support
4 years ago
TFbunny cd7329082e refine GPU-Scatteradd
4 years ago
mindspore-ci-bot 769bd24c63 !8002 Add expm1 op at GPU back-end
4 years ago
TFbunny 356844da46 refine GPU-ScatterUpdate
4 years ago
TFbunny ee4e2db77e rename UniformSampler to UniformCandidateSampler
4 years ago
mindspore-ci-bot cfe226fc97 !7950 new add softplus and softplus grad gpu ops.
4 years ago
mindspore-ci-bot ccbc6df79c !8068 add supports to op gathergrad on gpu
4 years ago
zhouyuanshen 21c96b3c31 add supports to op gathergrad on gpu
4 years ago
mindspore-ci-bot b64de9c1dc !8000 Add supports to op Gather on gpu
4 years ago
mindspore-ci-bot 497f2f0cff !8054 Add gpu support for ScatterUpdate
4 years ago
mindspore-ci-bot de20d3e488 !8006 [MS][GPU][CUDA][API] Adding new Ops - UnsortedSegmentMax GPU
4 years ago
TFbunny cd9d0bf79a add GPU-ScatterUpdate
4 years ago
mindspore-ci-bot 93c11d1dcc !8008 Add gpu support to ScatterAdd
4 years ago
mindspore-ci-bot 4f4eadda8f !8003 Repeat Elements Grad GPU Kernel
4 years ago
peixu_ren b8abcf858a Add expm1 op at GPU back-end
4 years ago
zhouyuanshen f0f67b8aa8 add gather op on gpu
4 years ago
TFbunny dc2144ee0d add GPU-ScatterAdd
4 years ago
danishnxt 34cc178bd0 New New UnsortedSegmentMax for GPU [API][CUDA_KERNEL]
4 years ago
jonwe e896d38c34 repeat grad
4 years ago
mindspore-ci-bot 542a52fbf8 !7873 [MS][GPU] Add GPU SparseApplyFtrl op
4 years ago
mindspore-ci-bot 851ee9c3d5 !7875 Add support to GPU SparseApplyProximalAdagrad
4 years ago
linqingke 6dc3618758 new add softplus and softplus grad gpu ops.
4 years ago
mindspore-ci-bot 9c2704030f !7872 Add log1p operator at GPU back-end and move erf and erf to the unary_op list
4 years ago
mindspore-ci-bot 5ef3d02ce3 !7696 Add calculation of triangle matrix determinant op at GPU back-end
4 years ago
TFbunny a638973378 add GPU SparseApplyProximalAdagrad
4 years ago
tom__chen 71b235c302 sparse apply ftrl gpu kernel
4 years ago
mindspore-ci-bot 4ebc55b29e !7841 Add supports to ACosGrad and AsinGrad on gpu
4 years ago
mindspore-ci-bot f3e8798b40 !7879 fix bug that asin/acos not support fp16 on gpu
4 years ago
mindspore-ci-bot ea4df3c3c2 !7826 new add l2normalize_grad gpu ops.
4 years ago
zhouyuanshen 1804c042ec fix bug that asin/acos not support fp16 on gpu
4 years ago
zhouyuanshen 8481fd59d8 Add supports to ACosGrad and AsinGrad
4 years ago
peixu_ren dfe5a951eb Add log1p operator at GPU back-end and move erf and erf to the unary_op list
4 years ago
peixu_ren 99f2927c21 Add calculation of triangle matrix determinant op at GPU back-end
4 years ago
linqingke 3465c9c400 new add l2normalize_grad gpu ops.
4 years ago
mindspore-ci-bot 5b28016b4d !7735 fix gpu multinomial seed issue and GRAPH_MODE
4 years ago
mindspore-ci-bot d479b91093 !7767 GPU update resnet50 readme and add cast type
4 years ago
mindspore-ci-bot deb17b36c1 !7786 Add SampledSoftmaxLoss GPU Kernel
4 years ago
mindspore-ci-bot 5ad25494ab !7736 [MS][GPU][CUDA] - Reworked MirrorPad / MirrroPadGrad Op to add support for 3d/4d padding + Updated ST + added support for int32 in all GPu paddings
4 years ago
jonwe fc81f46053 sampled softmax loss
4 years ago
mindspore-ci-bot a3af89bd48 !7611 [MS][GPU] Adding new Ops - TensorDot and TensorDot Grad
4 years ago
VectorSL 5102482e3a 1readme update resnet 2cast add more type
4 years ago
mindspore-ci-bot b6b254f6e4 !7670 add l2normalize gpu kernel.
4 years ago
danishnxt 0f69be06b1 Reworked mirrorPad
4 years ago
baihuawei 05dcec0583 fix multinomial
4 years ago
linqingke 99480d26c6 add l2normalize gpu kernel.
4 years ago
danishnxt 06a9b4aa37 Commit1_TensorDot
4 years ago
mindspore-ci-bot 0d5e119fa4 !7483 Add GPU-UniformSampler and nn.UniformCandidateSampler
4 years ago
mindspore-ci-bot 2c468a358a !7537 new gpu op for cbg: repeat_elements
4 years ago
Peilin Wang bd0b462691 new gpu op for cbg repeat_elements
4 years ago
TFbunny e18a78feb9 add GPU UniformSampler
4 years ago
mindspore-ci-bot 6cc37db833 !7482 roialign gpu operator output is zero
4 years ago
mindspore-ci-bot e24b50f559 !7548 Support elu and elugrad with dtype float and half on gpu
4 years ago
zhouyuanshen 2ab0b3d67a support ops elu and elugrad on gpu
4 years ago
mindspore-ci-bot fd8ebde400 !7545 GPU reshape add type
4 years ago
VectorSL 5c0c363c66 gpu reshepe add type
4 years ago
jonwe 4583bc1c57 roialign zero output
4 years ago
zhouyuanshen f49bd92b88 support asin and acos with dtype float on gpu
4 years ago
wanyiming e2b8810413 1D*ND
4 years ago
mindspore-ci-bot 21c5607fca !6971 cudnn inplace optimizer
4 years ago
wilfChen b420b6cda7 cudnn inplace optimizer
4 years ago
baihuawei fea928e976 remove CTCLossV2
4 years ago
wilfChen 13a3a6d117 test case seed
4 years ago
mindspore-ci-bot d60033c8db !6381 Add dtype float16 that erf and erfc should support
4 years ago
wukesong f9a865fd42 add GPU operator
4 years ago
peixu_ren 8132e56417 Add dtype float16 that erf and erfc should support
4 years ago
mindspore-ci-bot 3f0ac45954 !6206 new add gpu ops sqrt_grad and rsqrt_grad.
4 years ago
linqingke dda3176fca new add sqrt_grad and rsqrt_grad.
4 years ago
mindspore-ci-bot b79240fc3f !6367 fix shape bug
4 years ago
caozhou 5010956773 fix weight_init bug
4 years ago
mindspore-ci-bot 71ab9e300b !6012 Support erf and erfc ant GPU backend
4 years ago
mindspore-ci-bot af5ebcf1a9 !6232 fix gpu heterogeneous bug
4 years ago
peixu_ren fdd2d8209f Support erf and erfc ant GPU backend
4 years ago
baihuawei 09a3f2ff5e fix GPU hete
4 years ago
mindspore-ci-bot eedbbb3649 !5958 Modify interface for function and remove duplicated def
4 years ago
lilei 71adabd944 modify_bug
4 years ago
Peilin Wang f020e19636 add int32 support to greater gpu kernel
4 years ago
mindspore-ci-bot 6fdb43d22d !5895 gpu maximum minimum absgrad kernel fix
4 years ago
mindspore-ci-bot 98725bc865 !5790 [MS][GPU][CUDA] Dedicated new user facing Pad API kernel
4 years ago
mindspore-ci-bot b717a686cf !5690 ROIAlign kernel memory leak
4 years ago
wilfChen 3b54e55223 gpu maximum & minimum kernel with fp16 input
4 years ago
wilfChen 6ebe132cd3 broadcast refactor
4 years ago
danish 273fc0071c New User facing Pad Kernel + ST + Allows for channel padding
5 years ago
mindspore-ci-bot fc79997de5 !5502 Mod SoftmaxCrossEntropyWithlogits
5 years ago
wanyiming 0ec70068ae mod_SoftmaxCrossEntropyWithLogits
5 years ago
Jonathan Yan bbd19dbe43 roi align memory leak
5 years ago
mindspore-ci-bot 749979e7c4 !5458 NMS GPU OP Performance improvement
5 years ago
mindspore-ci-bot 087212441a !5192 Add unit tests for scatternd
5 years ago
danish 7d7fa760a0 reduce based nms final pass - speed improv
5 years ago
peixu_ren 2d5d44ab22 Add test cases for uniform ops on GPU
5 years ago
mindspore-ci-bot adbb75556d !4778 Added type support for Reshape gpu kernel for faster RCNN
5 years ago
mindspore-ci-bot 314c5ad055 !5172 remove name arg from gradoperation
5 years ago
panyifeng 1a54785fe2 remove name arg from gradoperation
5 years ago
peixu_ren 10f381d662 Modify the name of parameters in uniform
5 years ago
Peilin Wang d92a357e4a add scatternd ut
5 years ago
mindspore-ci-bot e2203bed01 !3957 Gpu StridedSlice dims exceeds
5 years ago
wilfChen 837aecf9af gpu stridedslice
5 years ago
Peilin Wang b7a4313755 added type support for reshape gpu kernel
5 years ago
mindspore-ci-bot 13d1738ff3 !4706 fix SmoothL1Loss gpu kernel
5 years ago
VectorSL e939d61a2c conv pooling pad support NHWC
5 years ago
Peilin Wang 0d5220d33c modified documentation and gpu kernel for smoothL1Loss
5 years ago
baihuawei 772e14d00d add categorical
5 years ago
danish 97f08e74ec nms_sorting fix
5 years ago
mindspore-ci-bot c7b50bcdd2 !4251 adding type support for gpu kernels for EfficientNet
5 years ago
mindspore-ci-bot a23dd7147a !4246 add type support for gpu kernelsl for faster-rcnn
5 years ago
mindspore-ci-bot 01962afd23 !4024 Support half data type in ROIAlign/ROIAlignGrad Kernel
5 years ago
mindspore-ci-bot c041f4a295 !4368 add fix to GPU-RandomChoiceWithMask
5 years ago
Peilin Wang 571094f473 added type support for transpose and maxgrad
5 years ago
Peilin Wang 3cb3a5c7d8 type support for faster rcnn gpu kernels
5 years ago
TFbunny 17d01e838f add fix to GPU-RandomChoiceWithMask (bitonicsort & testcase)
5 years ago
mindspore-ci-bot 64b0feb7a2 !4315 support int8, int16, bool types for akg ops GPU
5 years ago
baihuawei 216ef0e144 fix multinomial
5 years ago
mindspore-ci-bot 1856fb6af1 !3800 add gpu multinomial backend
5 years ago
Jonathan Yan e0528615e3 int16 uint8 bool supported akg ops
5 years ago
mindspore-ci-bot 6c4b4f91d2 !3978 Added unit tests for ResizeNearestNeighbor gpu kernel
5 years ago
baihuawei 40748a30c7 add multinomial backend
5 years ago
ZPaC 1dcc34e785 Add GPU div kernel
5 years ago
mindspore-ci-bot eb84ae4593 !4048 Fix broadcast, scatternd, reduce ops.
5 years ago
mindspore-ci-bot 8040e8bf89 !4130 modify some bug and add test case for gpu dropout op
5 years ago
hanhuifeng2020 ab6f7420b5 modify some bug and add test case for gpu dropout op
5 years ago
VectorSL b0a6346b54 gpu update cast type
5 years ago
linqingke fb405ee6f4 broadcast, slice, scatter_nd ops optimizer.
5 years ago
Peilin Wang 3bd64bf03d add unit test for resizeNearestNeighbor gpu op
5 years ago
mindspore-ci-bot fea930f7aa !4088 make gpu op Less to support int32
5 years ago
root 3b41023a6b add int32 cal for less gpu
5 years ago
mindspore-ci-bot 8f17535045 !3831 CUDA - GPU MirrorPad New Op
5 years ago
danish 081249b53f commit 1 - mirror pad
5 years ago
Jonathan Yan 43094bf78e suport half for roi align
5 years ago
mindspore-ci-bot 55d280e777 !3608 Add gpu support for RandomChoiceWithMask
5 years ago
mindspore-ci-bot 49ba473bcc !3803 add gpu klDivLoss op
5 years ago
mindspore-ci-bot 51fcaf6e61 !3714 stridedslice/stridedslicegrad 4D to 7D
5 years ago
TFbunny ad8a786b07 add GPU support to RandomChoiceWithMask
5 years ago
mindspore-ci-bot 2449e4e7f1 !3760 Fix GPU-ResizeNearestNeighbor
5 years ago
mindspore-ci-bot e1f93ec28c !3619 NMSWithMask - CUDA Impl
5 years ago
mindspore-ci-bot 82b103a740 !3780 add gpu BinaryCrossEntropy
5 years ago
baihuawei 9eca56635d add KLDiv loss
5 years ago
baihuawei aa9ea1707c add binary cross entropy
5 years ago
panbingao e6335739d2 stridedslice/stridedslicegrad 4D to 7D
5 years ago
TFbunny 075acf80b1 fix ResizeNearestNeighbor & add util.cuh to support atomicadd for half
5 years ago
wilfChen 4d600e70f1 gpu layernorm
5 years ago
mindspore-ci-bot f1a39a0f72 !3655 gpu support BroadcastTo kernel
5 years ago
mindspore-ci-bot 773f8e5352 !3707 Gpu support LayerNorm kernel
5 years ago
wilfChen 1eee3d6937 gpu layernorm
5 years ago
danish a2ffc9530e stuff added
5 years ago
mindspore-ci-bot afce1c3a40 !3341 GPU maxpool with argmax op
5 years ago
wilfChen 9cad0fec1d gpu broadcast to
5 years ago
mindspore-ci-bot 3cac1bb9f8 !3352 Add ROIAlignGrad GPU operator
5 years ago
peixu_ren e7a99397a5 Add random uniform real op at GPU end
5 years ago
mindspore-ci-bot e730224a2c !3511 Add IOU, CheckValid, GatherNd, ScatterNd ops.
5 years ago
linqingke 7829bab811 add iou ops.
5 years ago
wilfChen d3167da8af gpu support stridedslice
5 years ago
tom__chen 5c3be0114f add maxpool_with_argmax/grad cuda kernel
5 years ago
Jonathan Yan ad40e00228 roi align grad v1
5 years ago
peixu_ren 19d80b87a9 Fix minor errors in probabilistic programming
5 years ago
mindspore-ci-bot 1e88d64b20 !3259 add gpu cumsum op and reducemin
5 years ago
baihuawei 4a8724e0ac add gpu cumsum
5 years ago
mindspore-ci-bot 183cf5cf5d !3285 Add Encode,Decode,SGD,floordiv,ScatterNd,GatherNd ops.
5 years ago
linqingke f679568d86 gpu ops code and test case.
5 years ago
mindspore-ci-bot d15b4c5d61 !3201 RoI Align GPU kernel
5 years ago
mindspore-ci-bot 38a52a5b67 !3176 Add gpu support for ResizeNearestNeighbor
5 years ago
mindspore-ci-bot 5f10417b9f !3276 make gpu equal op support int32
5 years ago
Jonathan Yan 661b993475 roi align v1
5 years ago
TFbunny de28cdc71c add GPU support to ResizeNearestNeighbor
5 years ago
mindspore-ci-bot cf4353f728 !3220 Add random normal op at MindSpore front-end
5 years ago
qujianwei 7479fb24a0 make gpu equal op support int32
5 years ago
peixu_ren 9b45018dfd Add random normal op at MindSpore front-end
5 years ago
VectorSL 90f15df037 add int64-->fp16 and update conv pad
5 years ago
mindspore-ci-bot 32921ea3dc !3166 add gpu oneslike op
5 years ago
qujianwei fb2ac74d9a add gpu oneslike kernel
5 years ago
mindspore-ci-bot 11732f0ea2 !3135 GPU cast support more type
5 years ago
VectorSL aef2c1984e cast support more types
5 years ago
mindspore-ci-bot 251683096a !3045 Gpu support TopK kernel
5 years ago
mindspore-ci-bot e249197c73 !3003 gpu support BroadcastTo kernels
5 years ago
mindspore-ci-bot ad09bf3e87 !3083 add gpu split and restructure gpu concat
5 years ago
zhaoting 5c0962acfa add gpu split and restructure gpu concat
5 years ago
peixu_ren 1feca960aa Rollback to Normal on D
5 years ago
wilfChen c10e07734c gpu support TopK kernel
5 years ago
wilfChen dfb958de1e Gpu support BroadcastTo kernel
5 years ago
peixu_ren 20ca96c62b Add random normal MindSpore interface
5 years ago
kingfo add3778a61 add grad all in pynative mode
5 years ago
wilfChen 0fdc304a8e gpu support smoothl1loss
5 years ago
wilfChen d54154a1f9 Gpu support ctcloss kernel
5 years ago
mindspore-ci-bot 4c6bff75af !1393 Gpu Support AdamWeightDecay optimizer fusion
5 years ago
He Wei 43e0967024 Decouple ir::Tensor class from python
5 years ago
wilfChen 034d2ea2aa Gpu Adam Fusion
5 years ago
mindspore-ci-bot 8870956954 !2441 add fake quant test case for gpu
5 years ago
chenzomi 8873f9dc7e add fake quant test case for gpu
5 years ago
mindspore-ci-bot a2cd05339f !2180 Gpu Gelu kernel support fp16
5 years ago
mindspore-ci-bot d57decc8a3 !2338 Gpu Minimum & Maximum kernels support int32
5 years ago
lizhenyu eb68c9953d change ftrl operator st
5 years ago
wilfChen 480bf4151b Gpu Minimum & Maximum kernels support int32
5 years ago
mindspore-ci-bot a9d06edae9 !2282 remove _quant_op.py from __init__.py
5 years ago
mindspore-ci-bot fce37a5fbe !2281 add Sigmoid and SigmoidGrad operation of GPU
5 years ago
wilfChen 8f4cd76582 gpu Gelu kernel support fp16
5 years ago
chenzomi bbce6faff9 remove _quant_ops.py from __init__.py
5 years ago
mindspore-ci-bot 2e002ab64c !2292 gpu fix all nop node graph execute
5 years ago
limingqi107 0f4397cece fix all nop node graph execute
5 years ago
lizhenyu ea0cd5ccdd add Sigmoid and SigmoidGrad operation of GPU
5 years ago
mindspore-ci-bot 74c3e15675 !2194 fix FakeQuantPerLayer/FakeQuantPerLayerGrad symmetric=True calculation error bug
5 years ago
mindspore-ci-bot 19e66f06e2 !2150 Gpu Tanh kernel support fp16
5 years ago
mindspore-ci-bot fe797aaf10 !2229 add ftrl optimizer
5 years ago
mindspore-ci-bot 95d887a35b !2226 add adam op for wide&deep model
5 years ago
mindspore-ci-bot c4863683ef !2235 add SigmoidCrossEntropyWithLogitsGrad operation
5 years ago
mindspore-ci-bot 116ed509bf !2234 add SigmoidCrossEntropyWithLogits op
5 years ago
lizhenyu 636b8e2b88 add SigmoidCrossEntropyWithLogitsGrad op
5 years ago
mindspore-ci-bot 4642df207a !2210 gpu optimize the max device memory config
5 years ago
lizhenyu 694a8213b7 add adam optimizer
5 years ago
lizhenyu ac2217dbae add SigmoidCrossEntropyWithLogits op
5 years ago
lizhenyu c3360a84cd add ftrl optimizer
5 years ago
wilfChen 9201ea5ed2 replace tanh implement with cudnn
5 years ago
limingqi107 55b3557c0d gpu optimize the max device memory config
5 years ago
王东旭 4e09ae83eb fix FakeQuantPerLayer/FakeQuantPerLayerGrad symmetric bug
5 years ago
liuxiao df63a3195d fix input value check for SparseApplyFtrl and SparseApplyAdagrad
5 years ago
mindspore-ci-bot d4a7c87b22 !2093 GPU add argmaxwithvalue
5 years ago
VectorSL 17377912ba gpu add argmaxwithvalue
5 years ago
buxue 66bbdb4a31 change tensor dtype and shape from function to attr
5 years ago
mindspore-ci-bot 87fa15de80 !2021 GPU add akg kernel greaterequal notequal
5 years ago
VectorSL cf2fc1cecf gpu add notequal greaterequal akg kernel
5 years ago
buxue 0cd57ddc5d check arg is tensor with vm backend
5 years ago
jiangjinsheng 51affc2f1b fixed validator for CumProd, ReduceProd, ApplyRMSProp
5 years ago
mindspore-ci-bot 9c33da391a !1513 refine data copy in multi-graph
5 years ago
liuwenhao4 a7ad0d0a49 Fixing some tiny faults about Pylint in my code(ops)
5 years ago
lizhenyu a25b84055c refine data copy in multi-graph
5 years ago
liuwenhao4 f3f0cbaeee Fixing some tiny faults about Pylint in my code(ops)
5 years ago
cristoval f6c20178d2 fix pylint check issues
5 years ago
jinyaohui 5a914994ba clean pylint
5 years ago
jinyaohui bcfaff97f9 clean pylint
5 years ago
wilfChen 1991a89f40 LayerNormGrad fix & codex
5 years ago
wilfChen 59c4cf256c gpu support broadcast kernels
5 years ago
mindspore-ci-bot 680ce090a3 !1057 matmul support fp16
5 years ago
mindspore-ci-bot 0edc6d254a !370 Gpu Support UnsortedSegmentSum kernel
5 years ago
mindspore-ci-bot 907b609b05 !994 gpu broadcast kernel support different dims
5 years ago
mindspore-ci-bot b5096e1f6c !1021 gpu support MinimumGrad & MaximumGrad kernel
5 years ago
mindspore-ci-bot da7054645a !948 gpu support LogSoftmax & LogSoftmaxGrad kernel
5 years ago
wilfChen b56572bb89 matmul support fp16
5 years ago
wilfChen 00e78bf6c4 gpu support MinimumGrad & MaximumGrad kernel
5 years ago
wilfChen 31f3611f9a gpu support UnsortedSegmentSum kernel
5 years ago
wilfChen 0a1195ddf5 broadcast kernel support unqual dims & half
5 years ago
ZPaC d3936b9f2a GPU kernels adapt with special dimensions.
5 years ago
wilfChen 1eb60df5d4 gpu support logsoftmax & logsoftmaxgrad kernel
5 years ago
mindspore-ci-bot f602970990 !323 Gpu Concat support 4 inputs
5 years ago
mindspore-ci-bot 4e25fec769 !324 Gpu Slice kernel performance improve
5 years ago
mindspore-ci-bot 378a7122a5 !372 Gpu support BatchMatMul kernel
5 years ago
mindspore-ci-bot 97d21ba014 !502 Gpu Support Gelu & GeluGrad
5 years ago
mindspore-ci-bot a97f30ba7d !516 Gpu support Tanh & TanhGrad kernel
5 years ago
mindspore-ci-bot 38c56fd1a5 !945 gpu queue support Sqrt & Rsqrt kernel
5 years ago
wilfChen a304304c30 gpu support Gelu & GeluGrad kernels
5 years ago
wilfChen 311bf41e6d gpu support tanh & tanhgrad kernel
5 years ago
wilfChen 67a0cc3bf1 gpu queue support unary
5 years ago
wilfChen 16f0688230 gpu support broadcast kernels
5 years ago
mindspore-ci-bot 0611c1a579 !849 [CT][MS] Wrong format of broadcast output, when multi-output
5 years ago
Wei Luning 157710ca0f bugfix* fix bug in output tuple of tuple.* check kRWWrite input no-variable* input x of ScatterNdUpdate should be a parameter node
5 years ago
mindspore-ci-bot 8c035a5171 !756 Gpu support LayerNorm kernel
5 years ago
dinghao f77de54aa4 fix tensor dirty
5 years ago
wilfChen 53b4529558 Gpu support LayerNorm kernel
5 years ago
VectorSL 4740c70fc3 gpu add testcases
5 years ago
mindspore-ci-bot 728801301c !313 GPU add akg kernel float_status
5 years ago
VectorSL c000fb2f34 gpu add float_status kernel
5 years ago
wilfChen 9a7702b807 gpu support batchmatmul kernel
5 years ago
mindspore-ci-bot 87be386581 !314 GPU add kernel assign
5 years ago
wilfChen cc93646207 gpu concat kernel support 4 inputs
5 years ago
wilfChen 5b7790a2a7 Gpu Slice kernel performance improvement
5 years ago
VectorSL 9e372073e2 gpu add assigin
5 years ago
VectorSL d248b05a98 gpu add kernel select
5 years ago
mindspore-ci-bot 94589ce611 !226 expend conv stride and dilation to 2d
5 years ago
chenzomi 652ab6c386 add test case for aware quantizaiton
5 years ago
wangnan39@huawei.com 2604acedcb extend conv stride and dilation to 2d
5 years ago
VectorSL aea6b0c974 update tests/st/ops/gpu/test_tensoradd.py.
5 years ago
zhunaipan 930a1fb0a8 initial version
5 years ago