lilei
71adabd944
modify_bug
5 years ago
Peilin Wang
f020e19636
add int32 support to greater gpu kernel
...
fix ci
5 years ago
mindspore-ci-bot
6fdb43d22d
!5895 gpu maximum minimum absgrad kernel fix
...
Merge pull request !5895 from chenweifeng/maximum-fix
5 years ago
mindspore-ci-bot
98725bc865
!5790 [MS][GPU][CUDA] Dedicated new user facing Pad API kernel
...
Merge pull request !5790 from danishnxt/GPU_three
5 years ago
mindspore-ci-bot
b717a686cf
!5690 ROIAlign kernel memory leak
...
Merge pull request !5690 from JonathanY/rcnn
5 years ago
wilfChen
3b54e55223
gpu maximum & minimum kernel with fp16 input
5 years ago
wilfChen
6ebe132cd3
broadcast refactor
5 years ago
danish
273fc0071c
New User facing Pad Kernel + ST + Allows for channel padding
...
style fix
lint fixes
added check in NN layer for > 4 paddings, plus lint fix
fix python lint
lint fix
lint fix
updating to pytest asserts to improve testing
removed unnecc vars from test file fail checks
5 years ago
mindspore-ci-bot
fc79997de5
!5502 Mod SoftmaxCrossEntropyWithlogits
...
Merge pull request !5502 from wanyiming/mod_SoftmaxCrossEntropyWithlogits
5 years ago
wanyiming
0ec70068ae
mod_SoftmaxCrossEntropyWithLogits
5 years ago
Jonathan Yan
bbd19dbe43
roi align memory leak
5 years ago
mindspore-ci-bot
749979e7c4
!5458 NMS GPU OP Performance improvement
...
Merge pull request !5458 from danishnxt/GPU_two
5 years ago
mindspore-ci-bot
087212441a
!5192 Add unit tests for scatternd
...
Merge pull request !5192 from Peilin/scatternd-type-support
5 years ago
danish
7d7fa760a0
reduce based nms final pass - speed improv
...
refactored faster nms
refactored faster nms + typo fix
added box flipping choice
set choice to true for testing - yz
switching back
new test file
5 years ago
peixu_ren
2d5d44ab22
Add test cases for uniform ops on GPU
5 years ago
mindspore-ci-bot
adbb75556d
!4778 Added type support for Reshape gpu kernel for faster RCNN
...
Merge pull request !4778 from Peilin/reshape-gpu-kernel-type-support
5 years ago
mindspore-ci-bot
314c5ad055
!5172 remove name arg from gradoperation
...
Merge pull request !5172 from riemann_penn/remove_name_args_from_gradoperation
5 years ago
panyifeng
1a54785fe2
remove name arg from gradoperation
5 years ago
peixu_ren
10f381d662
Modify the name of parameters in uniform
5 years ago
Peilin Wang
d92a357e4a
add scatternd ut
5 years ago
mindspore-ci-bot
e2203bed01
!3957 Gpu StridedSlice dims exceeds
...
Merge pull request !3957 from chenweifeng/strided_slice_dims_exceeds
5 years ago
wilfChen
837aecf9af
gpu stridedslice
5 years ago
Peilin Wang
b7a4313755
added type support for reshape gpu kernel
...
fix ci
5 years ago
mindspore-ci-bot
13d1738ff3
!4706 fix SmoothL1Loss gpu kernel
...
Merge pull request !4706 from Peilin/smoothL1Loss-fix
5 years ago
VectorSL
e939d61a2c
conv pooling pad support NHWC
5 years ago
Peilin Wang
0d5220d33c
modified documentation and gpu kernel for smoothL1Loss
...
fix pylint
changed doc and code for SmoothL1Loss to be same a dchip. fixed grad kernel
fix ci
5 years ago
baihuawei
772e14d00d
add categorical
5 years ago
danish
97f08e74ec
nms_sorting fix
...
lint py fix 2
nms_py_file test value fix
lint fix
5 years ago
mindspore-ci-bot
c7b50bcdd2
!4251 adding type support for gpu kernels for EfficientNet
...
Merge pull request !4251 from Peilin/efficientnet
5 years ago
mindspore-ci-bot
a23dd7147a
!4246 add type support for gpu kernelsl for faster-rcnn
...
Merge pull request !4246 from Peilin/faster-rcnn-type-support
5 years ago
mindspore-ci-bot
01962afd23
!4024 Support half data type in ROIAlign/ROIAlignGrad Kernel
...
Merge pull request !4024 from JonathanY/roihalf
5 years ago
mindspore-ci-bot
c041f4a295
!4368 add fix to GPU-RandomChoiceWithMask
...
Merge pull request !4368 from 34bunny/GPU-RandomChoiceWithMask-fix
5 years ago
Peilin Wang
571094f473
added type support for transpose and maxgrad
...
fix pylint
addressed code review comment
5 years ago
Peilin Wang
3cb3a5c7d8
type support for faster rcnn gpu kernels
...
addressed code review comments
fix cpplint and pylint
trying to fix python ut
fix smoke test
5 years ago
TFbunny
17d01e838f
add fix to GPU-RandomChoiceWithMask (bitonicsort & testcase)
5 years ago
mindspore-ci-bot
64b0feb7a2
!4315 support int8, int16, bool types for akg ops GPU
...
Merge pull request !4315 from JonathanY/akg_ops
5 years ago
baihuawei
216ef0e144
fix multinomial
5 years ago
mindspore-ci-bot
1856fb6af1
!3800 add gpu multinomial backend
...
Merge pull request !3800 from baihuawei/multinomial-c
5 years ago
Jonathan Yan
e0528615e3
int16 uint8 bool supported akg ops
5 years ago
mindspore-ci-bot
6c4b4f91d2
!3978 Added unit tests for ResizeNearestNeighbor gpu kernel
...
Merge pull request !3978 from Peilin/master
5 years ago
baihuawei
40748a30c7
add multinomial backend
5 years ago
ZPaC
1dcc34e785
Add GPU div kernel
5 years ago
mindspore-ci-bot
eb84ae4593
!4048 Fix broadcast, scatternd, reduce ops.
...
Merge pull request !4048 from linqingke/new_ops
5 years ago
mindspore-ci-bot
8040e8bf89
!4130 modify some bug and add test case for gpu dropout op
...
Merge pull request !4130 from hanhuifeng/gpu_dropout
5 years ago
hanhuifeng2020
ab6f7420b5
modify some bug and add test case for gpu dropout op
5 years ago
VectorSL
b0a6346b54
gpu update cast type
5 years ago
linqingke
fb405ee6f4
broadcast, slice, scatter_nd ops optimizer.
5 years ago
Peilin Wang
3bd64bf03d
add unit test for resizeNearestNeighbor gpu op
...
fix pylint
fix pylint
changed mstype to np type to fix pylint
fix pylint
fix pylint
fix pylint
added back copyright
5 years ago
mindspore-ci-bot
fea930f7aa
!4088 make gpu op Less to support int32
...
Merge pull request !4088 from TuDouNi/master
5 years ago
root
3b41023a6b
add int32 cal for less gpu
5 years ago
mindspore-ci-bot
8f17535045
!3831 CUDA - GPU MirrorPad New Op
...
Merge pull request !3831 from danishnxt/GPU_One
5 years ago
danish
081249b53f
commit 1 - mirror pad
...
commit 2
lint fix
lint fix 2
updated backprop + st test
test_file_fix
test_file_fix_2
fixed header_guards
comments addressed
clangFormatFix
5 years ago
Jonathan Yan
43094bf78e
suport half for roi align
5 years ago
mindspore-ci-bot
55d280e777
!3608 Add gpu support for RandomChoiceWithMask
...
Merge pull request !3608 from 34bunny/GPU-RandomChoiceWithMask
5 years ago
mindspore-ci-bot
49ba473bcc
!3803 add gpu klDivLoss op
...
Merge pull request !3803 from baihuawei/loss
5 years ago
mindspore-ci-bot
51fcaf6e61
!3714 stridedslice/stridedslicegrad 4D to 7D
...
Merge pull request !3714 from panbingao/stridedslice
5 years ago
TFbunny
ad8a786b07
add GPU support to RandomChoiceWithMask
5 years ago
mindspore-ci-bot
2449e4e7f1
!3760 Fix GPU-ResizeNearestNeighbor
...
Merge pull request !3760 from 34bunny/GPU-ResizeNearestNeighbor-fix
5 years ago
mindspore-ci-bot
e1f93ec28c
!3619 NMSWithMask - CUDA Impl
...
Merge pull request !3619 from danishnxt/GPU_One
5 years ago
mindspore-ci-bot
82b103a740
!3780 add gpu BinaryCrossEntropy
...
Merge pull request !3780 from baihuawei/losscuda
5 years ago
baihuawei
9eca56635d
add KLDiv loss
5 years ago
baihuawei
aa9ea1707c
add binary cross entropy
5 years ago
panbingao
e6335739d2
stridedslice/stridedslicegrad 4D to 7D
5 years ago
TFbunny
075acf80b1
fix ResizeNearestNeighbor & add util.cuh to support atomicadd for half
5 years ago
wilfChen
4d600e70f1
gpu layernorm
5 years ago
mindspore-ci-bot
f1a39a0f72
!3655 gpu support BroadcastTo kernel
...
Merge pull request !3655 from chenweifeng/broadcast_to
5 years ago
mindspore-ci-bot
773f8e5352
!3707 Gpu support LayerNorm kernel
...
Merge pull request !3707 from chenweifeng/LayerNorm
5 years ago
wilfChen
1eee3d6937
gpu layernorm
5 years ago
danish
a2ffc9530e
stuff added
...
box copying fix
common function for IOU
new sort func
final update + unit test
remove one comment
fix lintig -1
lint fix 2
lint fix 3
last lint fix
value fix in ST nms with mask
addressing comments
pylint fix
pylint fix 1
test file fix
5 years ago
mindspore-ci-bot
afce1c3a40
!3341 GPU maxpool with argmax op
...
Merge pull request !3341 from tom_chen/maxpool_with_argmax
5 years ago
wilfChen
9cad0fec1d
gpu broadcast to
5 years ago
mindspore-ci-bot
3cac1bb9f8
!3352 Add ROIAlignGrad GPU operator
...
Merge pull request !3352 from JonathanY/ROIAlignGrad
5 years ago
peixu_ren
e7a99397a5
Add random uniform real op at GPU end
5 years ago
mindspore-ci-bot
e730224a2c
!3511 Add IOU, CheckValid, GatherNd, ScatterNd ops.
...
Merge pull request !3511 from linqingke/new_ops
5 years ago
linqingke
7829bab811
add iou ops.
5 years ago
wilfChen
d3167da8af
gpu support stridedslice
5 years ago
tom__chen
5c3be0114f
add maxpool_with_argmax/grad cuda kernel
5 years ago
Jonathan Yan
ad40e00228
roi align grad v1
5 years ago
peixu_ren
19d80b87a9
Fix minor errors in probabilistic programming
5 years ago
mindspore-ci-bot
1e88d64b20
!3259 add gpu cumsum op and reducemin
...
Merge pull request !3259 from baihuawei/automl
5 years ago
baihuawei
4a8724e0ac
add gpu cumsum
5 years ago
mindspore-ci-bot
183cf5cf5d
!3285 Add Encode,Decode,SGD,floordiv,ScatterNd,GatherNd ops.
...
Merge pull request !3285 from linqingke/gpu_ops
5 years ago
linqingke
f679568d86
gpu ops code and test case.
5 years ago
mindspore-ci-bot
d15b4c5d61
!3201 RoI Align GPU kernel
...
Merge pull request !3201 from JonathanY/main
5 years ago
mindspore-ci-bot
38a52a5b67
!3176 Add gpu support for ResizeNearestNeighbor
...
Merge pull request !3176 from 34bunny/GPU-ResizeNearestNeighbor
5 years ago
mindspore-ci-bot
5f10417b9f
!3276 make gpu equal op support int32
...
Merge pull request !3276 from qujianwei/master
5 years ago
Jonathan Yan
661b993475
roi align v1
5 years ago
TFbunny
de28cdc71c
add GPU support to ResizeNearestNeighbor
5 years ago
mindspore-ci-bot
cf4353f728
!3220 Add random normal op at MindSpore front-end
...
Merge pull request !3220 from peixu_ren/custom_gpu
5 years ago
qujianwei
7479fb24a0
make gpu equal op support int32
5 years ago
peixu_ren
9b45018dfd
Add random normal op at MindSpore front-end
5 years ago
VectorSL
90f15df037
add int64-->fp16 and update conv pad
5 years ago
mindspore-ci-bot
32921ea3dc
!3166 add gpu oneslike op
...
Merge pull request !3166 from qujianwei/gpu-oneslike
5 years ago
qujianwei
fb2ac74d9a
add gpu oneslike kernel
5 years ago
mindspore-ci-bot
11732f0ea2
!3135 GPU cast support more type
...
Merge pull request !3135 from VectorSL/cast2
5 years ago
VectorSL
aef2c1984e
cast support more types
5 years ago
mindspore-ci-bot
251683096a
!3045 Gpu support TopK kernel
...
Merge pull request !3045 from chenweifeng/sort
5 years ago
mindspore-ci-bot
e249197c73
!3003 gpu support BroadcastTo kernels
...
Merge pull request !3003 from chenweifeng/broadcastto
5 years ago
mindspore-ci-bot
ad09bf3e87
!3083 add gpu split and restructure gpu concat
...
Merge pull request !3083 from zhaoting/master
5 years ago
zhaoting
5c0962acfa
add gpu split and restructure gpu concat
5 years ago
peixu_ren
1feca960aa
Rollback to Normal on D
5 years ago
wilfChen
c10e07734c
gpu support TopK kernel
5 years ago
wilfChen
dfb958de1e
Gpu support BroadcastTo kernel
5 years ago
peixu_ren
20ca96c62b
Add random normal MindSpore interface
5 years ago
kingfo
add3778a61
add grad all in pynative mode
5 years ago
wilfChen
0fdc304a8e
gpu support smoothl1loss
5 years ago
wilfChen
d54154a1f9
Gpu support ctcloss kernel
5 years ago
mindspore-ci-bot
4c6bff75af
!1393 Gpu Support AdamWeightDecay optimizer fusion
...
Merge pull request !1393 from chenweifeng/adam_weight_decay
5 years ago
He Wei
43e0967024
Decouple ir::Tensor class from python
5 years ago
wilfChen
034d2ea2aa
Gpu Adam Fusion
5 years ago
mindspore-ci-bot
8870956954
!2441 add fake quant test case for gpu
...
Merge pull request !2441 from chenzhongming/master
5 years ago
chenzomi
8873f9dc7e
add fake quant test case for gpu
5 years ago
mindspore-ci-bot
a2cd05339f
!2180 Gpu Gelu kernel support fp16
...
Merge pull request !2180 from chenweifeng/gelu-fp16
5 years ago
mindspore-ci-bot
d57decc8a3
!2338 Gpu Minimum & Maximum kernels support int32
...
Merge pull request !2338 from chenweifeng/nezha
5 years ago
lizhenyu
eb68c9953d
change ftrl operator st
5 years ago
wilfChen
480bf4151b
Gpu Minimum & Maximum kernels support int32
5 years ago
mindspore-ci-bot
a9d06edae9
!2282 remove _quant_op.py from __init__.py
...
Merge pull request !2282 from chenzhongming/master
5 years ago
mindspore-ci-bot
fce37a5fbe
!2281 add Sigmoid and SigmoidGrad operation of GPU
...
Merge pull request !2281 from zyli2020/add_sigmoid
5 years ago
wilfChen
8f4cd76582
gpu Gelu kernel support fp16
5 years ago
chenzomi
bbce6faff9
remove _quant_ops.py from __init__.py
5 years ago
mindspore-ci-bot
2e002ab64c
!2292 gpu fix all nop node graph execute
...
Merge pull request !2292 from limingqi107/master
5 years ago
limingqi107
0f4397cece
fix all nop node graph execute
5 years ago
lizhenyu
ea0cd5ccdd
add Sigmoid and SigmoidGrad operation of GPU
5 years ago
mindspore-ci-bot
74c3e15675
!2194 fix FakeQuantPerLayer/FakeQuantPerLayerGrad symmetric=True calculation error bug
...
Merge pull request !2194 from 王东旭/master
5 years ago
mindspore-ci-bot
19e66f06e2
!2150 Gpu Tanh kernel support fp16
...
Merge pull request !2150 from chenweifeng/tanh-fp16
5 years ago
mindspore-ci-bot
fe797aaf10
!2229 add ftrl optimizer
...
Merge pull request !2229 from zyli2020/add_ftrl_op
5 years ago
mindspore-ci-bot
95d887a35b
!2226 add adam op for wide&deep model
...
Merge pull request !2226 from zyli2020/add_adam_op
5 years ago
mindspore-ci-bot
c4863683ef
!2235 add SigmoidCrossEntropyWithLogitsGrad operation
...
Merge pull request !2235 from zyli2020/add_sigmoid_cross_entropy_with_logits_grad
5 years ago
mindspore-ci-bot
116ed509bf
!2234 add SigmoidCrossEntropyWithLogits op
...
Merge pull request !2234 from zyli2020/add_sigmoid_cross_entropy_with_logits
5 years ago
lizhenyu
636b8e2b88
add SigmoidCrossEntropyWithLogitsGrad op
5 years ago
mindspore-ci-bot
4642df207a
!2210 gpu optimize the max device memory config
...
Merge pull request !2210 from limingqi107/max_device_memory_optimize
5 years ago
lizhenyu
694a8213b7
add adam optimizer
5 years ago
lizhenyu
ac2217dbae
add SigmoidCrossEntropyWithLogits op
5 years ago
lizhenyu
c3360a84cd
add ftrl optimizer
5 years ago
wilfChen
9201ea5ed2
replace tanh implement with cudnn
5 years ago
limingqi107
55b3557c0d
gpu optimize the max device memory config
5 years ago
王东旭
4e09ae83eb
fix FakeQuantPerLayer/FakeQuantPerLayerGrad symmetric bug
5 years ago
liuxiao
df63a3195d
fix input value check for SparseApplyFtrl and SparseApplyAdagrad
5 years ago
mindspore-ci-bot
d4a7c87b22
!2093 GPU add argmaxwithvalue
...
Merge pull request !2093 from VectorSL/argmaxwithvalue
5 years ago
VectorSL
17377912ba
gpu add argmaxwithvalue
5 years ago
buxue
66bbdb4a31
change tensor dtype and shape from function to attr
5 years ago
mindspore-ci-bot
87fa15de80
!2021 GPU add akg kernel greaterequal notequal
...
Merge pull request !2021 from VectorSL/gpu-add-akg-kernel
5 years ago
VectorSL
cf2fc1cecf
gpu add notequal greaterequal akg kernel
5 years ago
buxue
0cd57ddc5d
check arg is tensor with vm backend
5 years ago
jiangjinsheng
51affc2f1b
fixed validator for CumProd, ReduceProd, ApplyRMSProp
5 years ago
mindspore-ci-bot
9c33da391a
!1513 refine data copy in multi-graph
...
Merge pull request !1513 from zyli2020/refine_data_copy_in_multi_graph
5 years ago
liuwenhao4
a7ad0d0a49
Fixing some tiny faults about Pylint in my code(ops)
5 years ago
lizhenyu
a25b84055c
refine data copy in multi-graph
5 years ago
liuwenhao4
f3f0cbaeee
Fixing some tiny faults about Pylint in my code(ops)
5 years ago
cristoval
f6c20178d2
fix pylint check issues
5 years ago
jinyaohui
5a914994ba
clean pylint
5 years ago
jinyaohui
bcfaff97f9
clean pylint
5 years ago
wilfChen
1991a89f40
LayerNormGrad fix & codex
5 years ago
wilfChen
59c4cf256c
gpu support broadcast kernels
5 years ago
mindspore-ci-bot
680ce090a3
!1057 matmul support fp16
...
Merge pull request !1057 from chenweifeng/matmul
5 years ago
mindspore-ci-bot
0edc6d254a
!370 Gpu Support UnsortedSegmentSum kernel
...
Merge pull request !370 from chenweifeng/unsorted_segment_sum
5 years ago
mindspore-ci-bot
907b609b05
!994 gpu broadcast kernel support different dims
...
Merge pull request !994 from chenweifeng/broadcast_unequal_dims
5 years ago
mindspore-ci-bot
b5096e1f6c
!1021 gpu support MinimumGrad & MaximumGrad kernel
...
Merge pull request !1021 from chenweifeng/broadcast_grad
5 years ago
mindspore-ci-bot
da7054645a
!948 gpu support LogSoftmax & LogSoftmaxGrad kernel
...
Merge pull request !948 from chenweifeng/logsoftmax
5 years ago
wilfChen
b56572bb89
matmul support fp16
5 years ago
wilfChen
00e78bf6c4
gpu support MinimumGrad & MaximumGrad kernel
5 years ago
wilfChen
31f3611f9a
gpu support UnsortedSegmentSum kernel
5 years ago
wilfChen
0a1195ddf5
broadcast kernel support unqual dims & half
5 years ago
ZPaC
d3936b9f2a
GPU kernels adapt with special dimensions.
5 years ago
wilfChen
1eb60df5d4
gpu support logsoftmax & logsoftmaxgrad kernel
5 years ago
mindspore-ci-bot
f602970990
!323 Gpu Concat support 4 inputs
...
Merge pull request !323 from chenweifeng/concat
5 years ago
mindspore-ci-bot
4e25fec769
!324 Gpu Slice kernel performance improve
...
Merge pull request !324 from chenweifeng/slice
5 years ago
mindspore-ci-bot
378a7122a5
!372 Gpu support BatchMatMul kernel
...
Merge pull request !372 from chenweifeng/batchmatmul
5 years ago
mindspore-ci-bot
97d21ba014
!502 Gpu Support Gelu & GeluGrad
...
Merge pull request !502 from chenweifeng/gelu
5 years ago
mindspore-ci-bot
a97f30ba7d
!516 Gpu support Tanh & TanhGrad kernel
...
Merge pull request !516 from chenweifeng/tanh
5 years ago
mindspore-ci-bot
38c56fd1a5
!945 gpu queue support Sqrt & Rsqrt kernel
...
Merge pull request !945 from chenweifeng/unary
5 years ago
wilfChen
a304304c30
gpu support Gelu & GeluGrad kernels
5 years ago
wilfChen
311bf41e6d
gpu support tanh & tanhgrad kernel
5 years ago
wilfChen
67a0cc3bf1
gpu queue support unary
5 years ago
wilfChen
16f0688230
gpu support broadcast kernels
5 years ago
mindspore-ci-bot
0611c1a579
!849 [CT][MS] Wrong format of broadcast output, when multi-output
...
Merge pull request !849 from vlne-v1/I1FQ76-wrong-format-of-boardcast-output-when-multi-output
5 years ago
Wei Luning
157710ca0f
bugfix* fix bug in output tuple of tuple.* check kRWWrite input no-variable* input x of ScatterNdUpdate should be a parameter node
5 years ago
mindspore-ci-bot
8c035a5171
!756 Gpu support LayerNorm kernel
...
Merge pull request !756 from chenweifeng/layer_norm
5 years ago
dinghao
f77de54aa4
fix tensor dirty
5 years ago
wilfChen
53b4529558
Gpu support LayerNorm kernel
5 years ago
VectorSL
4740c70fc3
gpu add testcases
5 years ago
mindspore-ci-bot
728801301c
!313 GPU add akg kernel float_status
...
Merge pull request !313 from VectorSL/float_status
5 years ago
VectorSL
c000fb2f34
gpu add float_status kernel
5 years ago
wilfChen
9a7702b807
gpu support batchmatmul kernel
5 years ago
mindspore-ci-bot
87be386581
!314 GPU add kernel assign
...
Merge pull request !314 from VectorSL/assign
5 years ago
wilfChen
cc93646207
gpu concat kernel support 4 inputs
5 years ago
wilfChen
5b7790a2a7
Gpu Slice kernel performance improvement
5 years ago
VectorSL
9e372073e2
gpu add assigin
5 years ago
VectorSL
d248b05a98
gpu add kernel select
5 years ago
mindspore-ci-bot
94589ce611
!226 expend conv stride and dilation to 2d
...
Merge pull request !226 from wangnan39/expend_conv_stride_to_2d
5 years ago
chenzomi
652ab6c386
add test case for aware quantizaiton
5 years ago
wangnan39@huawei.com
2604acedcb
extend conv stride and dilation to 2d
5 years ago
VectorSL
aea6b0c974
update tests/st/ops/gpu/test_tensoradd.py.
...
fix pytest.mark for testcase
5 years ago
zhunaipan
930a1fb0a8
initial version
...
Signed-off-by: leonwanghui <leon.wanghui@huawei.com>
5 years ago