Commit Graph

514 Commits (b95d6fbe588c92c5b60225b223bec1734acd7d44)

Author SHA1 Message Date
mindspore-ci-bot 0e27dccbcf !3093 VOCDataset output change to multi-columns
5 years ago
liyong 28ebd73096 fix bug when int or float is numpy type
5 years ago
Amir Lashkari 572750cd40 Added cpp Equalize op
5 years ago
nhussain b064cbab5d fix upper bound
5 years ago
mindspore-ci-bot b091f74c23 !3016 Add CSV dataset loader
5 years ago
jiangzhiwen 2f506b7985 CSV dataset loader
5 years ago
mindspore-ci-bot fa7fa8a162 !3234 [MD] fix bug when save tfrecord data
5 years ago
liyong f521532a06 fix field_name probelem from tfrecord to mindrecord
5 years ago
xulei2020 18b519ae0f add sentence piece
5 years ago
xiefangqi fbd9e3d402 vocdataset output multi-columns
5 years ago
anzhengqi 008b91b2a1 inject epoch ctrl op in the execution tree and send eos at the end of epoch
5 years ago
mindspore-ci-bot 4945d34a41 !3133 [MD] fix num_sample in distributed sampler
5 years ago
mindspore-ci-bot a84092e89d !3102 [MD] Add additional parameter checks in RandomCropWithBBoxOp and RandomResizeWithBbox
5 years ago
liyong e2ea1fa0df activate num_samples in distributed samplers
5 years ago
mindspore-ci-bot 1bcbbb08d6 !2973 [MD] Add implementation of SlidingWindow TensorOp, c++ & python
5 years ago
mindspore-ci-bot 1a518e1cff !3144 [MD] Fix ResizeOp behaviour
5 years ago
nhussain c984b0fc88 fix size tuple
5 years ago
mindspore-ci-bot 35c0f1d2e6 !3143 Add python UT to MnistDataset op
5 years ago
tinazhang ab37e87d7a adding Mnist python ut coverage
5 years ago
islam_amin 5647889c0d Added AutoContrast Op
5 years ago
mindspore-ci-bot 37e8439c0a !3077 supporting cpp invert operation
5 years ago
mindspore-ci-bot c07a8d2033 !3137 Cleanup dataset UT: Remove deprecated save_and_check
5 years ago
nhussain 5871ba8a05 more validator fixes
5 years ago
liyong bc676fe250 save op in minddataset
5 years ago
avakh 35c3a63701 support cpp invert operation
5 years ago
Cathy Wong 04eb4f89c7 Cleanup dataset UT: Replace save_and_check
5 years ago
Zirui Wu d233c54139 implemented cpp random choice, apply and compos
5 years ago
YangLuo 4136892a3e add SlidingWindow Op
5 years ago
mindspore-ci-bot bc0a53cfb1 !3030 minddata support ps-lite feature
5 years ago
mindspore-ci-bot ba0143402c !3067 Cleanup dataset UT: Remove unneeded tf data files and tests
5 years ago
mindspore-ci-bot 219a716eae !3066 fix some batch's get_dataset_size and some text validator inconsistency
5 years ago
mindspore-ci-bot bed93a9ead !3039 [Dataset Python UT] adding coverage to Cifar Op and update test dataset
5 years ago
Zirui Wu 63185cb20f fix some validators errors
5 years ago
Cathy Wong ae1ed327ba Cleanup dataset UT: Remove unneeded data files and tests
5 years ago
nhussain 05b2a57d2a fix validation errors, and fix try catch error tests
5 years ago
tinazhang 340d98a4d1 added test case to cifar_op
5 years ago
xiefangqi add19a591c md support ps-lite
5 years ago
mindspore-ci-bot 48654be14d !3040 BoundingBoxAugment op fix ratio
5 years ago
islam_amin 56da3b0ae1 Fixing ratio bug with BoundingBoxAugment
5 years ago
Jesse Lee 7c1bc5192b Initial Drop of CacheOp Phase I
5 years ago
mindspore-ci-bot 6284c42a76 !2941 MD tokenizer support output offsets
5 years ago
xiefangqi 47060631e5 add offsets feature to tokenizer
5 years ago
mindspore-ci-bot 541456044d !2934 GraphData support random_walk and get_edge_feature
5 years ago
mindspore-ci-bot 2f565f4c20 !2776 Update API of dataset.text.lookup
5 years ago
heleiwang 7a046a1d70 support get_edge_feature
5 years ago
Zirui Wu 7b15e5a742 rework on lookup
5 years ago
nhussain 2c7fd248f8 fixes for PR-2908: avoid empty strings for column names
5 years ago
nhussain 6c37ea3be0 fix validators
5 years ago
mindspore-ci-bot 915ddd25dd !2892 gnn random walk pr 1977 comment follow up
5 years ago
avakh 421b80bf8f updating python unit tests based on floating point bounding boxes + adding coco dataset in unit tests
5 years ago
tony_liu2 14899a1410 fix gnn random walk pr 1977 comments
5 years ago
mindspore-ci-bot 9ba937b1af !2920 Updating 3 BBox AugOps for Float + Common Files for BBox Float Support - (VertFlip/RandomCrop/RandomResizeCrop)
5 years ago
mindspore-ci-bot c9929fd8a1 !2900 Update BoundingBoxAugment and RandomHorizontalFlipWithBBox to use float
5 years ago
mindspore-ci-bot 92d2011b8b !2924 [Dataset Python UT] Disable md5 random perspective test
5 years ago
Danish Farid 78c370f72a First Commit - New Infrastructure - Python UT tests + Common Aug Files
5 years ago
tinazhang 8455f957ac disable md5 random perspective test case
5 years ago
islam_amin dfc3409f67 Update RandomHorizontalFlipWithBBox and BoundingBouxAugment C++ Ops to use floats
5 years ago
xiefangqi 62d55fdbce vocdataset support float bbox
5 years ago
mindspore-ci-bot b5066e81ee !2308 Unskip md5 python UT case for RandomPerspective
5 years ago
tinazhang dc03019200 unskip md5 testcase for RandomPerspective
5 years ago
Danish Farid 5a9673ee6f fix_annotate bug fix
5 years ago
peilin-wang 29aa589972 added check for invalid type for boolean args
5 years ago
yanghaitao 365c901ee0 get default value if num_parallel_worker is None
5 years ago
mindspore-ci-bot d08a89ab87 !2737 makes 0 an invaild bucket size
5 years ago
mindspore-ci-bot 1227719cdd !2603 [Dataset Python UT] Cleanup mse and md5 validation
5 years ago
peilin-wang 9468b49e28 bucket_batch_sizes must be strictly positive, 0 is not a valid batch size
5 years ago
mindspore-ci-bot c70e6987ba !2624 Fix config test
5 years ago
mindspore-ci-bot 9b4399480a !2321 Connector throughput performance metric
5 years ago
tinazhang66 5cd3136355 remove local defined mse and add missing mse/md5 validation
5 years ago
ms_yan 7fa0d9e7e4 add paramter check for numpyslices and num_shards
5 years ago
mindspore-ci-bot 32a2d5d21c !2629 get monitor sampling interval from json file
5 years ago
mindspore-ci-bot 363489d00f !2580 BasicTokenizer do not case fold on preserved words
5 years ago
qianlong cae77c0c22 BasicTokenizer not case fold on preserverd words
5 years ago
liyong 68e2097897 fix split erroer message
5 years ago
yanghaitao fd701b8504 set monitor sampling interval through json file
5 years ago
Eric a4f5802924 Added Fix for test config
5 years ago
Alexey Shevlyakov 698abf75fc Connector throughput + PerfData + CyclicArray
5 years ago
mindspore-ci-bot a1b17bd2e2 !2593 python unit tests for randomResizeWithBBox and ResizeWithBBox
5 years ago
avakh 1e869146e9 applying comments removing VOC
5 years ago
Cathy Wong 277aba5326 dataset: Fixup docs; remove pylint disabled messages in UT
5 years ago
mindspore-ci-bot e11c953225 !2587 Implementing Python UTs for BoundingBoxAugment and RandomHorizontalFlipWithBBox
5 years ago
mindspore-ci-bot 067592ccf9 !2590 Updated Python UTs for RandomVerticalFlipWithBBox, RandomCropWithBBox and RandomResizedCropWithBBox
5 years ago
islam_amin 0f58f0338e updating ut for RandomHorizontalFlipWithBBox and BBoxAugment
5 years ago
mindspore-ci-bot b0a10c26a4 !2532 adding random-resize-with-bbox-op and resize-with-bbox-op
5 years ago
Danish Farid 3d1ecaaeb5 updated UT test for Python (3) AugOps with BBox - MD5 checks + imrpv comments
5 years ago
avakh 587e260236 addressing comments
5 years ago
Jamie Nisbet c22eac743d subtree creation in python apis
5 years ago
Danish Farid a8f392ca42 updated test util file + new BoundingBoxCheck + fixed VOCDataset annotations
5 years ago
YangLuo 4e3bfcf4c9 !2306 [Dataset] Code review & improve quality
5 years ago
mindspore-ci-bot beb436f457 !2398 Bug in Slice when multiple rows are used
5 years ago
mindspore-ci-bot a9c309da4c !2404 [Dataset]rename input variable name to children and output variable to parent
5 years ago
mindspore-ci-bot 1ea38eb60c !2375 Add Python Tokenizer
5 years ago
liyong 2412ee09ce fix distributedSampler reshuffle and fix random_device failed
5 years ago
mindspore-ci-bot 78a8bc302d !2380 Fix CocoDataset issue
5 years ago
xulei2020 3b71bd0d5d rename input to children, output to parent
5 years ago
hesham 68030e6a4b Slice Bug
5 years ago
qianlong 980ddd32a2 change output of WordpieceTokenizer and BertTokenizer to 1-D string tensors
5 years ago
xiefangqi 5703a10b8b fix CocoDataset issue
5 years ago
mindspore-ci-bot bbf69912be !2363 fix TextFildDataset and CLUEDataset does not support to_device
5 years ago
hesham e981c67acd Python Tokenizer
5 years ago
hesham 674415f7be Cleanup work for Concate, Mask, Slice, PadEnd and TruncatePair
5 years ago
mindspore-ci-bot bc4b1c2460 !2373 Comments + improved casting/parameters + UT tests for 3 AugOps (Augmentation Ops RandomCropWithBBox, Random Vertical FlipWithWithBBox and Random CropAndResizeWithWithBBox)
5 years ago
mindspore-ci-bot 7bdcc319b2 !2325 Addressing comments from PR2314
5 years ago
Danish Farid 6442a85b75 comments added, ptrs to consts, UT tests for BBOps
5 years ago
islam_amin edc42c5b85 addressed comments from reviews
5 years ago
nhussain d89101b95f add missing test
5 years ago
yanghaitao 9f336bb735 fix TextFileDataset and CLUEDataset failed with to_device
5 years ago
peilinwang 1e36b0649f remove graphengine changes
5 years ago
hesham 11826fb256 DuplicateOp
5 years ago
mindspore-ci-bot 90bb9320aa !2314 Implemented RandomHorizontalFlipWithBBox and BoundingBoxAugment C++ Ops
5 years ago
mindspore-ci-bot a83baafbf6 !2253 Add ConcatOp to Dataset
5 years ago
mindspore-ci-bot 3784220056 !2317 rework on dataset.text.vocab to support any user special_tokens
5 years ago
nhussain 5515016dba remove graphengine changes
5 years ago
mindspore-ci-bot ffc8a3c362 !2274 add PadEndOp
5 years ago
mindspore-ci-bot e4322baece !2316 [Dataset] Move python tree transformation into cpp pass
5 years ago
islam_amin 0af156d169 added two ops for object detection, random horizontal flip with bbox and bouding box augment. along with their unit tests
5 years ago
xunxue 87aa9c8f7a PadEndOp
5 years ago
Junhan Hu 57f3732ac3 Move python tree transformation into cpp pass.
5 years ago
Zirui Wu b6e9504b31 phase I of Vocab rework
5 years ago
Zirui Wu 8f2674850b address API doc style and content
5 years ago
mindspore-ci-bot 5850b99161 !2211 Add TruncatePair Op to dataset
5 years ago
mindspore-ci-bot d0c5071cab !1993 [Dataset] Fix codedex.
5 years ago
ms_yan 8d1dae46ac Throw error when load config failed
5 years ago
hesham b9495a9ccc Truncate Pair
5 years ago
mindspore-ci-bot ea829e89c3 !2198 Add MaskOp to dataset
5 years ago
mindspore-ci-bot 808380a9f9 !2142 Cleanup dataset UT: refactor visualize()
5 years ago
ms_yan fdafc690f8 repair problem in NumpySlicesDataset
5 years ago
Tinazhang 169eaf73c7 reorganize util function visualize()
5 years ago
Yang dee8471d23 !1993 [Dataset] Fix codedex.
5 years ago
qianlong 4f16f036be Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp
5 years ago
hesham f2462bb00d Mask Op
5 years ago
mindspore-ci-bot ca94916ef9 !2184 CocoDataset add category_id check
5 years ago
mindspore-ci-bot 625f2421b5 !1977 random walk in gnn node2vec
5 years ago
mindspore-ci-bot 3adbde56b2 !2162 SliceOp
5 years ago
mindspore-ci-bot 898b2fdecd !2187 Fix comment display issues in BuildVocabDataset
5 years ago
Zirui Wu 2794883644 fix selected minor issues
5 years ago
xiefangqi aab190fcdb CocoDataset add category_id validation
5 years ago
liyong d4f8f57c7e add split in minddataset
5 years ago
hesham e2012a1de9 Slice Op
5 years ago
mindspore-ci-bot dceb3c2be4 !2026 Change C++ behavior in RandomCropAndResize
5 years ago
Mahdi add29f95f5 resolved issues in the comments
5 years ago
mindspore-ci-bot 2cab44a637 !2028 FillOp for BERT
5 years ago
nhussain dd9bf09f0a added FillOp for #119 - special Ops
5 years ago
Peilin Wang 848e07d022 initial commit, start of BucketBatchByLengthOp
5 years ago
Jonathan Yan 87d2c27c7f random walk v1
5 years ago
mindspore-ci-bot a8cf83acf2 !1932 Add CLUE dataset
5 years ago
jiangzhiwen e0e167a000 add CLUE dataset
5 years ago
hesham a22763b612 Bug in CIFAR after removing GetMutabble buffer
5 years ago
jonyguo bb2082daec fix: MindDataset padded log error
5 years ago
mindspore-ci-bot 3085e51e45 !1922 check parameter num_samples of sampler
5 years ago
mindspore-ci-bot 1b5d9a1194 !1981 Implemented from_dataset, a way to build a vocab from existing dataset
5 years ago
Zirui Wu 880ce5ea26 implemented from_dataset
5 years ago
Alexey Shevlyakov 4e333a2f22 MindData profiler infrastructure
5 years ago
mindspore-ci-bot 0a95223f25 !1910 RandomColorAdjust error for grayscale images
5 years ago
yanghaitao 8f16cff12e add para check for sampler
5 years ago
jonyguo 1de7271afc add floatxx test case
5 years ago
ms_yan bc22c172b8 add TensorDataset and its ut
5 years ago
mindspore-ci-bot 251a6667a5 !1989 fix MindDataset distribute shuffle error
5 years ago
mindspore-ci-bot b3da41bd7a !1913 CocoDataset implementation
5 years ago
xiefangqi 097cbcc0cb CocoDataset implementation
5 years ago
jonyguo 96859f44b6 fix: MindDataset distribute shuffle bug
5 years ago
Mahdi ba2e2dced4 fixed some warnings and typos
5 years ago
Tinazhang 2812cd57db fix pylint error
5 years ago
mindspore-ci-bot fe0c3e9a8f !1908 Add UT to six data vision transforms ops
5 years ago
heleiwang 3ece8dd090 1. support get_all_edges, get_nodes_from_edge, get_sampled_neighbors, get_neg_sampled_neighbors and graph_info API
5 years ago
mindspore-ci-bot 5eb95599f6 !1874 Updates to string tensor
5 years ago
Tinazhang c3de5c7a54 adding new TCs to six ops and skip md5 case in RandomPerspective
5 years ago
hesham f837ddc956 - Bug when empty strings sent to Python
5 years ago
liyong feff8899ac support padding samples
5 years ago
mindspore-ci-bot 5c21616293 !1807 Implemented Ngram TensorOp for dataset
5 years ago
Zirui Wu dbf9936ec4 Implemented n-gram for dataset TensorOp
5 years ago
Jamie Nisbet 51bc0c0460 consistent design for num_samples
5 years ago
Tinazhang 33fe66fd1e fix error handling for bigger crop size case
5 years ago
Yang 5522edc4b5 !1713 [Dataset] Clean pylint.
5 years ago
liuwenhao4 7b911886ec Fixing some tiny faults about Pylint in my code(ops)
5 years ago
mindspore-ci-bot 7878743400 !1638 fixed rounding edge case in split
5 years ago
mindspore-ci-bot 0e3dd8149a !1703 Adding sync_wait input check
5 years ago
mindspore-ci-bot c366b3fb18 !1704 fix num rows bug in sampler
5 years ago
Eric 8e1b2107ce Added example for sync_wait bad batch_num
5 years ago
mindspore-ci-bot 976b4793d1 !1544 Python Unit Test for TenCrop
5 years ago
Peilin Wang 3ef3d1433d fixed num_samples bug for RandomSampler
5 years ago
Tinazhang f4a61fdbb2 Adding new UT cases and missing TCs in PY Ops.
5 years ago
Peilin Wang 5469be2a97 fixed bug for split, RandomSampler and some other cleanup
5 years ago
Mahdi dfc097019b Added TenCrop test
5 years ago
mindspore-ci-bot 21da86b393 !1591 add get_dataset_size for CelebADataset
5 years ago
yanghaitao cc6c7a3f60 add get_dataset_size to celebadataset
5 years ago
mindspore-ci-bot 0f4dc13e8e !1478 [Dataset] Fix CI warnings, clean pylint.
5 years ago
Yang 9b2a778d94 !1478 [Dataset] clean pylint.
5 years ago
xiefangqi 5e4728c50f add voc support split
5 years ago
Peilin Wang d4c93575e7 fixed bug for split, RandomSampler and some other cleanup
5 years ago
Yang abca62f407 10:00 26/5 clean pylint
5 years ago
mindspore-ci-bot 0a8ef2fe72 !1455 Cleanup dataset UT: restore config support
5 years ago
Cathy Wong f891e1755c Cleanup dataset UT: restore config support
5 years ago
mindspore-ci-bot aeb4c52f2d !1379 Creating new UTs to three vision transformation ops
5 years ago
mindspore-ci-bot 718a96430f !1389 Security issues in recent string tensor PR
5 years ago
mindspore-ci-bot a6b8451a33 !1380 make ShuffleOp have deterministic behavior for subsequent epochs
5 years ago
heleiwang f28f883cac fix comments error and modify parameter check
5 years ago
hesham c0f0e154f5 Clean up work for text python package
5 years ago
mindspore-ci-bot 41456ac824 !1369 dataset: delete StorageDataset
5 years ago
Tinazhang 7322839b04 add UTs for LinearTransformation, ToPIL, ToType
5 years ago
Peilin Wang 0cbcc7200b made shuffle determinisitc for subsequent epochs
5 years ago
mindspore-ci-bot 61639be1e0 !1375 Cleanup dataset UT: remove num_parallel_workers=1 in test_exception
5 years ago
mindspore-ci-bot a528797253 !1377 Code Fix for Uniform Augmentation
5 years ago
mindspore-ci-bot 458436186c !1365 Clean up work for text python sub-package
5 years ago
Tinazhang b390883c6a Bug fix
5 years ago
Cathy Wong 702005d403 Cleanup dataset UT: remove num_parallel_workers=1 in test_exception
5 years ago
ms_yan d5e896b51c delete storageDataset Op API and its test case
5 years ago
mindspore-ci-bot 6f733ec113 !1308 Stage 2 of adding support for string Tensor
5 years ago
xiefangqi 34236ce1f1 fix pylint
5 years ago
mindspore-ci-bot 3363d4e834 !1249 Add GNN dataset processing API
5 years ago
hesham 6c21e556c4 Clean up work for text python package
5 years ago
heleiwang 599a449e0b Support processing GNN data
5 years ago
jinyaohui fbdba6e4da clean pylint
5 years ago
hesham df361d1d26 Change mem layout of string tensor
5 years ago
mindspore-ci-bot 58e6d7d950 !1341 Added lookup and vocab to mindspore.dataset.text
5 years ago
jonwe bb51bb88d7 add compress in mindrecord
5 years ago
mindspore-ci-bot 2e3d55ed87 !1281 Implementation of SplitOp
5 years ago
mindspore-ci-bot 39b9aedf68 !1342 Bug fix on issue Core dump on GPU when train with lenet with AU
5 years ago
Peilin Wang 71e8bb1960 general split case done, chaining sampler (basic case) is working
5 years ago
Tinazhang e9e40b688b Bug fix
5 years ago
Zirui Wu 25ab2ef303 Implemented lookup and vocab
5 years ago
mindspore-ci-bot 46949fc327 !1307 Cleanup dataset UT: unskip and enhance TFRecord sharding tests
5 years ago
qianlong 451c20a6f5 Add UnicodeCharTokenizer for nlp
5 years ago
mindspore-ci-bot 93e7c97a96 !1272 [Dataset] MindData Tree Optimizer Infrastructure
5 years ago
Cathy Wong b78894e02b Cleanup dataset UT: unskip and enhance TFRecord sharding tests
5 years ago
Junhan Hu f44d213503 MindData optimizer infrastructure.
5 years ago
xulei2020 163b6b7ea7 add jieba c++ code
5 years ago
Tinazhang 17cecf2cf5 Added TCs to RandomCrop and RandomCropAndResize and modified visalize() calling
5 years ago
jinyaohui 5a914994ba clean pylint
5 years ago
jinyaohui bcfaff97f9 clean pylint
5 years ago
hesham e8ca243364 -Add DE_STRING
5 years ago
jiangzhiwen cb2814b498 flat_map first commit
5 years ago
mindspore-ci-bot c680cfbf27 !1157 dataset: add concat operation for dataset
5 years ago
mindspore-ci-bot ab031ee9ea !1126 VOCDataset support object detection function
5 years ago
xiefangqi c937bad53f minddata support voc
5 years ago
ms_yan c0fa7b4b19 init commit of concat dataset
5 years ago
jonyguo be2e7531ca fix: MindDataset parameter shard_id & num_shards check
5 years ago
Cathy Wong 913074e656 Cleanup dataset UT: resolve skipped test units
5 years ago
liyong aa3f89e74f mindrecord support read file list
5 years ago
Cathy Wong 49ef53f164 Cleanup dataset UT: util.py internals
5 years ago
mindspore-ci-bot 2860fd9338 !984 Add unit test case for HWC2CHW.
5 years ago
Tinazhang c8b5586c7f add unit test for HWC2CHWC
5 years ago
Cathy Wong 58226addd6 Cleanup dataset UT: use md5 npz in test_zip for images
5 years ago
mindspore-ci-bot 47f5abceb4 !960 Adding example for grayscale
5 years ago
mindspore-ci-bot 078dd86cfe !507 Implemented padded_batch
5 years ago
mindspore-ci-bot de7625777f !951 fix: MindDataset with columns_name parameter cause errors in some scenes
5 years ago
eric 0f0548f21b Added test case for grayscale support
5 years ago
Zirui Wu c2d364a573 batch with padding implemented
5 years ago
jonyguo d4d236bcce fix: use MindDataset by column_names get data error in some situation
5 years ago
liyong b520ca9087 fix pk sampler in mindrecord
5 years ago