Commit Graph

449 Commits (3621bb23488dae78e0e5d8b14d99accfec1816c3)

Author SHA1 Message Date
peilin-wang 9468b49e28 bucket_batch_sizes must be strictly positive, 0 is not a valid batch size
5 years ago
mindspore-ci-bot c70e6987ba !2624 Fix config test
5 years ago
mindspore-ci-bot 9b4399480a !2321 Connector throughput performance metric
5 years ago
tinazhang66 5cd3136355 remove local defined mse and add missing mse/md5 validation
5 years ago
ms_yan 7fa0d9e7e4 add paramter check for numpyslices and num_shards
5 years ago
mindspore-ci-bot 32a2d5d21c !2629 get monitor sampling interval from json file
5 years ago
mindspore-ci-bot 363489d00f !2580 BasicTokenizer do not case fold on preserved words
5 years ago
qianlong cae77c0c22 BasicTokenizer not case fold on preserverd words
5 years ago
liyong 68e2097897 fix split erroer message
5 years ago
yanghaitao fd701b8504 set monitor sampling interval through json file
5 years ago
Eric a4f5802924 Added Fix for test config
5 years ago
Alexey Shevlyakov 698abf75fc Connector throughput + PerfData + CyclicArray
5 years ago
mindspore-ci-bot a1b17bd2e2 !2593 python unit tests for randomResizeWithBBox and ResizeWithBBox
5 years ago
avakh 1e869146e9 applying comments removing VOC
5 years ago
Cathy Wong 277aba5326 dataset: Fixup docs; remove pylint disabled messages in UT
5 years ago
mindspore-ci-bot e11c953225 !2587 Implementing Python UTs for BoundingBoxAugment and RandomHorizontalFlipWithBBox
5 years ago
mindspore-ci-bot 067592ccf9 !2590 Updated Python UTs for RandomVerticalFlipWithBBox, RandomCropWithBBox and RandomResizedCropWithBBox
5 years ago
islam_amin 0f58f0338e updating ut for RandomHorizontalFlipWithBBox and BBoxAugment
5 years ago
mindspore-ci-bot b0a10c26a4 !2532 adding random-resize-with-bbox-op and resize-with-bbox-op
5 years ago
Danish Farid 3d1ecaaeb5 updated UT test for Python (3) AugOps with BBox - MD5 checks + imrpv comments
5 years ago
avakh 587e260236 addressing comments
5 years ago
Jamie Nisbet c22eac743d subtree creation in python apis
5 years ago
Danish Farid a8f392ca42 updated test util file + new BoundingBoxCheck + fixed VOCDataset annotations
5 years ago
YangLuo 4e3bfcf4c9 !2306 [Dataset] Code review & improve quality
5 years ago
mindspore-ci-bot beb436f457 !2398 Bug in Slice when multiple rows are used
5 years ago
mindspore-ci-bot a9c309da4c !2404 [Dataset]rename input variable name to children and output variable to parent
5 years ago
mindspore-ci-bot 1ea38eb60c !2375 Add Python Tokenizer
5 years ago
liyong 2412ee09ce fix distributedSampler reshuffle and fix random_device failed
5 years ago
mindspore-ci-bot 78a8bc302d !2380 Fix CocoDataset issue
5 years ago
xulei2020 3b71bd0d5d rename input to children, output to parent
5 years ago
hesham 68030e6a4b Slice Bug
5 years ago
qianlong 980ddd32a2 change output of WordpieceTokenizer and BertTokenizer to 1-D string tensors
5 years ago
xiefangqi 5703a10b8b fix CocoDataset issue
5 years ago
mindspore-ci-bot bbf69912be !2363 fix TextFildDataset and CLUEDataset does not support to_device
5 years ago
hesham e981c67acd Python Tokenizer
5 years ago
hesham 674415f7be Cleanup work for Concate, Mask, Slice, PadEnd and TruncatePair
5 years ago
mindspore-ci-bot bc4b1c2460 !2373 Comments + improved casting/parameters + UT tests for 3 AugOps (Augmentation Ops RandomCropWithBBox, Random Vertical FlipWithWithBBox and Random CropAndResizeWithWithBBox)
5 years ago
mindspore-ci-bot 7bdcc319b2 !2325 Addressing comments from PR2314
5 years ago
Danish Farid 6442a85b75 comments added, ptrs to consts, UT tests for BBOps
5 years ago
islam_amin edc42c5b85 addressed comments from reviews
5 years ago
nhussain d89101b95f add missing test
5 years ago
yanghaitao 9f336bb735 fix TextFileDataset and CLUEDataset failed with to_device
5 years ago
peilinwang 1e36b0649f remove graphengine changes
5 years ago
hesham 11826fb256 DuplicateOp
5 years ago
mindspore-ci-bot 90bb9320aa !2314 Implemented RandomHorizontalFlipWithBBox and BoundingBoxAugment C++ Ops
5 years ago
mindspore-ci-bot a83baafbf6 !2253 Add ConcatOp to Dataset
5 years ago
mindspore-ci-bot 3784220056 !2317 rework on dataset.text.vocab to support any user special_tokens
5 years ago
nhussain 5515016dba remove graphengine changes
5 years ago
mindspore-ci-bot ffc8a3c362 !2274 add PadEndOp
5 years ago
mindspore-ci-bot e4322baece !2316 [Dataset] Move python tree transformation into cpp pass
5 years ago
islam_amin 0af156d169 added two ops for object detection, random horizontal flip with bbox and bouding box augment. along with their unit tests
5 years ago
xunxue 87aa9c8f7a PadEndOp
5 years ago
Junhan Hu 57f3732ac3 Move python tree transformation into cpp pass.
5 years ago
Zirui Wu b6e9504b31 phase I of Vocab rework
5 years ago
Zirui Wu 8f2674850b address API doc style and content
5 years ago
mindspore-ci-bot 5850b99161 !2211 Add TruncatePair Op to dataset
5 years ago
mindspore-ci-bot d0c5071cab !1993 [Dataset] Fix codedex.
5 years ago
ms_yan 8d1dae46ac Throw error when load config failed
5 years ago
hesham b9495a9ccc Truncate Pair
5 years ago
mindspore-ci-bot ea829e89c3 !2198 Add MaskOp to dataset
5 years ago
mindspore-ci-bot 808380a9f9 !2142 Cleanup dataset UT: refactor visualize()
5 years ago
ms_yan fdafc690f8 repair problem in NumpySlicesDataset
5 years ago
Tinazhang 169eaf73c7 reorganize util function visualize()
5 years ago
Yang dee8471d23 !1993 [Dataset] Fix codedex.
5 years ago
qianlong 4f16f036be Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp
5 years ago
hesham f2462bb00d Mask Op
5 years ago
mindspore-ci-bot ca94916ef9 !2184 CocoDataset add category_id check
5 years ago
mindspore-ci-bot 625f2421b5 !1977 random walk in gnn node2vec
5 years ago
mindspore-ci-bot 3adbde56b2 !2162 SliceOp
5 years ago
mindspore-ci-bot 898b2fdecd !2187 Fix comment display issues in BuildVocabDataset
5 years ago
Zirui Wu 2794883644 fix selected minor issues
5 years ago
xiefangqi aab190fcdb CocoDataset add category_id validation
5 years ago
liyong d4f8f57c7e add split in minddataset
5 years ago
hesham e2012a1de9 Slice Op
5 years ago
mindspore-ci-bot dceb3c2be4 !2026 Change C++ behavior in RandomCropAndResize
5 years ago
Mahdi add29f95f5 resolved issues in the comments
5 years ago
mindspore-ci-bot 2cab44a637 !2028 FillOp for BERT
5 years ago
nhussain dd9bf09f0a added FillOp for #119 - special Ops
5 years ago
Peilin Wang 848e07d022 initial commit, start of BucketBatchByLengthOp
5 years ago
Jonathan Yan 87d2c27c7f random walk v1
5 years ago
mindspore-ci-bot a8cf83acf2 !1932 Add CLUE dataset
5 years ago
jiangzhiwen e0e167a000 add CLUE dataset
5 years ago
hesham a22763b612 Bug in CIFAR after removing GetMutabble buffer
5 years ago
jonyguo bb2082daec fix: MindDataset padded log error
5 years ago
mindspore-ci-bot 3085e51e45 !1922 check parameter num_samples of sampler
5 years ago
mindspore-ci-bot 1b5d9a1194 !1981 Implemented from_dataset, a way to build a vocab from existing dataset
5 years ago
Zirui Wu 880ce5ea26 implemented from_dataset
5 years ago
Alexey Shevlyakov 4e333a2f22 MindData profiler infrastructure
5 years ago
mindspore-ci-bot 0a95223f25 !1910 RandomColorAdjust error for grayscale images
5 years ago
yanghaitao 8f16cff12e add para check for sampler
5 years ago
jonyguo 1de7271afc add floatxx test case
5 years ago
ms_yan bc22c172b8 add TensorDataset and its ut
5 years ago
mindspore-ci-bot 251a6667a5 !1989 fix MindDataset distribute shuffle error
5 years ago
mindspore-ci-bot b3da41bd7a !1913 CocoDataset implementation
5 years ago
xiefangqi 097cbcc0cb CocoDataset implementation
5 years ago
jonyguo 96859f44b6 fix: MindDataset distribute shuffle bug
5 years ago
Mahdi ba2e2dced4 fixed some warnings and typos
5 years ago
Tinazhang 2812cd57db fix pylint error
5 years ago
mindspore-ci-bot fe0c3e9a8f !1908 Add UT to six data vision transforms ops
5 years ago
heleiwang 3ece8dd090 1. support get_all_edges, get_nodes_from_edge, get_sampled_neighbors, get_neg_sampled_neighbors and graph_info API
5 years ago
mindspore-ci-bot 5eb95599f6 !1874 Updates to string tensor
5 years ago
Tinazhang c3de5c7a54 adding new TCs to six ops and skip md5 case in RandomPerspective
5 years ago
hesham f837ddc956 - Bug when empty strings sent to Python
5 years ago
liyong feff8899ac support padding samples
5 years ago
mindspore-ci-bot 5c21616293 !1807 Implemented Ngram TensorOp for dataset
5 years ago
Zirui Wu dbf9936ec4 Implemented n-gram for dataset TensorOp
5 years ago
Jamie Nisbet 51bc0c0460 consistent design for num_samples
5 years ago
Tinazhang 33fe66fd1e fix error handling for bigger crop size case
5 years ago
Yang 5522edc4b5 !1713 [Dataset] Clean pylint.
5 years ago
liuwenhao4 7b911886ec Fixing some tiny faults about Pylint in my code(ops)
5 years ago
mindspore-ci-bot 7878743400 !1638 fixed rounding edge case in split
5 years ago
mindspore-ci-bot 0e3dd8149a !1703 Adding sync_wait input check
5 years ago
mindspore-ci-bot c366b3fb18 !1704 fix num rows bug in sampler
5 years ago
Eric 8e1b2107ce Added example for sync_wait bad batch_num
5 years ago
mindspore-ci-bot 976b4793d1 !1544 Python Unit Test for TenCrop
5 years ago
Peilin Wang 3ef3d1433d fixed num_samples bug for RandomSampler
5 years ago
Tinazhang f4a61fdbb2 Adding new UT cases and missing TCs in PY Ops.
5 years ago
Peilin Wang 5469be2a97 fixed bug for split, RandomSampler and some other cleanup
5 years ago
Mahdi dfc097019b Added TenCrop test
5 years ago
mindspore-ci-bot 21da86b393 !1591 add get_dataset_size for CelebADataset
5 years ago
yanghaitao cc6c7a3f60 add get_dataset_size to celebadataset
5 years ago
mindspore-ci-bot 0f4dc13e8e !1478 [Dataset] Fix CI warnings, clean pylint.
5 years ago
Yang 9b2a778d94 !1478 [Dataset] clean pylint.
5 years ago
xiefangqi 5e4728c50f add voc support split
5 years ago
Peilin Wang d4c93575e7 fixed bug for split, RandomSampler and some other cleanup
5 years ago
Yang abca62f407 10:00 26/5 clean pylint
5 years ago
mindspore-ci-bot 0a8ef2fe72 !1455 Cleanup dataset UT: restore config support
5 years ago
Cathy Wong f891e1755c Cleanup dataset UT: restore config support
5 years ago
mindspore-ci-bot aeb4c52f2d !1379 Creating new UTs to three vision transformation ops
5 years ago
mindspore-ci-bot 718a96430f !1389 Security issues in recent string tensor PR
5 years ago
mindspore-ci-bot a6b8451a33 !1380 make ShuffleOp have deterministic behavior for subsequent epochs
5 years ago
heleiwang f28f883cac fix comments error and modify parameter check
5 years ago
hesham c0f0e154f5 Clean up work for text python package
5 years ago
mindspore-ci-bot 41456ac824 !1369 dataset: delete StorageDataset
5 years ago
Tinazhang 7322839b04 add UTs for LinearTransformation, ToPIL, ToType
5 years ago
Peilin Wang 0cbcc7200b made shuffle determinisitc for subsequent epochs
5 years ago
mindspore-ci-bot 61639be1e0 !1375 Cleanup dataset UT: remove num_parallel_workers=1 in test_exception
5 years ago
mindspore-ci-bot a528797253 !1377 Code Fix for Uniform Augmentation
5 years ago
mindspore-ci-bot 458436186c !1365 Clean up work for text python sub-package
5 years ago
Tinazhang b390883c6a Bug fix
5 years ago
Cathy Wong 702005d403 Cleanup dataset UT: remove num_parallel_workers=1 in test_exception
5 years ago
ms_yan d5e896b51c delete storageDataset Op API and its test case
5 years ago
mindspore-ci-bot 6f733ec113 !1308 Stage 2 of adding support for string Tensor
5 years ago
xiefangqi 34236ce1f1 fix pylint
5 years ago
mindspore-ci-bot 3363d4e834 !1249 Add GNN dataset processing API
5 years ago
hesham 6c21e556c4 Clean up work for text python package
5 years ago
heleiwang 599a449e0b Support processing GNN data
5 years ago
jinyaohui fbdba6e4da clean pylint
5 years ago
hesham df361d1d26 Change mem layout of string tensor
5 years ago
mindspore-ci-bot 58e6d7d950 !1341 Added lookup and vocab to mindspore.dataset.text
5 years ago
jonwe bb51bb88d7 add compress in mindrecord
5 years ago
mindspore-ci-bot 2e3d55ed87 !1281 Implementation of SplitOp
5 years ago
mindspore-ci-bot 39b9aedf68 !1342 Bug fix on issue Core dump on GPU when train with lenet with AU
5 years ago
Peilin Wang 71e8bb1960 general split case done, chaining sampler (basic case) is working
5 years ago
Tinazhang e9e40b688b Bug fix
5 years ago
Zirui Wu 25ab2ef303 Implemented lookup and vocab
5 years ago
mindspore-ci-bot 46949fc327 !1307 Cleanup dataset UT: unskip and enhance TFRecord sharding tests
5 years ago
qianlong 451c20a6f5 Add UnicodeCharTokenizer for nlp
5 years ago
mindspore-ci-bot 93e7c97a96 !1272 [Dataset] MindData Tree Optimizer Infrastructure
5 years ago
Cathy Wong b78894e02b Cleanup dataset UT: unskip and enhance TFRecord sharding tests
5 years ago
Junhan Hu f44d213503 MindData optimizer infrastructure.
5 years ago
xulei2020 163b6b7ea7 add jieba c++ code
5 years ago
Tinazhang 17cecf2cf5 Added TCs to RandomCrop and RandomCropAndResize and modified visalize() calling
5 years ago
jinyaohui 5a914994ba clean pylint
5 years ago
jinyaohui bcfaff97f9 clean pylint
5 years ago
hesham e8ca243364 -Add DE_STRING
5 years ago
jiangzhiwen cb2814b498 flat_map first commit
5 years ago
mindspore-ci-bot c680cfbf27 !1157 dataset: add concat operation for dataset
5 years ago
mindspore-ci-bot ab031ee9ea !1126 VOCDataset support object detection function
5 years ago
xiefangqi c937bad53f minddata support voc
5 years ago
ms_yan c0fa7b4b19 init commit of concat dataset
5 years ago
jonyguo be2e7531ca fix: MindDataset parameter shard_id & num_shards check
5 years ago
Cathy Wong 913074e656 Cleanup dataset UT: resolve skipped test units
5 years ago
liyong aa3f89e74f mindrecord support read file list
5 years ago
Cathy Wong 49ef53f164 Cleanup dataset UT: util.py internals
5 years ago
mindspore-ci-bot 2860fd9338 !984 Add unit test case for HWC2CHW.
5 years ago
Tinazhang c8b5586c7f add unit test for HWC2CHWC
5 years ago
Cathy Wong 58226addd6 Cleanup dataset UT: use md5 npz in test_zip for images
5 years ago
mindspore-ci-bot 47f5abceb4 !960 Adding example for grayscale
5 years ago
mindspore-ci-bot 078dd86cfe !507 Implemented padded_batch
5 years ago
mindspore-ci-bot de7625777f !951 fix: MindDataset with columns_name parameter cause errors in some scenes
5 years ago
eric 0f0548f21b Added test case for grayscale support
5 years ago
Zirui Wu c2d364a573 batch with padding implemented
5 years ago
jonyguo d4d236bcce fix: use MindDataset by column_names get data error in some situation
5 years ago
liyong b520ca9087 fix pk sampler in mindrecord
5 years ago
Cathy Wong 772e6c1461 Cleanup dataset UT: test_batch, save_and_check support
5 years ago
eric 36fffb7706 Added example md5 generation
5 years ago
Junhan Hu 83c68ca2ef Skip pyfunc test case
5 years ago
eric 26cb3e8a5f Added test function to show that seed doesn't work.
5 years ago
ms_yan c56fe3aa2d modify take op with an operator
5 years ago
mindspore-ci-bot 8af10eb51e !875 Reject python OP in operations argument for C++ uniform augmentation OP
5 years ago
Adel Shafiei d15bd04bfe added input validation to reject python op in C++ uniform augmentation operations list
5 years ago
mindspore-ci-bot a606c2e4da !872 [Dataset] Add schema support for GeneratorDataset
5 years ago
mindspore-ci-bot 2303453753 !869 Random data op
5 years ago
Junhan Hu c5a8ffe4f4 Add schema support for GeneratorDataset
5 years ago
Jesse Lee 5236d0c3c0 Replace print with logger.info
5 years ago
mindspore-ci-bot 8d3695f666 !672 Added UT for uniform augmentation C++ OP
5 years ago
Jesse Lee 270bf831a9 Random Data Op
5 years ago
jiangzhiwen 34bfa2f7c9 fix skip
5 years ago
Adel Shafiei 3322e65da9 added ut for uniform augment C++ op
5 years ago
mindspore-ci-bot b37db1edf5 !603 [MD] update pk sampler in minddataset
5 years ago
mindspore-ci-bot f82e63fecc !671 Added testcase for sync_wait
5 years ago
mindspore-ci-bot 0e3054d527 !466 Deepcopy problem when pyfunc cannot be pickled
5 years ago
liyong bfba630aa2 update pK_sampler
5 years ago
Zirui Wu 8c3931cf1d fix first epoch always shuffle with default seed in random sampler
5 years ago
eric 2d115cd04e Added example for multiple iterator
5 years ago
hesham a9e9266149 Deepcopy problem when pyfunc cannot be pickled
5 years ago
mindspore-ci-bot aad5771a62 !524 Added support for UA augmentation ops with tests
5 years ago
Amir Lashkari 56e7a7deb5 Added UniformAugment + Python Augmentation Ops
5 years ago
mindspore-ci-bot dc0491caf9 !508 [Dataset] Adding sync_wait operator for dataset
5 years ago
eric cd94518769 X# This is a combination of 2 commits.
5 years ago
Junhan Hu 78001ac9e6 Add multiprocessing support for Mindspore.Dataset.GeneratorDataset
5 years ago
mindspore-ci-bot fb18671b28 !506 [Dataset] Multiprocessing support for Pyfunc
5 years ago
Junhan Hu b13e7bc31a Add python multiprocessing support for Mindspore.dataset
5 years ago
qianlong db80f4ff92 The num_samples and numRows in schema for TFRecordDataset are conflict
5 years ago
mindspore-ci-bot d9e4dcc33b !483 Optimize skip dataset op
5 years ago
liyong f1542a90a3 add pk sampler
5 years ago
jiangzhiwen e1b109e8b8 optimize skip dataset op
5 years ago
Cathy Wong 60df369100 Fixup py Normalize doc: takes input CHW
5 years ago
mindspore-ci-bot 6369cf27bd !406 added first row crc check for when reading tfrecord files
5 years ago
mindspore-ci-bot 98fbd30a5b !460 [Data]Add filter operation
5 years ago
mindspore-ci-bot 822a3160e4 !404 [Dataset] Add Python Sampler support for CPP dataset
5 years ago
xulei2020 c705ea5e5b add filterOp code
5 years ago
Peilin Wang 9bc2134cb7 added checking of first row crc to find invalid tfrecord files
5 years ago
yanghaitao 2795e492ff TextFileDataset
5 years ago
Junhan Hu 43a2e99833 Add python sampler support for CPP dataset
5 years ago
ms_yan f0c07c3fa6 Realize take op and add ut
5 years ago
mindspore-ci-bot 80333e9f55 !435 Fix dataset serialize and deserialize for MindDataset
5 years ago
mindspore-ci-bot 40f0a4a4f4 !333 Add skip op to Dataset
5 years ago
mindspore-ci-bot 9e1b5efd1d !434 Bug in cleaning dataset iterators
5 years ago
anthonyaje ea297c0889 Fix dataset serdes for MindDataset
5 years ago
hesham 3c02c82771 Bug in weak reference.
5 years ago
jzw 3f7054dccb add skip dataset op
5 years ago
mindspore-ci-bot cf026096a6 !183 Mindspore.dataset CPP sampler for GeneratorDataset
5 years ago
Junhan Hu 9739d3b048 Add CPP sampler support for GeneratorDataset
5 years ago
mindspore-ci-bot 30de261c3c !243 Support nested repeat
5 years ago
hesham 0fc23eee0f Support nested repeat
5 years ago
xiefangqi 1a1cbc6814 implemention of new api: apply
5 years ago
liyong 0ce83e39e1 fix TestShardSampleWrongNumber
5 years ago
liyong 11403492ae add mindrecord subset random sampler
5 years ago
Cathy Wong 59a714c654 Correct shuffle UT buffer_size > #dataset-row as valid
5 years ago
jonyguo c688265671 fix: when use MindDataset block_reade=True hung
5 years ago
xiefangqi bc4602b58e fix and remove useless import of example, st, ut
5 years ago
mindspore-ci-bot 5c22c088bb !69 Enable skipped dataset zip operator python unit tests
5 years ago
anzhengqi 6a1b865c91 check num_samples
5 years ago
Cathy Wong 2e881276ab Enable skipped dataset zip python unit tests
5 years ago
qianlong 8c88b39da1 Optimize the execution time of test case test_rgb_hsv.py
5 years ago
jonyguo 34e42bd6f9 1. add more log info for dataset & mindrecord, 2. add two new testcase for MindDataset
5 years ago
zhunaipan 930a1fb0a8 initial version
5 years ago