Commit Graph

266 Commits (915ddd25dd82ba90eb6d8e01a1d1cd935240ec82)

Author SHA1 Message Date
mindspore-ci-bot 41456ac824 !1369 dataset: delete StorageDataset
5 years ago
Tinazhang 7322839b04 add UTs for LinearTransformation, ToPIL, ToType
5 years ago
Peilin Wang 0cbcc7200b made shuffle determinisitc for subsequent epochs
5 years ago
mindspore-ci-bot 61639be1e0 !1375 Cleanup dataset UT: remove num_parallel_workers=1 in test_exception
5 years ago
mindspore-ci-bot a528797253 !1377 Code Fix for Uniform Augmentation
5 years ago
mindspore-ci-bot 458436186c !1365 Clean up work for text python sub-package
5 years ago
Tinazhang b390883c6a Bug fix
5 years ago
Cathy Wong 702005d403 Cleanup dataset UT: remove num_parallel_workers=1 in test_exception
5 years ago
ms_yan d5e896b51c delete storageDataset Op API and its test case
5 years ago
mindspore-ci-bot 6f733ec113 !1308 Stage 2 of adding support for string Tensor
5 years ago
xiefangqi 34236ce1f1 fix pylint
5 years ago
mindspore-ci-bot 3363d4e834 !1249 Add GNN dataset processing API
5 years ago
hesham 6c21e556c4 Clean up work for text python package
5 years ago
heleiwang 599a449e0b Support processing GNN data
5 years ago
jinyaohui fbdba6e4da clean pylint
5 years ago
hesham df361d1d26 Change mem layout of string tensor
5 years ago
mindspore-ci-bot 58e6d7d950 !1341 Added lookup and vocab to mindspore.dataset.text
5 years ago
jonwe bb51bb88d7 add compress in mindrecord
5 years ago
mindspore-ci-bot 2e3d55ed87 !1281 Implementation of SplitOp
5 years ago
mindspore-ci-bot 39b9aedf68 !1342 Bug fix on issue Core dump on GPU when train with lenet with AU
5 years ago
Peilin Wang 71e8bb1960 general split case done, chaining sampler (basic case) is working
5 years ago
Tinazhang e9e40b688b Bug fix
5 years ago
Zirui Wu 25ab2ef303 Implemented lookup and vocab
5 years ago
mindspore-ci-bot 46949fc327 !1307 Cleanup dataset UT: unskip and enhance TFRecord sharding tests
5 years ago
qianlong 451c20a6f5 Add UnicodeCharTokenizer for nlp
5 years ago
mindspore-ci-bot 93e7c97a96 !1272 [Dataset] MindData Tree Optimizer Infrastructure
5 years ago
Cathy Wong b78894e02b Cleanup dataset UT: unskip and enhance TFRecord sharding tests
5 years ago
Junhan Hu f44d213503 MindData optimizer infrastructure.
5 years ago
xulei2020 163b6b7ea7 add jieba c++ code
5 years ago
Tinazhang 17cecf2cf5 Added TCs to RandomCrop and RandomCropAndResize and modified visalize() calling
5 years ago
jinyaohui 5a914994ba clean pylint
5 years ago
jinyaohui bcfaff97f9 clean pylint
5 years ago
hesham e8ca243364 -Add DE_STRING
5 years ago
jiangzhiwen cb2814b498 flat_map first commit
5 years ago
mindspore-ci-bot c680cfbf27 !1157 dataset: add concat operation for dataset
5 years ago
mindspore-ci-bot ab031ee9ea !1126 VOCDataset support object detection function
5 years ago
xiefangqi c937bad53f minddata support voc
5 years ago
ms_yan c0fa7b4b19 init commit of concat dataset
5 years ago
jonyguo be2e7531ca fix: MindDataset parameter shard_id & num_shards check
5 years ago
Cathy Wong 913074e656 Cleanup dataset UT: resolve skipped test units
5 years ago
liyong aa3f89e74f mindrecord support read file list
5 years ago
Cathy Wong 49ef53f164 Cleanup dataset UT: util.py internals
5 years ago
mindspore-ci-bot 2860fd9338 !984 Add unit test case for HWC2CHW.
5 years ago
Tinazhang c8b5586c7f add unit test for HWC2CHWC
5 years ago
Cathy Wong 58226addd6 Cleanup dataset UT: use md5 npz in test_zip for images
5 years ago
mindspore-ci-bot 47f5abceb4 !960 Adding example for grayscale
5 years ago
mindspore-ci-bot 078dd86cfe !507 Implemented padded_batch
5 years ago
mindspore-ci-bot de7625777f !951 fix: MindDataset with columns_name parameter cause errors in some scenes
5 years ago
eric 0f0548f21b Added test case for grayscale support
5 years ago
Zirui Wu c2d364a573 batch with padding implemented
5 years ago
jonyguo d4d236bcce fix: use MindDataset by column_names get data error in some situation
5 years ago
liyong b520ca9087 fix pk sampler in mindrecord
5 years ago
Cathy Wong 772e6c1461 Cleanup dataset UT: test_batch, save_and_check support
5 years ago
eric 36fffb7706 Added example md5 generation
5 years ago
Junhan Hu 83c68ca2ef Skip pyfunc test case
5 years ago
eric 26cb3e8a5f Added test function to show that seed doesn't work.
5 years ago
ms_yan c56fe3aa2d modify take op with an operator
5 years ago
mindspore-ci-bot 8af10eb51e !875 Reject python OP in operations argument for C++ uniform augmentation OP
5 years ago
Adel Shafiei d15bd04bfe added input validation to reject python op in C++ uniform augmentation operations list
5 years ago
mindspore-ci-bot a606c2e4da !872 [Dataset] Add schema support for GeneratorDataset
5 years ago
mindspore-ci-bot 2303453753 !869 Random data op
5 years ago
Junhan Hu c5a8ffe4f4 Add schema support for GeneratorDataset
5 years ago
Jesse Lee 5236d0c3c0 Replace print with logger.info
5 years ago
mindspore-ci-bot 8d3695f666 !672 Added UT for uniform augmentation C++ OP
5 years ago
Jesse Lee 270bf831a9 Random Data Op
5 years ago
jiangzhiwen 34bfa2f7c9 fix skip
5 years ago
Adel Shafiei 3322e65da9 added ut for uniform augment C++ op
5 years ago
mindspore-ci-bot b37db1edf5 !603 [MD] update pk sampler in minddataset
5 years ago
mindspore-ci-bot f82e63fecc !671 Added testcase for sync_wait
5 years ago
mindspore-ci-bot 0e3054d527 !466 Deepcopy problem when pyfunc cannot be pickled
5 years ago
liyong bfba630aa2 update pK_sampler
5 years ago
Zirui Wu 8c3931cf1d fix first epoch always shuffle with default seed in random sampler
5 years ago
eric 2d115cd04e Added example for multiple iterator
5 years ago
hesham a9e9266149 Deepcopy problem when pyfunc cannot be pickled
5 years ago
mindspore-ci-bot aad5771a62 !524 Added support for UA augmentation ops with tests
5 years ago
Amir Lashkari 56e7a7deb5 Added UniformAugment + Python Augmentation Ops
5 years ago
mindspore-ci-bot dc0491caf9 !508 [Dataset] Adding sync_wait operator for dataset
5 years ago
eric cd94518769 X# This is a combination of 2 commits.
5 years ago
Junhan Hu 78001ac9e6 Add multiprocessing support for Mindspore.Dataset.GeneratorDataset
5 years ago
mindspore-ci-bot fb18671b28 !506 [Dataset] Multiprocessing support for Pyfunc
5 years ago
Junhan Hu b13e7bc31a Add python multiprocessing support for Mindspore.dataset
5 years ago
qianlong db80f4ff92 The num_samples and numRows in schema for TFRecordDataset are conflict
5 years ago
mindspore-ci-bot d9e4dcc33b !483 Optimize skip dataset op
5 years ago
liyong f1542a90a3 add pk sampler
5 years ago
jiangzhiwen e1b109e8b8 optimize skip dataset op
5 years ago
Cathy Wong 60df369100 Fixup py Normalize doc: takes input CHW
5 years ago
mindspore-ci-bot 6369cf27bd !406 added first row crc check for when reading tfrecord files
5 years ago
mindspore-ci-bot 98fbd30a5b !460 [Data]Add filter operation
5 years ago
mindspore-ci-bot 822a3160e4 !404 [Dataset] Add Python Sampler support for CPP dataset
5 years ago
xulei2020 c705ea5e5b add filterOp code
5 years ago
Peilin Wang 9bc2134cb7 added checking of first row crc to find invalid tfrecord files
5 years ago
yanghaitao 2795e492ff TextFileDataset
5 years ago
Junhan Hu 43a2e99833 Add python sampler support for CPP dataset
5 years ago
ms_yan f0c07c3fa6 Realize take op and add ut
5 years ago
mindspore-ci-bot 80333e9f55 !435 Fix dataset serialize and deserialize for MindDataset
5 years ago
mindspore-ci-bot 40f0a4a4f4 !333 Add skip op to Dataset
5 years ago
mindspore-ci-bot 9e1b5efd1d !434 Bug in cleaning dataset iterators
5 years ago
anthonyaje ea297c0889 Fix dataset serdes for MindDataset
5 years ago
hesham 3c02c82771 Bug in weak reference.
5 years ago
jzw 3f7054dccb add skip dataset op
5 years ago
mindspore-ci-bot cf026096a6 !183 Mindspore.dataset CPP sampler for GeneratorDataset
5 years ago
Junhan Hu 9739d3b048 Add CPP sampler support for GeneratorDataset
5 years ago
mindspore-ci-bot 30de261c3c !243 Support nested repeat
5 years ago
hesham 0fc23eee0f Support nested repeat
5 years ago
xiefangqi 1a1cbc6814 implemention of new api: apply
5 years ago
liyong 0ce83e39e1 fix TestShardSampleWrongNumber
5 years ago
liyong 11403492ae add mindrecord subset random sampler
5 years ago
Cathy Wong 59a714c654 Correct shuffle UT buffer_size > #dataset-row as valid
5 years ago
jonyguo c688265671 fix: when use MindDataset block_reade=True hung
5 years ago
xiefangqi bc4602b58e fix and remove useless import of example, st, ut
5 years ago
mindspore-ci-bot 5c22c088bb !69 Enable skipped dataset zip operator python unit tests
5 years ago
anzhengqi 6a1b865c91 check num_samples
5 years ago
Cathy Wong 2e881276ab Enable skipped dataset zip python unit tests
5 years ago
qianlong 8c88b39da1 Optimize the execution time of test case test_rgb_hsv.py
5 years ago
jonyguo 34e42bd6f9 1. add more log info for dataset & mindrecord, 2. add two new testcase for MindDataset
5 years ago
zhunaipan 930a1fb0a8 initial version
5 years ago