mindspore-ci-bot
41456ac824
!1369 dataset: delete StorageDataset
...
Merge pull request !1369 from ms_yan/del_storage
5 years ago
Tinazhang
7322839b04
add UTs for LinearTransformation, ToPIL, ToType
5 years ago
Peilin Wang
0cbcc7200b
made shuffle determinisitc for subsequent epochs
5 years ago
mindspore-ci-bot
61639be1e0
!1375 Cleanup dataset UT: remove num_parallel_workers=1 in test_exception
...
Merge pull request !1375 from cathwong/ckw_dataset_ut_cleanup5
5 years ago
mindspore-ci-bot
a528797253
!1377 Code Fix for Uniform Augmentation
...
Merge pull request !1377 from Tinazhang/cc
5 years ago
mindspore-ci-bot
458436186c
!1365 Clean up work for text python sub-package
...
Merge pull request !1365 from h.farahat/text_namespace
5 years ago
Tinazhang
b390883c6a
Bug fix
5 years ago
Cathy Wong
702005d403
Cleanup dataset UT: remove num_parallel_workers=1 in test_exception
5 years ago
ms_yan
d5e896b51c
delete storageDataset Op API and its test case
5 years ago
mindspore-ci-bot
6f733ec113
!1308 Stage 2 of adding support for string Tensor
...
Merge pull request !1308 from h.farahat/string_tensor2
5 years ago
xiefangqi
34236ce1f1
fix pylint
5 years ago
mindspore-ci-bot
3363d4e834
!1249 Add GNN dataset processing API
...
Merge pull request !1249 from heleiwang/hlw_gnn_data
5 years ago
hesham
6c21e556c4
Clean up work for text python package
5 years ago
heleiwang
599a449e0b
Support processing GNN data
5 years ago
jinyaohui
fbdba6e4da
clean pylint
5 years ago
hesham
df361d1d26
Change mem layout of string tensor
...
add support for MindRecord and TFRecord
----
optimize tensorshape
optimize tensorshape and FlatIndex
TFRecord and MindRecord support for string tensor
Modify mem layout
Add new constructor
Add method Allocate
Change some GetMutableBuffer usages to AllocateBuffer
5 years ago
mindspore-ci-bot
58e6d7d950
!1341 Added lookup and vocab to mindspore.dataset.text
...
Merge pull request !1341 from ZiruiWu/vocab_and_lookup
5 years ago
jonwe
bb51bb88d7
add compress in mindrecord
5 years ago
mindspore-ci-bot
2e3d55ed87
!1281 Implementation of SplitOp
...
Merge pull request !1281 from Peilin/splitOp
5 years ago
mindspore-ci-bot
39b9aedf68
!1342 Bug fix on issue Core dump on GPU when train with lenet with AU
...
Merge pull request !1342 from Tinazhang/cc
5 years ago
Peilin Wang
71e8bb1960
general split case done, chaining sampler (basic case) is working
...
implementation 99% complete
everything and tested except for repeatable shuffling
tested most basic/typical split usecases
cleanup
some more cleanup
fix CI
more ci fix
more ci fixes
more ci fix
more ci fix
more ci fix
added more tests, fixed some bugs
some more clean up and test cases
added shard/shuffle before split warning/error
addressed code review comments and ci
fixed ci
5 years ago
Tinazhang
e9e40b688b
Bug fix
5 years ago
Zirui Wu
25ab2ef303
Implemented lookup and vocab
5 years ago
mindspore-ci-bot
46949fc327
!1307 Cleanup dataset UT: unskip and enhance TFRecord sharding tests
...
Merge pull request !1307 from cathwong/ckw_dataset_ut_unskip2
5 years ago
qianlong
451c20a6f5
Add UnicodeCharTokenizer for nlp
5 years ago
mindspore-ci-bot
93e7c97a96
!1272 [Dataset] MindData Tree Optimizer Infrastructure
...
Merge pull request !1272 from JunhanHu/minddata_opt
5 years ago
Cathy Wong
b78894e02b
Cleanup dataset UT: unskip and enhance TFRecord sharding tests
5 years ago
Junhan Hu
f44d213503
MindData optimizer infrastructure.
5 years ago
xulei2020
163b6b7ea7
add jieba c++ code
5 years ago
Tinazhang
17cecf2cf5
Added TCs to RandomCrop and RandomCropAndResize and modified visalize() calling
5 years ago
jinyaohui
5a914994ba
clean pylint
5 years ago
jinyaohui
bcfaff97f9
clean pylint
5 years ago
hesham
e8ca243364
-Add DE_STRING
...
-replace switch'case by indexing
- Add test case
- Add constructors
- Add getItem string
- Fix bugs
- Add more tests
- Tensor iterator
- asNumpy
- TextFileDataset
- Tensor(Numpy)
- Super > 2D
- Add more test cases for GeneratorDataset
- Change StartAddr to GetBuffer and GetMutableNuffer
- Raise an error if batch is used with strings
Clean-up work
5 years ago
jiangzhiwen
cb2814b498
flat_map first commit
5 years ago
mindspore-ci-bot
c680cfbf27
!1157 dataset: add concat operation for dataset
...
Merge pull request !1157 from ms_yan/concat_dataset
5 years ago
mindspore-ci-bot
ab031ee9ea
!1126 VOCDataset support object detection function
...
Merge pull request !1126 from xiefangqi/voc_support_detection
5 years ago
xiefangqi
c937bad53f
minddata support voc
5 years ago
ms_yan
c0fa7b4b19
init commit of concat dataset
...
change to use __add__ operation instead ds.concat
5 years ago
jonyguo
be2e7531ca
fix: MindDataset parameter shard_id & num_shards check
5 years ago
Cathy Wong
913074e656
Cleanup dataset UT: resolve skipped test units
5 years ago
liyong
aa3f89e74f
mindrecord support read file list
5 years ago
Cathy Wong
49ef53f164
Cleanup dataset UT: util.py internals
5 years ago
mindspore-ci-bot
2860fd9338
!984 Add unit test case for HWC2CHW.
...
Merge pull request !984 from Tinazhang/hwc2chw
5 years ago
Tinazhang
c8b5586c7f
add unit test for HWC2CHWC
5 years ago
Cathy Wong
58226addd6
Cleanup dataset UT: use md5 npz in test_zip for images
5 years ago
mindspore-ci-bot
47f5abceb4
!960 Adding example for grayscale
...
Merge pull request !960 from EricZ/grayscale_fix
5 years ago
mindspore-ci-bot
078dd86cfe
!507 Implemented padded_batch
...
Merge pull request !507 from ZiruiWu/batch_with_padding
5 years ago
mindspore-ci-bot
de7625777f
!951 fix: MindDataset with columns_name parameter cause errors in some scenes
...
Merge pull request !951 from guozhijian/fix_read_by_columns
5 years ago
eric
0f0548f21b
Added test case for grayscale support
5 years ago
Zirui Wu
c2d364a573
batch with padding implemented
...
support for 1 specific dimension to be None, added validator
fix various CI complains
another round of CI fixes
ci
refactor parts of the code
code refactor
ci fix
comments added, fix bugs
address review comments
address review comments
review cmts
added simple perf test script
update pad code
perf imprv
5 years ago
jonyguo
d4d236bcce
fix: use MindDataset by column_names get data error in some situation
5 years ago
liyong
b520ca9087
fix pk sampler in mindrecord
5 years ago
Cathy Wong
772e6c1461
Cleanup dataset UT: test_batch, save_and_check support
5 years ago
eric
36fffb7706
Added example md5 generation
...
Comparison example
Added md5 and comparison example for py_transforms
Added md5 check for images
5 years ago
Junhan Hu
83c68ca2ef
Skip pyfunc test case
5 years ago
eric
26cb3e8a5f
Added test function to show that seed doesn't work.
...
Added testcase to show that c image aug don't use seed properly
Added passing test cases
Added working testcases for using seed
Added additional test cases to show seed use
Added test case for seed
5 years ago
ms_yan
c56fe3aa2d
modify take op with an operator
5 years ago
mindspore-ci-bot
8af10eb51e
!875 Reject python OP in operations argument for C++ uniform augmentation OP
...
Merge pull request !875 from AdelShafiei/ua_py
5 years ago
Adel Shafiei
d15bd04bfe
added input validation to reject python op in C++ uniform augmentation operations list
5 years ago
mindspore-ci-bot
a606c2e4da
!872 [Dataset] Add schema support for GeneratorDataset
...
Merge pull request !872 from JunhanHu/generator_schema
5 years ago
mindspore-ci-bot
2303453753
!869 Random data op
...
Merge pull request !869 from JesseKLee/random_data_op
5 years ago
Junhan Hu
c5a8ffe4f4
Add schema support for GeneratorDataset
5 years ago
Jesse Lee
5236d0c3c0
Replace print with logger.info
5 years ago
mindspore-ci-bot
8d3695f666
!672 Added UT for uniform augmentation C++ OP
...
Merge pull request !672 from AdelShafiei/ua_ut
5 years ago
Jesse Lee
270bf831a9
Random Data Op
5 years ago
jiangzhiwen
34bfa2f7c9
fix skip
5 years ago
Adel Shafiei
3322e65da9
added ut for uniform augment C++ op
5 years ago
mindspore-ci-bot
b37db1edf5
!603 [MD] update pk sampler in minddataset
...
Merge pull request !603 from liyong126/update_pk_sampler
5 years ago
mindspore-ci-bot
f82e63fecc
!671 Added testcase for sync_wait
...
Merge pull request !671 from EricZ/master
5 years ago
mindspore-ci-bot
0e3054d527
!466 Deepcopy problem when pyfunc cannot be pickled
...
Merge pull request !466 from h.farahat/deepcopy
5 years ago
liyong
bfba630aa2
update pK_sampler
5 years ago
Zirui Wu
8c3931cf1d
fix first epoch always shuffle with default seed in random sampler
...
fix ut
5 years ago
eric
2d115cd04e
Added example for multiple iterator
...
Added new testcase for multi iterator
Addressing review
Fixed typo
5 years ago
hesham
a9e9266149
Deepcopy problem when pyfunc cannot be pickled
5 years ago
mindspore-ci-bot
aad5771a62
!524 Added support for UA augmentation ops with tests
...
Merge pull request !524 from alashkari/ua-ops
5 years ago
Amir Lashkari
56e7a7deb5
Added UniformAugment + Python Augmentation Ops
5 years ago
mindspore-ci-bot
dc0491caf9
!508 [Dataset] Adding sync_wait operator for dataset
...
Merge pull request !508 from EricZ/master
5 years ago
eric
cd94518769
X# This is a combination of 2 commits.
...
Initial commit for dataset op python
Added signature to barrier
Adde compiling barrier code
Rebasing, fixed new compile errors
Final fix for make_unique
Added pybind API for barrier
Fixed pyfunc invocation
python interface - sync_wait
!1 sync_wait python interface
* python interface - sync_wait
fix test
update test
update test
Added new test case
add test case
test for shuffle + batch
Added two-sync test case
Restrited that no shuffle after sync
Added sync to pipeline info
block first databuffer as well
Intelligently get batch size
Fix default case
Lock Pair shares among all iterators
Added fix for empty character
Fixed up test case formatting
Fix end of epoch in sync_wait
Fixing CI
5 years ago
Junhan Hu
78001ac9e6
Add multiprocessing support for Mindspore.Dataset.GeneratorDataset
5 years ago
mindspore-ci-bot
fb18671b28
!506 [Dataset] Multiprocessing support for Pyfunc
...
Merge pull request !506 from JunhanHu/multiprocess_pyfunc
5 years ago
Junhan Hu
b13e7bc31a
Add python multiprocessing support for Mindspore.dataset
5 years ago
qianlong
db80f4ff92
The num_samples and numRows in schema for TFRecordDataset are conflict
5 years ago
mindspore-ci-bot
d9e4dcc33b
!483 Optimize skip dataset op
...
Merge pull request !483 from jiangzhiwen/dataset/skip_opt
5 years ago
liyong
f1542a90a3
add pk sampler
5 years ago
jiangzhiwen
e1b109e8b8
optimize skip dataset op
5 years ago
Cathy Wong
60df369100
Fixup py Normalize doc: takes input CHW
5 years ago
mindspore-ci-bot
6369cf27bd
!406 added first row crc check for when reading tfrecord files
...
Merge pull request !406 from Peilin/first-row-crc-check
5 years ago
mindspore-ci-bot
98fbd30a5b
!460 [Data]Add filter operation
...
Merge pull request !460 from xulei/filter_master
5 years ago
mindspore-ci-bot
822a3160e4
!404 [Dataset] Add Python Sampler support for CPP dataset
...
Merge pull request !404 from JunhanHu/python_sampler_for_cpp_dataset
5 years ago
xulei2020
c705ea5e5b
add filterOp code
5 years ago
Peilin Wang
9bc2134cb7
added checking of first row crc to find invalid tfrecord files
...
addressed code review comments. added check in python layer to exclude directories and to raise an error if a pattern does not match any file
fixed clang format
fixed cppcheck
fixed cppcheck (used std::accumulate and std::copy_if). regenerated tfrecord file to contain correct header, it was a dummy header before
fixed cppcheck: added const reference for string parameter for lambdas, fixed clang format: whitespace adjustments
more clang whitespace fixes...
changed print to logger.info
5 years ago
yanghaitao
2795e492ff
TextFileDataset
5 years ago
Junhan Hu
43a2e99833
Add python sampler support for CPP dataset
5 years ago
ms_yan
f0c07c3fa6
Realize take op and add ut
5 years ago
mindspore-ci-bot
80333e9f55
!435 Fix dataset serialize and deserialize for MindDataset
...
Merge pull request !435 from anthonyaje/fix_serdes_dataset
5 years ago
mindspore-ci-bot
40f0a4a4f4
!333 Add skip op to Dataset
...
Merge pull request !333 from jiangzhiwen/dataset/skip
5 years ago
mindspore-ci-bot
9e1b5efd1d
!434 Bug in cleaning dataset iterators
...
Merge pull request !434 from h.farahat/multi_itr_bug
5 years ago
anthonyaje
ea297c0889
Fix dataset serdes for MindDataset
5 years ago
hesham
3c02c82771
Bug in weak reference.
...
Add new test cases
5 years ago
jzw
3f7054dccb
add skip dataset op
5 years ago
mindspore-ci-bot
cf026096a6
!183 Mindspore.dataset CPP sampler for GeneratorDataset
...
Merge pull request !183 from JunhanHu/cpp_sampler
5 years ago
Junhan Hu
9739d3b048
Add CPP sampler support for GeneratorDataset
5 years ago
mindspore-ci-bot
30de261c3c
!243 Support nested repeat
...
Merge pull request !243 from h.farahat/nested_repeat
5 years ago
hesham
0fc23eee0f
Support nested repeat
5 years ago
xiefangqi
1a1cbc6814
implemention of new api: apply
5 years ago
liyong
0ce83e39e1
fix TestShardSampleWrongNumber
...
adjust internal api
5 years ago
liyong
11403492ae
add mindrecord subset random sampler
5 years ago
Cathy Wong
59a714c654
Correct shuffle UT buffer_size > #dataset-row as valid
5 years ago
jonyguo
c688265671
fix: when use MindDataset block_reade=True hung
5 years ago
xiefangqi
bc4602b58e
fix and remove useless import of example, st, ut
5 years ago
mindspore-ci-bot
5c22c088bb
!69 Enable skipped dataset zip operator python unit tests
...
Merge pull request !69 from cathwong/ckw_dataset_py_ut1
5 years ago
anzhengqi
6a1b865c91
check num_samples
5 years ago
Cathy Wong
2e881276ab
Enable skipped dataset zip python unit tests
...
Signed-off-by: Cathy Wong <cathy.wong@huawei.com>
5 years ago
qianlong
8c88b39da1
Optimize the execution time of test case test_rgb_hsv.py
5 years ago
jonyguo
34e42bd6f9
1. add more log info for dataset & mindrecord, 2. add two new testcase for MindDataset
5 years ago
zhunaipan
930a1fb0a8
initial version
...
Signed-off-by: leonwanghui <leon.wanghui@huawei.com>
5 years ago