Yancey1989
cbabaa4544
convert dataset into recordio format
8 years ago
武毅
c10121e13c
[Done] Sync master client between passes and fix recordio split ( #2948 )
...
* fix recordio split and task passes
* update for pre commit
* update
* update, still need to sync client wait for pass end.
* able to sync passes for task dispatching
* update to comment
* update
* fix yapf check
* why local pre-commit fails? version is the same
* fix race condition
* update
* fix race condition
* this still have duplicate problem in unit test
* update
* update
* update by comment
* update
8 years ago
dongzhihong
e5cb9b77b6
"fix gen list"
8 years ago
livc
ea09edc991
pass precommit
8 years ago
wanghaoshuang
ceb9a73aaa
fix import err
8 years ago
wanghaoshuang
b142a6b063
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into voc_dataset
8 years ago
wanghaoshuang
302c4f11d1
rename voc_seg to voc2012
8 years ago
Yancey
83f263e6ec
Fix fetch record from master failed ( #2848 )
...
Fix fetch record from master
8 years ago
dzhwinter
9b5adfa482
Merge pull request #2905 from dzhwinter/fix_mq2007
...
"fix dense vector shape in mq2007"
8 years ago
武毅
23b8346072
Fault tolerant distributed training, just work version, with etcd ( #2849 )
...
* using etcd as fault tolerant training
* update
* workable version, ft not tested
* small fix
* update
* remove TODO
8 years ago
dongzhihong
56fbed9a42
"fix dense vector shape in mq2007"
8 years ago
wanghaoshuang
4a5c3714ea
fix python dependency for voc2012 dataset
8 years ago
wanghaoshuang
1ba879bead
Use PIL to read image in palette mode
8 years ago
wanghaoshuang
c4f301ded7
Modify comments and fix code format.
8 years ago
wanghaoshuang
a5239ac7a5
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into voc_dataset
8 years ago
caoying03
68ec558c21
update the md5sum of WMT14 trained model.
8 years ago
whs
a698824a8a
Merge branch 'develop' into voc_dataset
8 years ago
wanghaoshuang
338dd13542
Add voc2012 dataset for image segment
8 years ago
dangqingqing
0925681543
fix typo.
8 years ago
dangqingqing
6a7f1212f5
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into row_conv_fix
8 years ago
dangqingqing
e287034d73
minus mean in flowers dataset.
8 years ago
dangqingqing
0c70f34c60
Fix bug for flowers dataset and row_conv.
8 years ago
gongweibao
5b2f9939c2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fixdownloadbug
8 years ago
gongweibao
48fdfd5a03
fix bugs
8 years ago
wanghaoshuang
01f44bff66
rename args and add comments
...
1. rename 'useXmap' to 'use_xmap'
2. add comments about exchanging train data and test data
8 years ago
wanghaoshuang
80642bee00
fix_xmap and refine flowers dataset
8 years ago
Your Name
e915aa9cf1
fix bugs
8 years ago
root
97270b9f27
add convert function
8 years ago
Yi Wang
ff4be82252
Handle multiple processes trying to create the data home directory
8 years ago
Helin Wang
936ac658c0
Fix unit test error: File exists: /root/.cache/paddle/dataset
8 years ago
gongweibao
77c4dce759
modify 10000 to 1000
8 years ago
gongweibao
283bdc5062
fix by helin's comments
8 years ago
gongweibao
9011f9e52c
add precommit
8 years ago
wanghaoshuang
990b7d7bc9
rename xmap to xmap_readers and remove 'from cv2 import resize' in image.py
8 years ago
wanghaoshuang
e62a4d7abe
xmap: change multiprocess to multithread.
...
images reader: read the data without untarring the tarball file.
image.py: move batch function from reader to image.py
8 years ago
wanghaoshuang@baidu.com
2799b0ec50
Add flowers dataset for image classification model
8 years ago
Yancey
97a594e7f8
Split dataset into multiple files ( #2320 )
...
cluster dataset split and reader
8 years ago
dzhwinter
6e91ebc0dd
"remove extrafile"
8 years ago
dzhwinter
b098ef69a4
"remove the rar extractfile, prevent small files"
8 years ago
dzhwinter
590c6038fc
"format Test"
8 years ago
dzhwinter
a4313de808
"remove the pairwise other genereate method"
8 years ago
dzhwinter
4ac5caaaa7
"formatter"
8 years ago
dzhwinter
d86fb1d133
"precommit format with github style"
8 years ago
dzhwinter
82eb0fe45b
"fix len type error of QueryList"
8 years ago
dzhwinter
16d6bd7c38
"fix label genenerate type. avoid IVector create error when init label"
8 years ago
dzhwinter
d7ef562450
"better format"
8 years ago
dzhwinter
96ca1e966a
"add mq2007 dataset for learning to rank task"
8 years ago
Cao Ying
81fb41f097
Merge pull request #2023 from pkuyym/develop
...
Add dataset PTB into paddle.dataset for language model task.
8 years ago
yangyaming
286696aa2b
extend imikolov instead of adding ptb
8 years ago
yangyaming
70d15e84be
Add dataset ptb
8 years ago