Merge pull request #14702 from barrierye/update_checkfile_in_datafeed

add the comment for CheckFile function
revert-14666-feature/estiminate_flops
guru4elephant 6 years ago committed by GitHub
commit f9e4d49bbc
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

@ -259,6 +259,14 @@ bool MultiSlotDataFeed::CheckFile(const char* filename) {
return false;
}
}
// It may be added '\t' character to the end of the output of reduce
// task when processes data by Hadoop(when the output of the reduce
// task of Hadoop has only one field, it will add a '\t' at the end
// of the line by default, and you can use this option to avoid it:
// `-D mapred.textoutputformat.ignoreseparator=true`), which does
// not affect the correctness of the data. Therefore, it should be
// judged that the data is not normal when the end of each line of
// data contains characters which are not spaces.
while (endptr - str != len) {
if (!isspace(*(endptr++))) {
VLOG(0)

Loading…
Cancel
Save