java.lang.Object
  org.apache.hadoop.mapred.FileInputFormat<LongWritable,Text>
    org.apache.hadoop.mapred.TextInputFormat
public class TextInputFormat
An InputFormat for plain text files. Files are broken into lines. Either a linefeed or a carriage return signals the end of a line. Keys are the position in the file, and values are the line of text.
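The key/value layout described above can be illustrated without any Hadoop dependency. The following is a minimal sketch, not Hadoop's own reader: the class `OffsetLineSketch` and its `Record` type are hypothetical names, and it assumes that "position" means the byte offset at which each line starts and that `\r\n` counts as a single terminator.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

final class OffsetLineSketch {
    /** One record: the byte offset where the line starts (key) and its text (value). */
    static final class Record {
        final long offset;
        final String line;

        Record(long offset, String line) {
            this.offset = offset;
            this.line = line;
        }
    }

    /** Splits UTF-8 bytes into records, treating \n, \r, and \r\n as line terminators. */
    static List<Record> readRecords(byte[] data) {
        List<Record> records = new ArrayList<>();
        int start = 0;
        int i = 0;
        while (i < data.length) {
            byte b = data[i];
            if (b == '\n' || b == '\r') {
                records.add(new Record(start,
                        new String(data, start, i - start, StandardCharsets.UTF_8)));
                // Assumption: \r immediately followed by \n is one terminator.
                if (b == '\r' && i + 1 < data.length && data[i + 1] == '\n') {
                    i++;
                }
                i++;
                start = i;
            } else {
                i++;
            }
        }
        // Emit a final line that has no trailing terminator.
        if (start < data.length) {
            records.add(new Record(start,
                    new String(data, start, data.length - start, StandardCharsets.UTF_8)));
        }
        return records;
    }

    public static void main(String[] args) {
        byte[] data = "alpha\nbeta\r\ngamma".getBytes(StandardCharsets.UTF_8);
        for (Record r : readRecords(data)) {
            System.out.println(r.offset + "\t" + r.line);
        }
    }
}
```

For the sample input, the three records start at offsets 0, 6, and 12, which is what a mapper would receive as keys under the stated assumptions.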
Methods inherited from class org.apache.hadoop.mapred.FileInputFormat addInputPath, addInputPaths, computeSplitSize, getBlockIndex, getInputPathFilter, getInputPaths, getSplitHosts, getSplits, listStatus, setInputPathFilter, setInputPaths, setInputPaths, setMinSplitSize
Methods inherited from class java.lang.Object clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
TextInputFormat
public TextInputFormat()
public void configure(JobConf conf)
Description copied from interface: JobConfigurable
Initializes a new instance from a JobConf.
Specified by: configure in interface JobConfigurable
Parameters:
conf - the configuration
protected boolean isSplitable(FileSystem fs, Path file)
Description copied from class: FileInputFormat
Returns whether the given file can be split. FileInputFormat implementations can override this and return false to ensure that individual input files are never split up, so that Mappers process entire files.
Overrides: isSplitable in class FileInputFormat<LongWritable,Text>
Parameters:
fs - the file system that the file is on
file - the file name to check
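The effect of returning false from an isSplitable-style hook can be sketched without Hadoop on the classpath. Everything below is a hypothetical stand-in: `BaseFormat`, `WholeFileFormat`, and `splits` are illustrative names, and the fixed-size chunking is a simplification rather than the split computation FileInputFormat actually performs.

```java
import java.util.ArrayList;
import java.util.List;

final class SplitSketch {
    /** Hypothetical stand-in for an input format with an isSplitable-style hook. */
    static class BaseFormat {
        protected boolean isSplitable(String file) {
            return true;
        }

        /** Returns {start, length} pairs covering a file of the given length. */
        List<long[]> splits(String file, long fileLength, long splitSize) {
            List<long[]> result = new ArrayList<>();
            if (!isSplitable(file)) {
                // One split spans the whole file, so one mapper sees all of it.
                result.add(new long[] {0, fileLength});
                return result;
            }
            for (long start = 0; start < fileLength; start += splitSize) {
                result.add(new long[] {start, Math.min(splitSize, fileLength - start)});
            }
            return result;
        }
    }

    /** Overrides the hook so that files are never split. */
    static class WholeFileFormat extends BaseFormat {
        @Override
        protected boolean isSplitable(String file) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(new BaseFormat().splits("a.txt", 250, 100).size());
        System.out.println(new WholeFileFormat().splits("a.txt", 250, 100).size());
    }
}
```

With a 250-byte file and a 100-byte split size, the base format yields three splits and the overriding format yields one, which is the behavior the description attributes to returning false.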
public RecordReader<LongWritable,Text> getRecordReader(InputSplit genericSplit, JobConf job, Reporter reporter) throws IOException
Description copied from interface: InputFormat
Get the RecordReader for the given InputSplit.
It is the responsibility of the RecordReader to respect record boundaries while processing the logical split, so that it presents a record-oriented view to the individual task.
Specified by: getRecordReader in interface InputFormat<LongWritable,Text>
Specified by: getRecordReader in class FileInputFormat<LongWritable,Text>
Parameters:
genericSplit - the InputSplit
job - the job that this split belongs to
Returns: a RecordReader
Throws: IOException
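Respecting record boundaries matters because a byte-range split can begin or end in the middle of a line. The sketch below shows one possible convention for a line-oriented reader, in plain Java; it is not taken from Hadoop's source. The class `SplitBoundarySketch` and method `linesForSplit` are hypothetical, only `\n` is treated as a terminator, and the ownership rule (a split reads every line whose first byte lies after its start and at or before its end, plus the line at offset 0 for the first split) is an assumption made for illustration.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

final class SplitBoundarySketch {
    /** Returns the complete lines owned by the split [start, start + length). */
    static List<String> linesForSplit(byte[] data, long start, long length) {
        long end = start + length;
        int pos = (int) start;
        if (start != 0) {
            // Skip the line in progress; the previous split reads it in full.
            while (pos < data.length && data[pos] != '\n') {
                pos++;
            }
            pos++; // step past the newline
        }
        List<String> lines = new ArrayList<>();
        // Keep reading while the next line starts at or before the split end,
        // even if that line runs past the end of the split.
        while (pos < data.length && pos <= end) {
            int lineStart = pos;
            while (pos < data.length && data[pos] != '\n') {
                pos++;
            }
            lines.add(new String(data, lineStart, pos - lineStart, StandardCharsets.UTF_8));
            pos++; // step past the newline
        }
        return lines;
    }

    public static void main(String[] args) {
        byte[] data = "aa\nbbbb\ncc\n".getBytes(StandardCharsets.UTF_8);
        System.out.println(linesForSplit(data, 0, 4));
        System.out.println(linesForSplit(data, 4, 4));
        System.out.println(linesForSplit(data, 8, 3));
    }
}
```

Under this convention the split that starts inside `bbbb` does not emit a fragment of it; the first split reads that line to completion, so every line is delivered exactly once across the three splits.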