限定符和类型 | 方法和说明 |
---|---|
Class<? extends InputFormat<?,?>> |
JobContext.getInputFormatClass()
Get the
InputFormat class for the job. |
限定符和类型 | 方法和说明 |
---|---|
void |
Job.setInputFormatClass(Class<? extends InputFormat> cls)
Set the
InputFormat for the job. |
限定符和类型 | 类和说明 |
---|---|
class |
DataDrivenDBInputFormat<T extends DBWritable>
A InputFormat that reads input data from an SQL table.
|
class |
DBInputFormat<T extends DBWritable>
A InputFormat that reads input data from an SQL table.
|
class |
OracleDataDrivenDBInputFormat<T extends DBWritable>
A InputFormat that reads input data from an SQL table in an Oracle db.
|
限定符和类型 | 类和说明 |
---|---|
class |
CombineFileInputFormat<K,V>
|
class |
DelegatingInputFormat<K,V>
An
InputFormat that delegates behavior of paths to multiple other
InputFormats. |
class |
FileInputFormat<K,V>
A base class for file-based
InputFormat s. |
class |
KeyValueTextInputFormat
An
InputFormat for plain text files. |
class |
NLineInputFormat
NLineInputFormat which splits N lines of input as one split.
|
class |
SequenceFileAsBinaryInputFormat
InputFormat reading keys, values from SequenceFiles in binary (raw)
format.
|
class |
SequenceFileAsTextInputFormat
This class is similar to SequenceFileInputFormat, except it generates
SequenceFileAsTextRecordReader which converts the input keys and values
to their String forms by calling toString() method.
|
class |
SequenceFileInputFilter<K,V>
A class that allows a map/red job to work on a sample of sequence files.
|
class |
SequenceFileInputFormat<K,V>
An
InputFormat for SequenceFile s. |
class |
TextInputFormat
An
InputFormat for plain text files. |
限定符和类型 | 方法和说明 |
---|---|
static void |
MultipleInputs.addInputPath(Job job,
Path path,
Class<? extends InputFormat> inputFormatClass)
Add a
Path with a custom InputFormat to the list of
inputs for the map-reduce job. |
static void |
MultipleInputs.addInputPath(Job job,
Path path,
Class<? extends InputFormat> inputFormatClass,
Class<? extends Mapper> mapperClass)
|
限定符和类型 | 方法和说明 |
---|---|
K[] |
InputSampler.Sampler.getSample(InputFormat<K,V> inf,
Job job)
For a given job, collect and return a subset of the keys from the
input data.
|
K[] |
InputSampler.SplitSampler.getSample(InputFormat<K,V> inf,
Job job)
From each split sampled, take the first numSamples / numSplits records.
|
K[] |
InputSampler.RandomSampler.getSample(InputFormat<K,V> inf,
Job job)
Randomize the split order, then take the specified number of keys from
each split sampled, where each key is selected with the specified
probability and possibly replaced by a subsequently selected key when
the quota of keys from that split is satisfied.
|
K[] |
InputSampler.IntervalSampler.getSample(InputFormat<K,V> inf,
Job job)
For each split sampled, emit when the ratio of the number of records
retained to the total record count is less than the specified
frequency.
|
限定符和类型 | 类和说明 |
---|---|
class |
UniformSizeInputFormat
UniformSizeInputFormat extends the InputFormat<> class, to produce
input-splits for DistCp.
|
限定符和类型 | 类和说明 |
---|---|
class |
DynamicInputFormat<K,V>
DynamicInputFormat implements the "Worker pattern" for DistCp.
|
限定符和类型 | 方法和说明 |
---|---|
static Class<? extends InputFormat> |
DistCpUtils.getStrategy(Configuration conf,
DistCpOptions options)
Returns the class that implements a copy strategy.
|
Copyright © 2009 The Apache Software Foundation