程序包 | 说明 |
---|---|
org.apache.hadoop.mapred |
A software framework for easily writing applications which process vast
amounts of data (multi-terabyte data-sets) parallelly on large clusters
(thousands of nodes) built of commodity hardware in a reliable, fault-tolerant
manner.
|
org.apache.hadoop.mapreduce | |
org.apache.hadoop.mapreduce.lib.db | |
org.apache.hadoop.mapreduce.lib.input | |
org.apache.hadoop.mapreduce.split | |
org.apache.hadoop.tools.distcp2.mapred | |
org.apache.hadoop.tools.distcp2.mapred.lib | |
org.apache.hadoop.tools.rumen |
Rumen is a data extraction and analysis tool built for
Apache Hadoop.
|
限定符和类型 | 类和说明 |
---|---|
class |
FileSplit
A section of an input file.
|
限定符和类型 | 方法和说明 |
---|---|
InputSplit |
MapContext.getInputSplit()
Get the input split for this map.
|
限定符和类型 | 方法和说明 |
---|---|
abstract List<InputSplit> |
InputFormat.getSplits(JobContext context)
Logically split the set of input files for the job.
|
限定符和类型 | 方法和说明 |
---|---|
abstract RecordReader<K,V> |
InputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context)
Create a record reader for a given split.
|
abstract void |
RecordReader.initialize(InputSplit split,
TaskAttemptContext context)
Called once at initialization.
|
构造器和说明 |
---|
MapContext(Configuration conf,
TaskAttemptID taskid,
RecordReader<KEYIN,VALUEIN> reader,
RecordWriter<KEYOUT,VALUEOUT> writer,
OutputCommitter committer,
StatusReporter reporter,
InputSplit split) |
Mapper.Context(Configuration conf,
TaskAttemptID taskid,
RecordReader<KEYIN,VALUEIN> reader,
RecordWriter<KEYOUT,VALUEOUT> writer,
OutputCommitter committer,
StatusReporter reporter,
InputSplit split) |
限定符和类型 | 类和说明 |
---|---|
static class |
DataDrivenDBInputFormat.DataDrivenDBInputSplit
A InputSplit that spans a set of rows
|
static class |
DBInputFormat.DBInputSplit
A InputSplit that spans a set of rows
|
限定符和类型 | 方法和说明 |
---|---|
List<InputSplit> |
DBInputFormat.getSplits(JobContext job)
Logically split the set of input files for the job.
|
List<InputSplit> |
DataDrivenDBInputFormat.getSplits(JobContext job)
Logically split the set of input files for the job.
|
List<InputSplit> |
TextSplitter.split(Configuration conf,
ResultSet results,
String colName)
This method needs to determine the splits between two user-provided strings.
|
List<InputSplit> |
IntegerSplitter.split(Configuration conf,
ResultSet results,
String colName) |
List<InputSplit> |
FloatSplitter.split(Configuration conf,
ResultSet results,
String colName) |
List<InputSplit> |
DBSplitter.split(Configuration conf,
ResultSet results,
String colName)
Given a ResultSet containing one record (and already advanced to that record)
with two columns (a low value, and a high value, both of the same type), determine
a set of splits that span the given values.
|
List<InputSplit> |
DateSplitter.split(Configuration conf,
ResultSet results,
String colName) |
List<InputSplit> |
BooleanSplitter.split(Configuration conf,
ResultSet results,
String colName) |
List<InputSplit> |
BigDecimalSplitter.split(Configuration conf,
ResultSet results,
String colName) |
限定符和类型 | 方法和说明 |
---|---|
RecordReader<LongWritable,T> |
DBInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context)
Create a record reader for a given split.
|
void |
DBRecordReader.initialize(InputSplit split,
TaskAttemptContext context) |
限定符和类型 | 类和说明 |
---|---|
class |
CombineFileSplit
A sub-collection of input files.
|
限定符和类型 | 方法和说明 |
---|---|
List<InputSplit> |
NLineInputFormat.getSplits(JobContext job)
Logically splits the set of input files for the job, splits N lines
of the input as one split.
|
List<InputSplit> |
FileInputFormat.getSplits(JobContext job)
Generate the list of files and make them into FileSplits.
|
List<InputSplit> |
DelegatingInputFormat.getSplits(JobContext job) |
List<InputSplit> |
CombineFileInputFormat.getSplits(JobContext job) |
限定符和类型 | 方法和说明 |
---|---|
RecordReader<LongWritable,Text> |
TextInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context) |
RecordReader<K,V> |
SequenceFileInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context) |
RecordReader<K,V> |
SequenceFileInputFilter.createRecordReader(InputSplit split,
TaskAttemptContext context)
Create a record reader for the given split
|
RecordReader<Text,Text> |
SequenceFileAsTextInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context) |
RecordReader<BytesWritable,BytesWritable> |
SequenceFileAsBinaryInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context) |
RecordReader<LongWritable,Text> |
NLineInputFormat.createRecordReader(InputSplit genericSplit,
TaskAttemptContext context) |
RecordReader<Text,Text> |
KeyValueTextInputFormat.createRecordReader(InputSplit genericSplit,
TaskAttemptContext context) |
RecordReader<K,V> |
DelegatingInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context) |
abstract RecordReader<K,V> |
CombineFileInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context)
This is not implemented yet.
|
void |
SequenceFileRecordReader.initialize(InputSplit split,
TaskAttemptContext context) |
void |
SequenceFileAsTextRecordReader.initialize(InputSplit split,
TaskAttemptContext context) |
void |
SequenceFileAsBinaryInputFormat.SequenceFileAsBinaryRecordReader.initialize(InputSplit split,
TaskAttemptContext context) |
void |
LineRecordReader.initialize(InputSplit genericSplit,
TaskAttemptContext context) |
void |
KeyValueLineRecordReader.initialize(InputSplit genericSplit,
TaskAttemptContext context) |
void |
DelegatingRecordReader.initialize(InputSplit split,
TaskAttemptContext context) |
void |
CombineFileRecordReader.initialize(InputSplit split,
TaskAttemptContext context) |
构造器和说明 |
---|
DelegatingRecordReader(InputSplit split,
TaskAttemptContext context)
Constructs the DelegatingRecordReader.
|
限定符和类型 | 方法和说明 |
---|---|
static <T extends InputSplit> |
JobSplitWriter.createSplitFiles(Path jobSubmitDir,
Configuration conf,
FileSystem fs,
List<InputSplit> splits) |
static <T extends InputSplit> |
JobSplitWriter.createSplitFiles(Path jobSubmitDir,
Configuration conf,
FileSystem fs,
T[] splits) |
限定符和类型 | 方法和说明 |
---|---|
static <T extends InputSplit> |
JobSplitWriter.createSplitFiles(Path jobSubmitDir,
Configuration conf,
FileSystem fs,
T[] splits) |
限定符和类型 | 方法和说明 |
---|---|
static <T extends InputSplit> |
JobSplitWriter.createSplitFiles(Path jobSubmitDir,
Configuration conf,
FileSystem fs,
List<InputSplit> splits) |
构造器和说明 |
---|
JobSplit.SplitMetaInfo(InputSplit split,
long startOffset) |
JobSplit.TaskSplitMetaInfo(InputSplit split,
long startOffset) |
限定符和类型 | 方法和说明 |
---|---|
List<InputSplit> |
UniformSizeInputFormat.getSplits(JobContext context)
Implementation of InputFormat::getSplits().
|
限定符和类型 | 方法和说明 |
---|---|
RecordReader<Text,FileStatus> |
UniformSizeInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context)
Implementation of InputFormat::createRecordReader().
|
限定符和类型 | 方法和说明 |
---|---|
List<InputSplit> |
DynamicInputFormat.getSplits(JobContext jobContext)
Implementation of InputFormat::getSplits().
|
限定符和类型 | 方法和说明 |
---|---|
RecordReader<K,V> |
DynamicInputFormat.createRecordReader(InputSplit inputSplit,
TaskAttemptContext taskAttemptContext)
Implementation of Inputformat::createRecordReader().
|
void |
DynamicRecordReader.initialize(InputSplit inputSplit,
TaskAttemptContext taskAttemptContext)
Implementation for RecordReader::initialize().
|
限定符和类型 | 方法和说明 |
---|---|
InputSplit[] |
ZombieJob.getInputSplits() |
InputSplit[] |
JobStory.getInputSplits()
Get the input splits for the job.
|
Copyright © 2009 The Apache Software Foundation