程序包 | 说明 |
---|---|
org.apache.hadoop.examples |
Hadoop example code.
|
org.apache.hadoop.examples.dancing |
This package is a distributed implementation of Knuth's dancing links
algorithm that can run under Hadoop.
|
org.apache.hadoop.examples.terasort |
This package consists of 3 map/reduce applications for Hadoop to
compete in the annual terabyte sort
competition.
|
org.apache.hadoop.fs |
An abstract file system API.
|
org.apache.hadoop.fs.ftp | |
org.apache.hadoop.fs.kfs |
A client for the Kosmos filesystem (KFS)
Introduction
This pages describes how to use Kosmos Filesystem
( KFS ) as a backing
store with Hadoop.
|
org.apache.hadoop.fs.s3 |
A distributed, block-based implementation of
FileSystem that uses Amazon S3
as a backing store. |
org.apache.hadoop.fs.s3native |
A distributed implementation of
FileSystem for reading and writing files on
Amazon S3. |
org.apache.hadoop.fs.shell | |
org.apache.hadoop.hdfs |
A distributed implementation of
FileSystem . |
org.apache.hadoop.hdfs.server.datanode | |
org.apache.hadoop.hdfs.tools | |
org.apache.hadoop.hdfs.web | |
org.apache.hadoop.io.serializer |
This package provides a mechanism for using different serialization frameworks
in Hadoop.
|
org.apache.hadoop.mapred |
A software framework for easily writing applications which process vast
amounts of data (multi-terabyte data-sets) parallelly on large clusters
(thousands of nodes) built of commodity hardware in a reliable, fault-tolerant
manner.
|
org.apache.hadoop.mapred.pipes |
Hadoop Pipes allows C++ code to use Hadoop DFS and map/reduce.
|
org.apache.hadoop.mapred.tools | |
org.apache.hadoop.mapreduce.lib.partition | |
org.apache.hadoop.tools.distcp2 | |
org.apache.hadoop.tools.rumen |
Rumen is a data extraction and analysis tool built for
Apache Hadoop.
|
org.apache.hadoop.util |
Common utilities.
|
限定符和类型 | 类和说明 |
---|---|
class |
DBCountPageView
This is a demonstrative program, which uses DBInputFormat for reading
the input data from a database, and DBOutputFormat for writing the data
to the database.
|
class |
Grep |
class |
Join
This is the trivial map/reduce program that does absolutely nothing
other than use the framework to fragment and sort the input values.
|
class |
MultiFileWordCount
MultiFileWordCount is an example to demonstrate the usage of
MultiFileInputFormat.
|
class |
PiEstimator
A Map-reduce program to estimate the value of Pi
using quasi-Monte Carlo method.
|
class |
RandomTextWriter
This program uses map/reduce to just run a distributed job where there is
no interaction between the tasks and each task writes a large unsorted
random sequence of words.
|
class |
RandomWriter
This program uses map/reduce to just run a distributed job where there is
no interaction between the tasks and each task write a large unsorted
random binary sequence file of BytesWritable.
|
class |
SleepJob
Dummy class for testing MR framefork.
|
static class |
SleepJob.SleepInputFormat |
class |
Sort<K,V>
This is the trivial map/reduce program that does absolutely nothing
other than use the framework to fragment and sort the input values.
|
限定符和类型 | 类和说明 |
---|---|
class |
DistributedPentomino
Launch a distributed pentomino solver.
|
限定符和类型 | 类和说明 |
---|---|
class |
TeraGen
Generate the official terasort input data set.
|
class |
TeraSort
Generates the sampled split points, launches the job, and waits for it to
finish.
|
class |
TeraValidate
Generate 1 mapper per a file that checks to make sure the keys
are sorted within each file.
|
限定符和类型 | 类和说明 |
---|---|
class |
ChecksumFileSystem
Abstract Checksumed FileSystem.
|
class |
FileSystem
An abstract base class for a fairly generic filesystem.
|
class |
FilterFileSystem
A
FilterFileSystem contains
some other file system, which it uses as
its basic file system, possibly transforming
the data along the way or providing additional
functionality. |
class |
FsShell
Provide command line access to a FileSystem.
|
class |
HarFileSystem
This is an implementation of the Hadoop Archive
Filesystem.
|
class |
InMemoryFileSystem
已过时。
|
class |
LocalFileSystem
Implement the FileSystem API for the checksumed local filesystem.
|
class |
RawLocalFileSystem
Implement the FileSystem API for the raw local filesystem.
|
class |
Trash
Provides a trash feature.
|
限定符和类型 | 类和说明 |
---|---|
class |
FTPFileSystem
A
FileSystem backed by an FTP client provided by Apache Commons Net. |
限定符和类型 | 类和说明 |
---|---|
class |
KosmosFileSystem
A FileSystem backed by KFS.
|
限定符和类型 | 类和说明 |
---|---|
class |
MigrationTool
This class is a tool for migrating data from an older to a newer version
of an S3 filesystem.
|
class |
S3FileSystem
A block-based
FileSystem backed by
Amazon S3. |
限定符和类型 | 类和说明 |
---|---|
class |
NativeS3FileSystem
A
FileSystem for reading and writing files stored on
Amazon S3. |
限定符和类型 | 类和说明 |
---|---|
class |
Command
An abstract class for the execution of a file system command
|
class |
Count
Count the number of directories, files, bytes, quota, and remaining quota.
|
限定符和类型 | 类和说明 |
---|---|
class |
ChecksumDistributedFileSystem
An implementation of ChecksumFileSystem over DistributedFileSystem.
|
class |
DistributedFileSystem
Implementation of the abstract FileSystem for the DFS system.
|
class |
HftpFileSystem
An implementation of a protocol for accessing filesystems over HTTP.
|
class |
HsftpFileSystem
An implementation of a protocol for accessing filesystems over HTTPS.
|
限定符和类型 | 类和说明 |
---|---|
class |
DataNode
DataNode is a class (and program) that stores a set of
blocks for a DFS deployment.
|
限定符和类型 | 类和说明 |
---|---|
class |
DFSAdmin
This class provides some DFS administrative access.
|
class |
DFSck
This class provides rudimentary checking of DFS volumes for errors and
sub-optimal conditions.
|
限定符和类型 | 类和说明 |
---|---|
class |
WebHdfsFileSystem
A FileSystem for HDFS over the web.
|
限定符和类型 | 类和说明 |
---|---|
class |
SerializationFactory
A factory for
Serialization s. |
class |
WritableSerialization
A
Serialization for Writable s that delegates to
Writable.write(java.io.DataOutput) and
Writable.readFields(java.io.DataInput) . |
限定符和类型 | 类和说明 |
---|---|
class |
JobClient
JobClient is the primary interface for the user-job to interact
with the JobTracker . |
限定符和类型 | 类和说明 |
---|---|
class |
Submitter
The main entry point and job submitter.
|
限定符和类型 | 类和说明 |
---|---|
class |
MRAdmin
Administrative access to Hadoop Map-Reduce.
|
限定符和类型 | 类和说明 |
---|---|
class |
InputSampler<K,V>
Utility for collecting samples and writing a partition file for
TotalOrderPartitioner . |
限定符和类型 | 类和说明 |
---|---|
class |
CopyListing
The CopyListing abstraction is responsible for how the list of
sources and targets is constructed, for DistCp's copy function.
|
class |
DistCp
DistCp is the main driver-class for DistCpV2.
|
class |
FileBasedCopyListing
FileBasedCopyListing implements the CopyListing interface,
to create the copy-listing for DistCp,
by iterating over all source paths mentioned in a specified input-file.
|
class |
GlobbedCopyListing
GlobbedCopyListing implements the CopyListing interface, to create the copy
listing-file by "globbing" all specified source paths (wild-cards and all.)
|
class |
SimpleCopyListing
The SimpleCopyListing is responsible for making the exhaustive list of
all files/directories under its specified list of input-paths.
|
限定符和类型 | 类和说明 |
---|---|
class |
HadoopLogsAnalyzer
已过时。
|
class |
TraceBuilder
The main driver of the Rumen Parser.
|
限定符和类型 | 类和说明 |
---|---|
class |
LinuxMemoryCalculatorPlugin
已过时。
Use
LinuxResourceCalculatorPlugin
instead |
class |
LinuxResourceCalculatorPlugin
Plugin to calculate resource information on Linux systems.
|
class |
MemoryCalculatorPlugin
已过时。
Use
ResourceCalculatorPlugin
instead |
class |
ResourceCalculatorPlugin
Plugin to calculate resource information on the system.
|
Copyright © 2009 The Apache Software Foundation