Hadoop 1.2.2-SNAPSHOT Developer API

Hadoop is a distributed computing platform.

请参阅: 说明

Core 
程序包 说明
org.apache.hadoop  
org.apache.hadoop.classification  
org.apache.hadoop.conf
Configuration of system parameters.
org.apache.hadoop.filecache  
org.apache.hadoop.fs
An abstract file system API.
org.apache.hadoop.fs.ftp  
org.apache.hadoop.fs.kfs
A client for the Kosmos filesystem (KFS) Introduction This pages describes how to use Kosmos Filesystem ( KFS ) as a backing store with Hadoop.
org.apache.hadoop.fs.permission  
org.apache.hadoop.fs.s3
A distributed, block-based implementation of FileSystem that uses Amazon S3 as a backing store.
org.apache.hadoop.fs.s3native
A distributed implementation of FileSystem for reading and writing files on Amazon S3.
org.apache.hadoop.fs.shell  
org.apache.hadoop.hdfs
A distributed implementation of FileSystem.
org.apache.hadoop.hdfs.protocol  
org.apache.hadoop.hdfs.security.token.block  
org.apache.hadoop.hdfs.security.token.delegation  
org.apache.hadoop.hdfs.server.balancer  
org.apache.hadoop.hdfs.server.common  
org.apache.hadoop.hdfs.server.datanode  
org.apache.hadoop.hdfs.server.datanode.metrics  
org.apache.hadoop.hdfs.server.datanode.web.resources  
org.apache.hadoop.hdfs.server.namenode  
org.apache.hadoop.hdfs.server.namenode.metrics  
org.apache.hadoop.hdfs.server.namenode.web.resources  
org.apache.hadoop.hdfs.server.protocol  
org.apache.hadoop.hdfs.tools  
org.apache.hadoop.hdfs.tools.offlineImageViewer  
org.apache.hadoop.hdfs.util  
org.apache.hadoop.hdfs.web  
org.apache.hadoop.hdfs.web.resources  
org.apache.hadoop.http  
org.apache.hadoop.http.lib  
org.apache.hadoop.io
Generic i/o code for use when reading and writing data to the network, to databases, and to files.
org.apache.hadoop.io.compress  
org.apache.hadoop.io.compress.bzip2  
org.apache.hadoop.io.compress.snappy  
org.apache.hadoop.io.compress.zlib  
org.apache.hadoop.io.file.tfile  
org.apache.hadoop.io.nativeio  
org.apache.hadoop.io.retry
A mechanism for selectively retrying methods that throw exceptions under certain circumstances.
org.apache.hadoop.io.serializer
This package provides a mechanism for using different serialization frameworks in Hadoop.
org.apache.hadoop.ipc
Tools to help define network clients and servers.
org.apache.hadoop.ipc.metrics  
org.apache.hadoop.jmx
This package provides access to JMX primarily through the JMXJsonServlet class.
org.apache.hadoop.log  
org.apache.hadoop.log.metrics  
org.apache.hadoop.mapred
A software framework for easily writing applications which process vast amounts of data (multi-terabyte data-sets) parallelly on large clusters (thousands of nodes) built of commodity hardware in a reliable, fault-tolerant manner.
org.apache.hadoop.mapred.jobcontrol
Utilities for managing dependent jobs.
org.apache.hadoop.mapred.join
Given a set of sorted datasets keyed with the same class and yielding equal partitions, it is possible to effect a join of those datasets prior to the map.
org.apache.hadoop.mapred.lib
Library of generally useful mappers, reducers, and partitioners.
org.apache.hadoop.mapred.lib.aggregate
Classes for performing various counting and aggregations.
org.apache.hadoop.mapred.lib.db
org.apache.hadoop.mapred.lib.db Package This package contains a library to read records from a database as an input to a mapreduce job, and write the output records to the database.
org.apache.hadoop.mapred.pipes
Hadoop Pipes allows C++ code to use Hadoop DFS and map/reduce.
org.apache.hadoop.mapred.tools  
org.apache.hadoop.mapreduce  
org.apache.hadoop.mapreduce.lib.db  
org.apache.hadoop.mapreduce.lib.fieldsel  
org.apache.hadoop.mapreduce.lib.input  
org.apache.hadoop.mapreduce.lib.jobcontrol  
org.apache.hadoop.mapreduce.lib.map  
org.apache.hadoop.mapreduce.lib.output  
org.apache.hadoop.mapreduce.lib.partition  
org.apache.hadoop.mapreduce.lib.reduce  
org.apache.hadoop.mapreduce.security  
org.apache.hadoop.mapreduce.security.token  
org.apache.hadoop.mapreduce.security.token.delegation  
org.apache.hadoop.mapreduce.server.jobtracker  
org.apache.hadoop.mapreduce.server.tasktracker  
org.apache.hadoop.mapreduce.server.tasktracker.userlogs  
org.apache.hadoop.mapreduce.split  
org.apache.hadoop.metrics
Note, this package is deprecated in favor of org.apache.hadoop.metrics2 usage.
org.apache.hadoop.metrics.file
Implementation of the metrics package that writes the metrics to a file.
org.apache.hadoop.metrics.ganglia
Implementation of the metrics package that sends metric data to Ganglia.
org.apache.hadoop.metrics.jvm  
org.apache.hadoop.metrics.spi
Note, this package is deprecated in favor of org.apache.hadoop.metrics2 usage.
org.apache.hadoop.metrics.util  
org.apache.hadoop.metrics2
Metrics 2.0
org.apache.hadoop.metrics2.filter  
org.apache.hadoop.metrics2.impl  
org.apache.hadoop.metrics2.lib  
org.apache.hadoop.metrics2.sink  
org.apache.hadoop.metrics2.sink.ganglia  
org.apache.hadoop.metrics2.source  
org.apache.hadoop.metrics2.util  
org.apache.hadoop.net
Network-related classes.
org.apache.hadoop.record
Hadoop record I/O contains classes and a record description language translator for simplifying serialization and deserialization of records in a language-neutral manner.
org.apache.hadoop.record.compiler
This package contains classes needed for code generation from the hadoop record compiler.
org.apache.hadoop.record.compiler.ant  
org.apache.hadoop.record.compiler.generated
This package contains code generated by JavaCC from the Hadoop record syntax file rcc.jj.
org.apache.hadoop.record.meta  
org.apache.hadoop.security  
org.apache.hadoop.security.authentication.client  
org.apache.hadoop.security.authentication.server  
org.apache.hadoop.security.authentication.util  
org.apache.hadoop.security.authorize  
org.apache.hadoop.security.token  
org.apache.hadoop.security.token.delegation  
org.apache.hadoop.tools  
org.apache.hadoop.tools.distcp2  
org.apache.hadoop.tools.distcp2.mapred  
org.apache.hadoop.tools.distcp2.mapred.lib  
org.apache.hadoop.tools.distcp2.util  
org.apache.hadoop.tools.rumen
Rumen is a data extraction and analysis tool built for Apache Hadoop.
org.apache.hadoop.typedbytes
Typed bytes are sequences of bytes in which the first byte is a type code.
org.apache.hadoop.util
Common utilities.
org.apache.hadoop.util.bloom  
org.apache.hadoop.util.hash  
Examples 
程序包 说明
org.apache.hadoop.examples
Hadoop example code.
org.apache.hadoop.examples.dancing
This package is a distributed implementation of Knuth's dancing links algorithm that can run under Hadoop.
org.apache.hadoop.examples.terasort
This package consists of 3 map/reduce applications for Hadoop to compete in the annual terabyte sort competition.
contrib: Streaming 
程序包 说明
org.apache.hadoop.streaming
Hadoop Streaming is a utility which allows users to create and run Map-Reduce jobs with any executables (e.g.
org.apache.hadoop.streaming.io  
contrib: DataJoin 
程序包 说明
org.apache.hadoop.contrib.utils.join  
contrib: Index 
程序包 说明
org.apache.hadoop.contrib.index.example  
org.apache.hadoop.contrib.index.lucene  
org.apache.hadoop.contrib.index.main  
org.apache.hadoop.contrib.index.mapred  

Copyright © 2009 The Apache Software Foundation