FixedWidthStorer (Pig 0.13.0 API)

Overview

Package

Class

Use

Tree

Deprecated

Index

Help

PREV CLASS NEXT CLASS

FRAMES NO FRAMES

SUMMARY: NESTED | FIELD | CONSTR | METHOD

DETAIL: FIELD | CONSTR | METHOD

org.apache.pig.piggybank.storage
Class FixedWidthStorer

java.lang.Object
  org.apache.pig.StoreFunc
      org.apache.pig.piggybank.storage.FixedWidthStorer

All Implemented Interfaces:: StoreFuncInterface

public class FixedWidthStorer
extends StoreFunc
extends StoreFunc

Stores Pig records in a fixed-width file format. Takes a string argument specifying the ranges of each column in a unix 'cut'-like format. Ex: '-5, 10-12, 14, 20-' Ranges are comma-separated, 1-indexed (for ease of use with 1-indexed text editors), and inclusive. A single-column field at position n may be specified as either 'n-n' or simply 'n'. A second optional argument specifies whether to write a header record with the names of each field. 'WRITE_HEADER' writes a header record; 'NO_HEADER' and the default does not write one. All datetimes are stored in UTC. Column spec idea and syntax parser borrowed from Russ Lankenau's FixedWidthLoader implementation at https://github.com/rlankenau/fixed-width-pig-loader

Constructor Summary
`FixedWidthStorer()`
`FixedWidthStorer(String columnSpec)`
`FixedWidthStorer(String columnSpec, String headerStr)`

Method Summary
`void`	`checkSchema(ResourceSchema s)` Set the schema for data to be stored.
`org.apache.hadoop.mapreduce.OutputFormat`	`getOutputFormat()` Return the OutputFormat associated with StoreFunc.
`String[]`	`getPartitionKeys(String location, org.apache.hadoop.mapreduce.Job job)`
`ResourceStatistics`	`getStatistics(String location, org.apache.hadoop.mapreduce.Job job)`
`void`	`prepareToWrite(org.apache.hadoop.mapreduce.RecordWriter writer)` Initialize StoreFunc to write data.
`void`	`putNext(Tuple t)` Write a tuple to the data store.
`void`	`setPartitionFilter(Expression partitionFilter)`
`void`	`setStoreFuncUDFContextSignature(String signature)` This method will be called by Pig both in the front end and back end to pass a unique signature to the `StoreFunc` which it can use to store information in the `UDFContext` which it needs to store between various method invocations in the front end and back end.
`void`	`setStoreLocation(String location, org.apache.hadoop.mapreduce.Job job)` Communicate to the storer the location where the data needs to be stored.
`void`	`storeStatistics(ResourceStatistics stats, String location, org.apache.hadoop.mapreduce.Job job)`

Methods inherited from class org.apache.pig.StoreFunc
`cleanupOnFailure, cleanupOnFailureImpl, cleanupOnSuccess, relToAbsPathForStoreLocation, warn`

Methods inherited from class java.lang.Object
`clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait`

Constructor Detail

FixedWidthStorer

public FixedWidthStorer()

FixedWidthStorer

public FixedWidthStorer(String columnSpec)

FixedWidthStorer

public FixedWidthStorer(String columnSpec,
                        String headerStr)

Method Detail