public interface Reader
| Modifier and Type | Interface and Description |
|---|---|
| `static class` | `Reader.Options` Options for creating a RecordReader. |
| Modifier and Type | Method and Description |
|---|---|
| `CompressionKind` | `getCompression()` Get the compression kind. |
| `int` | `getCompressionSize()` Get the buffer size for the compression. |
| `long` | `getContentLength()` Get the length of the file. |
| `OrcFile.Version` | `getFileVersion()` Get the file format version. |
| `Metadata` | `getMetadata()` Get the file metadata, such as stripe-level column statistics. |
| `List<String>` | `getMetadataKeys()` Get the user metadata keys. |
| `ByteBuffer` | `getMetadataValue(String key)` Get a user metadata value. |
| `long` | `getNumberOfRows()` Get the number of rows in the file. |
| `ObjectInspector` | `getObjectInspector()` Get the object inspector for looking at the objects. |
| `long` | `getRawDataSize()` Get the deserialized data size of the file. |
| `long` | `getRawDataSizeOfColumns(List<String> colNames)` Get the deserialized data size of the specified columns. |
| `int` | `getRowIndexStride()` Get the number of rows per entry in the row index. |
| `ColumnStatistics[]` | `getStatistics()` Get the statistics about the columns in the file. |
| `List<StripeInformation>` | `getStripes()` Get the list of stripes. |
| `List<OrcProto.Type>` | `getTypes()` Get the list of types contained in the file. |
| `OrcFile.WriterVersion` | `getWriterVersion()` Get the version of the writer of this file. |
| `boolean` | `hasMetadataValue(String key)` Check whether the user set the given metadata value. |
| `MetadataReader` | `metadata()` |
| `RecordReader` | `rows()` Create a RecordReader that reads everything with the default options. |
| `RecordReader` | `rows(boolean[] include)` Create a RecordReader that will scan the entire file. |
| `RecordReader` | `rows(long offset, long length, boolean[] include)` Create a RecordReader that will start reading at the first stripe after offset up to the stripe that starts at offset + length. |
| `RecordReader` | `rows(long offset, long length, boolean[] include, SearchArgument sarg, String[] neededColumns)` Create a RecordReader that will read a section of a file. |
| `RecordReader` | `rowsOptions(Reader.Options options)` Create a RecordReader that uses the options given. |
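The `include` argument of the `rows(...)` overloads is described only as "true for each column that should be included". The following is a minimal sketch of building such an array from column names. It assumes a flat schema in which index 0 stands for the root struct and top-level column `i` sits at index `i + 1`; nested schemas need more entries. `IncludeMask` and `includeFor` are hypothetical names used for illustration and are not part of the Reader API.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of the Reader API.
final class IncludeMask {

    // Builds an include array for a flat schema.
    // Assumption: index 0 is the root struct, column i is at index i + 1.
    static boolean[] includeFor(List<String> fieldNames, List<String> wanted) {
        boolean[] include = new boolean[fieldNames.size() + 1];
        include[0] = true; // keep the root struct itself
        for (String name : wanted) {
            int idx = fieldNames.indexOf(name);
            if (idx < 0) {
                throw new IllegalArgumentException("unknown column: " + name);
            }
            include[idx + 1] = true;
        }
        return include;
    }

    public static void main(String[] args) {
        boolean[] mask = includeFor(
                Arrays.asList("id", "name", "price"),
                Arrays.asList("id", "price"));
        System.out.println(Arrays.toString(mask)); // [true, true, false, true]
    }
}
```

An array built this way could then be passed as the `include` argument of `rows(boolean[] include)`.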
long getNumberOfRows()
long getRawDataSize()
long getRawDataSizeOfColumns(List<String> colNames)
Parameters:
colNames -
List<String> getMetadataKeys()
ByteBuffer getMetadataValue(String key)
Parameters:
key - a key given by the user
boolean hasMetadataValue(String key)
Parameters:
key - the key to check
CompressionKind getCompression()
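`getMetadataValue(String key)` returns the user metadata value as a raw `ByteBuffer`. The sketch below shows one way to turn such a buffer into text, assuming the writer stored a UTF-8 string; the file itself does not say how the bytes are encoded. `UserMetadata` and `asUtf8` are hypothetical names used for illustration.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of the Reader API.
final class UserMetadata {

    // Decodes the remaining bytes of the buffer as UTF-8 text.
    // Assumption: the writer stored a UTF-8 string under this key.
    static String asUtf8(ByteBuffer value) {
        // duplicate() leaves the caller's position and limit untouched
        return StandardCharsets.UTF_8.decode(value.duplicate()).toString();
    }

    public static void main(String[] args) {
        // Stand-in for a value returned by getMetadataValue(key)
        ByteBuffer stored = ByteBuffer.wrap("v1.2".getBytes(StandardCharsets.UTF_8));
        System.out.println(asUtf8(stored)); // v1.2
    }
}
```

Checking `hasMetadataValue(key)` before fetching the value avoids having to guess how a missing key is reported.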
int getCompressionSize()
int getRowIndexStride()
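The row index stride is the number of rows covered by one row index entry. As a rough sketch of the arithmetic, the helper below estimates how many index entries a stripe of a given size would have. It assumes one entry per stride within a stripe, a separate entry for a partial final group, and that a stride of zero or less means no row index is present. `RowIndexMath` and `indexEntries` are hypothetical names.

```java
// Hypothetical helper, not part of the Reader API.
final class RowIndexMath {

    // Ceiling division of rows by stride.
    // Assumption: a non-positive stride means the file has no row index.
    static long indexEntries(long rowsInStripe, int rowIndexStride) {
        if (rowIndexStride <= 0) {
            return 0;
        }
        return (rowsInStripe + rowIndexStride - 1) / rowIndexStride;
    }

    public static void main(String[] args) {
        System.out.println(indexEntries(25_000, 10_000)); // 3
    }
}
```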
List<StripeInformation> getStripes()
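The `rows(long offset, long length, boolean[] include)` overload reads from the first stripe after `offset` up to the stripe that starts at `offset + length`. One reading of that description is that a stripe is selected when its starting byte offset falls in the half-open range `[offset, offset + length)`. The sketch below illustrates that reading on plain stripe start offsets; it is an assumption about the semantics, not the library's code, and `StripeRange` and `select` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the Reader API.
final class StripeRange {

    // Returns the start offsets of stripes that begin in [offset, offset + length).
    // Assumption: selection depends only on where a stripe starts.
    // Overflow of offset + length is not handled in this sketch.
    static List<Long> select(long[] stripeStarts, long offset, long length) {
        long end = offset + length;
        List<Long> picked = new ArrayList<>();
        for (long start : stripeStarts) {
            if (start >= offset && start < end) {
                picked.add(start);
            }
        }
        return picked;
    }

    public static void main(String[] args) {
        long[] starts = {3L, 1000L, 2000L, 3000L};
        System.out.println(select(starts, 500L, 2000L)); // [1000, 2000]
    }
}
```

Under this reading, splitting a file into contiguous byte ranges assigns every stripe to exactly one range, which is useful when several tasks each read one range.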
ObjectInspector getObjectInspector()
long getContentLength()
ColumnStatistics[] getStatistics()
Metadata getMetadata() throws IOException
Throws:
IOException
List<OrcProto.Type> getTypes()
OrcFile.Version getFileVersion()
OrcFile.WriterVersion getWriterVersion()
RecordReader rows() throws IOException
Throws:
IOException
RecordReader rowsOptions(Reader.Options options) throws IOException
Parameters:
options - the options to read with
Throws:
IOException
RecordReader rows(boolean[] include) throws IOException
Parameters:
include - true for each column that should be included
Throws:
IOException
RecordReader rows(long offset, long length, boolean[] include) throws IOException
Parameters:
offset - a byte offset in the file
length - a number of bytes in the file
include - true for each column that should be included
Throws:
IOException
RecordReader rows(long offset, long length, boolean[] include, SearchArgument sarg, String[] neededColumns) throws IOException
Parameters:
offset - the minimum offset of the first stripe to read
length - the distance from offset of the first address to stop reading at
include - true for each column that should be included
sarg - a search argument that limits the rows that should be read.
neededColumns - the names of the included columns
Throws:
IOException
MetadataReader metadata() throws IOException
Throws:
IOException

Copyright © 2017 The Apache Software Foundation. All rights reserved.