All Classes
| Class |
Description |
| AndRecordFilter |
Provides ability to chain two filters together.
|
| BaseRecordReader<T> |
Deprecated.
|
| Binary |
|
| BinaryPlainValuesReader |
|
| BinaryStatistics |
|
| BinaryTruncator |
Class for truncating min/max values for binary types.
|
| BinaryValue |
|
| BitPackingValuesReader |
a column reader that packs the ints in the number of bits required based on the maximum size.
|
| BitPackingValuesWriter |
a column writer that packs the ints in the number of bits required based on the maximum size.
|
| BlockSplitBloomFilter |
|
| BloomFilter |
A Bloom filter is a compact structure to indicate whether an item is not in a set or probably
in a set.
|
| BloomFilter.Algorithm |
|
| BloomFilter.Compression |
|
| BloomFilter.HashStrategy |
|
| BloomFilterWriter |
|
| BloomFilterWriteStore |
Contains all writers for all columns of a row group
|
| BooleanPlainValuesReader |
Decodes booleans for the plain encoding: one bit at a time (0 = false)
|
| BooleanPlainValuesWriter |
An implementation of the PLAIN encoding
|
| BooleanStatistics |
|
| BooleanValue |
|
| BoundaryOrder |
Enum for BoundaryOrder.
|
| ByteBitPackingValuesReader |
|
| ByteBitPackingValuesWriter |
|
| ByteStreamSplitValuesReader |
|
| ByteStreamSplitValuesReaderForDouble |
|
| ByteStreamSplitValuesReaderForFloat |
|
| ByteStreamSplitValuesWriter |
|
| ByteStreamSplitValuesWriter.DoubleByteStreamSplitValuesWriter |
|
| ByteStreamSplitValuesWriter.FloatByteStreamSplitValuesWriter |
|
| ColumnDescriptor |
Describes a column's type as well as its position in its containing schema.
|
| ColumnIndex |
Column index containing min/max and null count values for the pages in a column chunk.
|
| ColumnIndexBuilder |
|
| ColumnIndexFilter |
Filter implementation based on column indexes.
|
| ColumnIndexStore |
|
| ColumnIndexStore.MissingOffsetIndexException |
Exception thrown when an offset index is missing for any of the columns.
|
| ColumnIO |
a structure used to serialize and deserialize records
|
| ColumnIOFactory |
Factory constructing the ColumnIO structure from the schema
|
| ColumnOrder |
Class representing the column order with all the related parameters.
|
| ColumnOrder.ColumnOrderName |
The enum type of the column order.
|
| ColumnPredicates |
ColumnPredicates class provides checks for column values.
|
| ColumnPredicates.BooleanPredicateFunction |
|
| ColumnPredicates.DoublePredicateFunction |
|
| ColumnPredicates.FloatPredicateFunction |
|
| ColumnPredicates.IntegerPredicateFunction |
|
| ColumnPredicates.LongPredicateFunction |
|
| ColumnPredicates.Predicate |
|
| ColumnPredicates.PredicateFunction<T> |
|
| ColumnReader |
Reader for (repetition level, definition level, values) triplets.
|
| ColumnReaderImpl |
ColumnReader implementation for the scenario when column indexes are not used (all values are read)
|
| ColumnReadStore |
Container which can produce a ColumnReader for any given column in a schema.
|
| ColumnReadStoreImpl |
Implementation of the ColumnReadStore
Initializes individual columns based on schema and converter
|
| ColumnRecordFilter |
Record filter which applies the supplied predicate to the specified column.
|
| ColumnWriter |
writer for (repetition level, definition level, values) triplets
|
| ColumnWriteStore |
Container which can construct writers for multiple columns to be stored
together.
|
| ColumnWriteStoreV1 |
|
| ColumnWriteStoreV2 |
|
| CompilationException |
thrown when a problem occurred while compiling the column reader
|
| ConversionPatterns |
Utility functions to convert from Java-like map and list types
to equivalent Parquet types.
|
| Converter |
Represent a tree of converters
that materializes tuples
|
| CorruptDeltaByteArrays |
|
| CorruptStatistics |
There was a bug (PARQUET-251) that caused the statistics metadata
for binary columns to be corrupted in the write path.
|
| DataPage |
one data page in a chunk
|
| DataPage.Visitor<T> |
|
| DataPageV1 |
|
| DataPageV2 |
|
| DecimalMetadata |
Deprecated.
|
| DefaultV1ValuesWriterFactory |
|
| DefaultV2ValuesWriterFactory |
|
| DefaultValuesWriterFactory |
Handles ValuesWriter creation statically based on the types of the columns and the writer version.
|
| DeltaBinaryPackingValuesReader |
|
| DeltaBinaryPackingValuesWriter |
Write integers with delta encoding and binary packing
The format is as follows:
|
| DeltaBinaryPackingValuesWriterForInteger |
Write integers (INT32) with delta encoding and binary packing.
|
| DeltaBinaryPackingValuesWriterForLong |
Write longs (INT64) with delta encoding and binary packing.
|
| DeltaByteArrayReader |
|
| DeltaByteArrayWriter |
Write prefix lengths using delta encoding, followed by suffixes with Delta length byte arrays
|
| DeltaLengthByteArrayValuesReader |
|
| DeltaLengthByteArrayValuesWriter |
Write lengths of byte-arrays using delta encoding, followed by concatenated byte-arrays
|
| DevNullValuesWriter |
This is a special writer that doesn't write anything.
|
| Dictionary |
a dictionary to decode dictionary based encodings
|
| DictionaryPage |
Data for a dictionary page
|
| DictionaryPageReadStore |
Interface to read dictionary pages for all the columns of a row group
|
| DictionaryValuesReader |
Reads values that have been dictionary encoded
|
| DictionaryValuesWriter |
Will attempt to encode values using a dictionary and fall back to plain encoding
if the dictionary gets too big
|
| DictionaryValuesWriter.PlainBinaryDictionaryValuesWriter |
|
| DictionaryValuesWriter.PlainDoubleDictionaryValuesWriter |
|
| DictionaryValuesWriter.PlainFixedLenArrayDictionaryValuesWriter |
|
| DictionaryValuesWriter.PlainFloatDictionaryValuesWriter |
|
| DictionaryValuesWriter.PlainIntegerDictionaryValuesWriter |
|
| DictionaryValuesWriter.PlainLongDictionaryValuesWriter |
|
| DoubleStatistics |
|
| DoubleValue |
|
| DummyRecordConverter |
Dummy implementation for perf tests
|
| Encoding |
encoding of the data
|
| EncodingStats |
EncodingStats track dictionary and data page encodings for a single column within a row group.
|
| EncodingStats.Builder |
Used to build EncodingStats from metadata or to accumulate stats as pages are written.
|
| FallbackValuesWriter<I extends ValuesWriter & RequiresFallback,F extends ValuesWriter> |
|
| FilterApi |
The Filter API is expressed through these static methods.
|
| FilterCompat |
Parquet currently has two ways to specify a filter for dropping records at read time.
|
| FilterCompat.Filter |
|
| FilterCompat.FilterPredicateCompat |
|
| FilterCompat.NoOpFilter |
|
| FilterCompat.UnboundRecordFilterCompat |
|
| FilterCompat.Visitor<T> |
Anyone wanting to use a FilterCompat.Filter need only implement this interface,
per the visitor pattern.
|
| FilteringGroupConverter |
|
| FilteringPrimitiveConverter |
|
| FilteringRecordMaterializer<T> |
|
| FilterPredicate |
A FilterPredicate is an expression tree describing the criteria for which records to keep when loading data from
a parquet file.
|
| FilterPredicate.Visitor<R> |
A FilterPredicate Visitor must visit all the operators in a FilterPredicate expression tree,
and must handle recursion itself, per the visitor pattern.
|
| FixedLenByteArrayPlainValuesReader |
ValuesReader for FIXED_LEN_BYTE_ARRAY.
|
| FixedLenByteArrayPlainValuesWriter |
ValuesWriter for FIXED_LEN_BYTE_ARRAY.
|
| FloatStatistics |
|
| FloatValue |
|
| Group |
|
| GroupColumnIO |
Group level of the IO structure
|
| GroupConverter |
converter for group nodes
|
| GroupFactory |
|
| GroupRecordConverter |
|
| GroupType |
Represents a group type: a list of fields
|
| GroupValueSource |
|
| GroupWriter |
|
| HashFunction |
An interface containing a set of hash functions used by Bloom filters.
|
| IncompatibleSchemaModificationException |
thrown when we are trying to read together files with incompatible schemas.
|
| IncrementallyUpdatedFilterPredicate |
A rewritten version of a FilterPredicate which receives
the values for a record's columns one by one and internally tracks whether the predicate is
satisfied, unsatisfied, or unknown.
|
| IncrementallyUpdatedFilterPredicate.And |
|
| IncrementallyUpdatedFilterPredicate.BinaryLogical |
|
| IncrementallyUpdatedFilterPredicate.Or |
|
| IncrementallyUpdatedFilterPredicate.ValueInspector |
This is the leaf node of a filter predicate.
|
| IncrementallyUpdatedFilterPredicate.Visitor |
|
| IncrementallyUpdatedFilterPredicateBuilder |
|
| IncrementallyUpdatedFilterPredicateBuilderBase |
The implementation of this abstract class is auto-generated by
org.apache.parquet.filter2.IncrementallyUpdatedFilterPredicateGenerator.
Constructs an IncrementallyUpdatedFilterPredicate from a FilterPredicate.
This is how records are filtered during record assembly.
|
| IncrementallyUpdatedFilterPredicateEvaluator |
|
| IncrementallyUpdatedFilterPredicateResetter |
|
| IndexIterator |
Iterator implementation for page indexes.
|
| Int96Value |
|
| IntegerValue |
|
| IntList |
An append-only integer list
avoids autoboxing and buffer resizing
|
| IntList.IntIterator |
to iterate on the content of the list
not an actual iterator to avoid autoboxing
|
| IntStatistics |
|
| InvalidFileOffsetException |
|
| InvalidRecordException |
thrown when an invalid record is encountered
|
| InvalidSchemaException |
|
| LogicalInverseRewriter |
Recursively removes all use of the not() operator in a predicate
by replacing all instances of not(x) with the inverse(x),
e.g.: not(and(eq(x), not(eq(y)))) -> or(notEq(x), eq(y))
The returned predicate should have the same meaning as the original, but
without the use of the not() operator.
|
| LogicalInverter |
|
| LogicalTypeAnnotation |
|
| LogicalTypeAnnotation.BsonLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.DateLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.DecimalLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.EnumLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.IntervalLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.IntLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.JsonLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.ListLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.LogicalTypeAnnotationVisitor<T> |
Implement this interface to visit a logical type annotation in the schema.
|
| LogicalTypeAnnotation.MapKeyValueTypeAnnotation |
|
| LogicalTypeAnnotation.MapLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.StringLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.TimeLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.TimestampLogicalTypeAnnotation |
|
| LogicalTypeAnnotation.TimeUnit |
|
| LogicalTypeAnnotation.UUIDLogicalTypeAnnotation |
|
| LongStatistics |
|
| LongValue |
|
| MessageColumnIO |
Message level of the IO structure
|
| MessageType |
The root of a schema
|
| MessageTypeParser |
Parses a schema from a textual format similar to that described in the Dremel paper.
|
| MinMax<T> |
This class calculates the max and min values of an iterable collection.
|
| NanoTime |
|
| NotRecordFilter |
Provides ability to negate the result of a filter.
|
| OffsetIndex |
Offset index containing the offset and size of the page and the index of the first row in the page.
|
| OffsetIndexBuilder |
Builder implementation to create OffsetIndex objects while writing a Parquet file.
|
| Operators |
These are the operators in a filter predicate expression tree.
|
| Operators.And |
|
| Operators.BinaryColumn |
|
| Operators.BooleanColumn |
|
| Operators.Column<T extends Comparable<T>> |
|
| Operators.DoubleColumn |
|
| Operators.Eq<T extends Comparable<T>> |
|
| Operators.FloatColumn |
|
| Operators.Gt<T extends Comparable<T>> |
|
| Operators.GtEq<T extends Comparable<T>> |
|
| Operators.In<T extends Comparable<T>> |
|
| Operators.IntColumn |
|
| Operators.LogicalNotUserDefined<T extends Comparable<T>,U extends UserDefinedPredicate<T>> |
|
| Operators.LongColumn |
|
| Operators.Lt<T extends Comparable<T>> |
|
| Operators.LtEq<T extends Comparable<T>> |
|
| Operators.Not |
|
| Operators.NotEq<T extends Comparable<T>> |
|
| Operators.NotIn<T extends Comparable<T>> |
|
| Operators.Or |
|
| Operators.SetColumnFilterPredicate<T extends Comparable<T>> |
|
| Operators.SupportsEqNotEq |
|
| Operators.SupportsLtGt |
|
| Operators.UserDefined<T extends Comparable<T>,U extends UserDefinedPredicate<T>> |
|
| Operators.UserDefinedByClass<T extends Comparable<T>,U extends UserDefinedPredicate<T>> |
|
| Operators.UserDefinedByInstance<T extends Comparable<T>,U extends UserDefinedPredicate<T> & Serializable> |
|
| OriginalType |
Deprecated.
|
| OrRecordFilter |
Provides ability to chain two filters together.
|
| Page |
one page in a chunk
|
| PagedRecordFilter |
Filter which will only materialize a page worth of results.
|
| PageReader |
Reader for a sequence of pages from a given column chunk
|
| PageReadStore |
contains all the readers for all the columns of the corresponding row group
TODO: rename to RowGroup?
|
| PageWriter |
a writer for all the pages of a given column chunk
|
| PageWriteStore |
contains all the writers for the columns in the corresponding row group
|
| Paper |
Examples from the Dremel Paper
|
| ParquetDecodingException |
thrown when a decoding problem occurred
|
| ParquetEncodingException |
thrown when an encoding problem occurred
|
| ParquetProperties |
This class represents all the configurable Parquet properties.
|
| ParquetProperties.Builder |
|
| ParquetProperties.WriterVersion |
|
| PlainValuesDictionary |
a simple implementation of dictionary for plain encoded values
|
| PlainValuesDictionary.PlainBinaryDictionary |
a simple implementation of dictionary for plain encoded binary
|
| PlainValuesDictionary.PlainDoubleDictionary |
a simple implementation of dictionary for plain encoded double values
|
| PlainValuesDictionary.PlainFloatDictionary |
a simple implementation of dictionary for plain encoded float values
|
| PlainValuesDictionary.PlainIntegerDictionary |
a simple implementation of dictionary for plain encoded integer values
|
| PlainValuesDictionary.PlainLongDictionary |
a simple implementation of dictionary for plain encoded long values
|
| PlainValuesReader |
Plain encoding for float, double, int, long
|
| PlainValuesReader.DoublePlainValuesReader |
|
| PlainValuesReader.FloatPlainValuesReader |
|
| PlainValuesReader.IntegerPlainValuesReader |
|
| PlainValuesReader.LongPlainValuesReader |
|
| PlainValuesWriter |
Plain encoding except for booleans
|
| Primitive |
|
| PrimitiveColumnIO |
Primitive level of the IO structure
|
| PrimitiveComparator<T> |
Comparator implementation that also supports the comparison of the related primitive type to avoid the
performance penalty of boxing/unboxing.
|
| PrimitiveConverter |
converter for leaves of the schema
|
| PrimitiveStringifier |
Class that provides string representations for the primitive values.
|
| PrimitiveToBoxedClass |
Converts a Class<primitive> to its corresponding Class<Boxed>, e.g.
Class<int> to Class<Integer>
|
| PrimitiveType |
Representation of a Primitive type
|
| PrimitiveType.PrimitiveTypeName |
Supported Primitive types
|
| PrimitiveType.PrimitiveTypeNameConverter<T,E extends Exception> |
|
| RecordConsumer |
Abstraction for writing records
It decouples the striping algorithm from the actual record model
example:
|
| RecordConsumerLoggingWrapper |
This class can be used to wrap an actual RecordConsumer and log all calls
|
| RecordFilter |
Filter to be applied to a record to work out whether to skip it.
|
| RecordMaterializer<T> |
Top-level class which should be implemented in order to materialize objects from
a stream of Parquet data.
|
| RecordMaterializer.RecordMaterializationException |
This exception signals that the current record cannot be converted from parquet columns to a materialized
record, but can be skipped if requested.
|
| RecordReader<T> |
used to read reassembled records
|
| RequiresFallback |
Used to add extra behavior to a ValuesWriter that requires fallback
|
| RequiresPreviousReader |
|
| RowRanges |
Class representing row ranges in a row-group.
|
| RowRanges.Range |
|
| RunLengthBitPackingHybridDecoder |
|
| RunLengthBitPackingHybridEncoder |
Encodes values using a combination of run length encoding and bit packing,
according to the following grammar:
|
| RunLengthBitPackingHybridValuesReader |
|
| RunLengthBitPackingHybridValuesWriter |
|
| SchemaCompatibilityValidator |
Inspects the column types found in the provided FilterPredicate and compares them
to the actual schema found in the parquet file.
|
| SimpleGroup |
|
| SimpleGroupFactory |
|
| Statistics<T extends Comparable<T>> |
Statistics class to keep track of statistics in parquet pages and column chunks
|
| Statistics<T> |
Contains statistics about a group of records
|
| Statistics.Builder |
Builder class to build Statistics objects.
|
| StatisticsClassException |
Thrown if the two Statistics objects have mismatching types
|
| Type |
Represents the declared type for a field in a schema.
|
| Type.ID |
represents a field ID
|
| Type.Repetition |
Constraint on the repetition of a field
|
| TypeConverter<T> |
to convert a MessageType tree
|
| Types |
This class provides fluent builders that produce Parquet schema Types.
|
| Types.BaseGroupBuilder<P,THIS extends Types.BaseGroupBuilder<P,THIS>> |
|
| Types.BaseListBuilder<P,THIS extends Types.BaseListBuilder<P,THIS>> |
|
| Types.BaseListBuilder.ElementBuilder<LP,L extends Types.BaseListBuilder<LP,L>> |
|
| Types.BaseListBuilder.GroupElementBuilder<LP,L extends Types.BaseListBuilder<LP,L>> |
|
| Types.BaseListBuilder.ListElementBuilder<LP,L extends Types.BaseListBuilder<LP,L>> |
|
| Types.BaseListBuilder.MapElementBuilder<LP,L extends Types.BaseListBuilder<LP,L>> |
|
| Types.BaseMapBuilder<P,THIS extends Types.BaseMapBuilder<P,THIS>> |
|
| Types.BaseMapBuilder.GroupKeyBuilder<MP,M extends Types.BaseMapBuilder<MP,M>> |
|
| Types.BaseMapBuilder.GroupValueBuilder<MP,M extends Types.BaseMapBuilder<MP,M>> |
|
| Types.BaseMapBuilder.KeyBuilder<MP,M extends Types.BaseMapBuilder<MP,M>> |
|
| Types.BaseMapBuilder.ListValueBuilder<MP,M extends Types.BaseMapBuilder<MP,M>> |
|
| Types.BaseMapBuilder.MapValueBuilder<MP,M extends Types.BaseMapBuilder<MP,M>> |
|
| Types.BaseMapBuilder.ValueBuilder<MP,M extends Types.BaseMapBuilder<MP,M>> |
|
| Types.BasePrimitiveBuilder<P,THIS extends Types.BasePrimitiveBuilder<P,THIS>> |
|
| Types.Builder<THIS extends Types.Builder,P> |
A base builder for Type objects.
|
| Types.GroupBuilder<P> |
|
| Types.ListBuilder<P> |
|
| Types.MapBuilder<P> |
|
| Types.MessageTypeBuilder |
|
| Types.PrimitiveBuilder<P> |
|
| TypeUtil |
|
| TypeVisitor |
Implement this interface to visit a schema
type.accept(new TypeVisitor() { ...
|
| UnboundRecordFilter |
Builder for a record filter.
|
| UnknownColumnException |
Thrown if the specified column is unknown in the underlying storage
|
| UnknownColumnTypeException |
Thrown if the specified column type is unknown in the underlying storage
|
| UserDefinedPredicate<T extends Comparable<T>> |
A UserDefinedPredicate decides whether a record should be kept or dropped, first by
inspecting meta data about a group of records to see if the entire group can be dropped,
then by inspecting actual values of a single column.
|
| ValidatingRecordConsumer |
Wraps a record consumer
Validates the record written against the schema and passes the event down to the wrapped consumer
|
| ValidTypeMap |
Contains all valid mappings from class -> parquet type (and vice versa) for use in
FilterPredicates
This is a bit ugly, but it allows us to provide good error messages at runtime
when there are type mismatches.
|
| ValuesReader |
Base class to implement an encoding for a given column type.
|
| ValuesType |
The different types of values we can store in columns
|
| ValuesWriter |
base class to implement an encoding for a given column
|
| ValuesWriterFactory |
Can be overridden to allow users to manually test different strategies to create ValuesWriters.
|
| XxHash |
An implementation of the HashFunction interface.
|
| ZeroIntegerValuesReader |
ColumnReader which does not read any actual data, but rather simply produces
an endless stream of constant values.
|
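
The not() removal described for LogicalInverseRewriter can be illustrated with a small, self-contained sketch. This is not the Parquet implementation: the `Pred` class, the `Kind` enum, and the helper methods below are hypothetical stand-ins for the library's FilterPredicate and Operators types, and only eq/notEq leaves are modeled. The idea shown is that not(x) is replaced by the inverse of x, pushing the negation down to the leaves with De Morgan's laws.

```java
/** Illustrative only: a tiny predicate tree, not the Parquet classes. */
public class NotRemovalSketch {

    enum Kind { EQ, NOT_EQ, AND, OR, NOT }

    /** Hypothetical predicate node. */
    static final class Pred {
        final Kind kind;
        final String column; // set for EQ / NOT_EQ leaves only
        final Pred left;     // set for AND / OR / NOT
        final Pred right;    // set for AND / OR only

        Pred(Kind kind, String column, Pred left, Pred right) {
            this.kind = kind;
            this.column = column;
            this.left = left;
            this.right = right;
        }

        @Override
        public String toString() {
            switch (kind) {
                case EQ:     return "eq(" + column + ")";
                case NOT_EQ: return "notEq(" + column + ")";
                case NOT:    return "not(" + left + ")";
                case AND:    return "and(" + left + ", " + right + ")";
                default:     return "or(" + left + ", " + right + ")";
            }
        }
    }

    static Pred eq(String c)        { return new Pred(Kind.EQ, c, null, null); }
    static Pred notEq(String c)     { return new Pred(Kind.NOT_EQ, c, null, null); }
    static Pred and(Pred l, Pred r) { return new Pred(Kind.AND, null, l, r); }
    static Pred or(Pred l, Pred r)  { return new Pred(Kind.OR, null, l, r); }
    static Pred not(Pred p)         { return new Pred(Kind.NOT, null, p, null); }

    /** Returns an equivalent tree that contains no NOT nodes. */
    static Pred rewrite(Pred p) {
        switch (p.kind) {
            case AND: return and(rewrite(p.left), rewrite(p.right));
            case OR:  return or(rewrite(p.left), rewrite(p.right));
            case NOT: return invert(p.left);
            default:  return p; // leaves are already NOT-free
        }
    }

    /** Returns a NOT-free tree equivalent to not(p). */
    static Pred invert(Pred p) {
        switch (p.kind) {
            case EQ:     return notEq(p.column);
            case NOT_EQ: return eq(p.column);
            case AND:    return or(invert(p.left), invert(p.right));  // De Morgan
            case OR:     return and(invert(p.left), invert(p.right)); // De Morgan
            default:     return rewrite(p.left); // double negation cancels
        }
    }

    public static void main(String[] args) {
        Pred input = not(and(eq("x"), not(eq("y"))));
        // prints: not(and(eq(x), not(eq(y)))) -> or(notEq(x), eq(y))
        System.out.println(input + " -> " + rewrite(input));
    }
}
```

Splitting the work into `rewrite` and `invert` keeps each method a single pass over the tree; a real rewriter would also need inverse rules for the remaining comparison operators (lt, gt, and so on).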