Parquet Extract

This extract type imports data from Parquet files. A Parquet connection is required for this extract.

See also Parquet's documentation on file format types and metadata.

Connection	A connection of ParquetFile type. See ParquetFile Connection.
Column range subset	An optional list of comma-separated column indices that will be extracted. The first column of the range is 1. Example: 1,3,4
Header	Check this option if column names are needed from the Parquet schema. If not, leave it unchecked and the column names will be "column1", "column2", etc.
Schema types	If set, the data types (string, numeric, date) are taken from the Parquet schema. Otherwise, the data type is always set to string.
Date format	Columns formatted as date will be output according to the provided date format, such as dd.MM.yyyy. Details on syntax can be found in DateFormat Function.
Use caching	If caching is activated, the complete output of the extract is temporarily stored during the first call of the extract, using an internal H2 database. Subsequent calls of the extract read directly from the cache without connecting to the underlying source system of the extract. If the extract or the underlying connection contains variables, a separate cache is build for different values of these variables. See Caching in Extracts and Transforms for more information.

Updated July 3, 2025