Parquet Extract
This extract type imports data from Parquet files. A Parquet connection is required for this extract.
See also Parquet's documentation on file format types and metadata.
Connection | A connection of ParquetFile type. See Parquet Connection. |
Column range subset | An optional comma-separated list of the column indices to extract. Column indices start at 1. Example: 1,3,4 |
Header | Check this option to take the column names from the Parquet schema. If it is left unchecked, the columns are named "column1", "column2", etc. |
Schema types | If set, the data types (string, numeric, date) are taken from the Parquet schema. Otherwise, the data type is always set to string. |
Date format | Columns formatted as date will be output according to the provided date format, such as dd.MM.yyyy. Details on syntax can be found in DateFormat Function. |
Use caching | If caching is activated, the complete output of the extract is temporarily stored in an internal H2 database during the first call of the extract. Subsequent calls of the extract read directly from the cache without connecting to the underlying source system of the extract. If the extract or the underlying connection contains variables, a separate cache is built for each distinct combination of variable values. See Caching in Extracts and Transforms for more information. |
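The interaction of the Column range subset, Header, Schema types, and Date format options can be illustrated with a small Python model. This is only a sketch of the behavior described above, not the product's implementation: the function `apply_extract_options`, its parameters, and the sample schema are hypothetical. It also assumes that generated names are numbered by output position when Header is unchecked, and it uses the Python pattern `%d.%m.%Y` as the counterpart of `dd.MM.yyyy`.

```python
from datetime import date


def apply_extract_options(schema, rows, column_subset=None, header=False,
                          schema_types=False, date_format="%d.%m.%Y"):
    """Toy model of the extract options (hypothetical helper).

    schema: list of (name, type) pairs, type in {"string", "numeric", "date"}
    rows:   list of tuples holding one value per schema column
    """
    # Column range subset: 1-based indices; by default every column is kept.
    indices = column_subset or list(range(1, len(schema) + 1))

    # Header checked: names come from the schema.
    # Header unchecked: generated names column1, column2, ...
    # (assumption: numbered by position in the output).
    if header:
        names = [schema[i - 1][0] for i in indices]
    else:
        names = [f"column{n}" for n in range(1, len(indices) + 1)]

    # Schema types set: keep the schema's types; otherwise everything is string.
    types = [schema[i - 1][1] if schema_types else "string" for i in indices]

    out_rows = []
    for row in rows:
        out = []
        for i in indices:
            value = row[i - 1]
            if isinstance(value, date):
                # Date columns are rendered with the configured date format.
                value = value.strftime(date_format)
            elif not schema_types:
                value = str(value)
            out.append(value)
        out_rows.append(tuple(out))
    return names, types, out_rows


schema = [("id", "numeric"), ("name", "string"),
          ("amount", "numeric"), ("booked", "date")]
rows = [(1, "Alice", 12.5, date(2025, 5, 8))]

# Subset 1,3,4 with Header and Schema types unchecked.
plain = apply_extract_options(schema, rows, column_subset=[1, 3, 4])

# Same subset with Header and Schema types checked.
typed = apply_extract_options(schema, rows, column_subset=[1, 3, 4],
                              header=True, schema_types=True)
```

In this model, `plain` carries the names column1, column2, column3 with all values as strings, while `typed` keeps the schema names id, amount, booked and the numeric values as numbers. In both cases the date is rendered as 08.05.2025.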
Updated May 8, 2025