Parquet integration
Notice: the Parquet integration is still in the early stage and subject to change.
The Parquet integration generates datasets containing onchain data. Parquet is the standard file format used by analytics applications such as:
- The Pandas and Polars data frame libraries for Python.
- DuckDB, an in-process SQL Online Analytical Processing (OLAP) database.
Installation#
apibara plugins install sink-parquet
Configuration#
outputDir: string
: write the Parquet files to this directory.batchSize: string
: each Parquet file has data for the specified number of blocks.