pg_parquet: PostgreSQL Extension for Parquet File Management icon

pg_parquet: PostgreSQL Extension for Parquet File Management

The `pg_parquet` extension empowers PostgreSQL users to seamlessly read and write Parquet files stored in S3 or local file systems using standard `COPY` commands. It enhances data interchange capabilities, allowing for efficient export and import of complex data types, as well as comprehensive schema and metadata inspection.

Features

Supports reading and writing Parquet files directly from PostgreSQL using `COPY TO/FROM` commands.

Facilitates seamless integration with AWS S3 for object storage, enabling cloud data management.

Offers robust schema and metadata inspection features, providing insights into Parquet file structures and statistics.

Rich type support, including PostgreSQL's primitive, array, and composite types, ensuring versatile data handling.

Configurable options for compression, row group size, and file format, optimizing performance and storage.

Repository Details

366
11
Updated: 11/30/2024

Languages

Rust
Dockerfile
Makefile
Shell

Topics

columnar
data-ingestion
data-migration
parquet
postgresql

License

Other