Bash

Editor's note: while there are other snippets showing file conversion, Parth's shows you how to convert from CSV to Parquet files using DuckDB with specification of the entire schema (columns) and compression codec.

Convert CSV to Parquet and provide schema to use

Parth Patil

Editor's note: another great example of using DuckDB's wide data format support to merge/combine multiple Parquet files. Parth also kindly shows you how to compress the resulting Parquet file with the zstd codec. DuckDB also supports gzip and snappy compression codecs.

Convert CSV to Parquet and provide schema to useBash

Execute this Bash

Combine several parquet files into one and compress with zstdBash

Execute this Bash