To reduce cost in AWS Athena queries, compression is required. From current understand snappy compression for parquet is standard.
Were gzip compression might give small files (better for cost), snappy compression is better for performance. This should give the best for both worlds, compressed files but can be queried effectively.
Parquet snappy compression: https://arrow.apache.org/docs/r/reference/write_parquet.html
To reduce cost in AWS Athena queries, compression is required. From current understand snappy compression for parquet is standard.
Were
gzip
compression might give small files (better for cost), snappy compression is better for performance. This should give the best for both worlds, compressed files but can be queried effectively.