Open anna-geller opened 1 year ago
afaik, UTF-8 encoding and Apache Arrow are used behind the scenes
the main issue is that schema
property is required in this task and there is no information what is expected here and how to use it. I didn't know how to use it because e.g. when writing a Pandas dataframe to a Parquet file, you don't even have to think about the schema, the schema is inferred from the dataframe
The
ParquetWriter
is currently too difficult to use. When some file is already stored as ION, Kestra should infer the schema and should not require schema specification (i.e., the schema can be added optionally).