aws / aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
https://aws-sdk-pandas.readthedocs.io
Apache License 2.0

Athena cache not working with unload approach #2984

Open AndreaPaglia opened 1 month ago

AndreaPaglia commented 1 month ago

After reading the documentation, I expected the Athena cache to also work with the unload_approach=True parameter, but judging from the code it is not supported by design. Specifically, the CSV file format is imposed on all DML queries here: https://github.com/aws/aws-sdk-pandas/blob/3.9.1/awswrangler/athena/_cache.py#L222

Is there a specific reason for this, or is support planned for a future release? Thanks!
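
For context, here is a minimal sketch of the kind of call this concerns, not an exact reproduction. The helper names build_read_kwargs and run_query are hypothetical, and the query, database, and bucket in the usage comment are placeholders. It assumes awswrangler 3.x, where read_sql_query accepts unload_approach and athena_cache_settings, and it passes s3_output on the assumption that UNLOAD needs an S3 location to write to.

```python
from typing import Any, Dict


def build_read_kwargs(
    sql: str,
    database: str,
    s3_output: str,
    max_cache_seconds: int = 900,
) -> Dict[str, Any]:
    """Assemble keyword arguments for awswrangler.athena.read_sql_query.

    Hypothetical helper: it only builds a dict and never touches AWS.
    """
    return {
        "sql": sql,
        "database": database,
        # ctas_approach defaults to True, so it is switched off explicitly
        # when requesting the unload approach.
        "ctas_approach": False,
        "unload_approach": True,
        "s3_output": s3_output,
        # Ask the library to reuse a matching query result from the last N seconds.
        "athena_cache_settings": {"max_cache_seconds": max_cache_seconds},
    }


def run_query(kwargs: Dict[str, Any]):
    """Run the query; needs awswrangler installed and valid AWS credentials."""
    import awswrangler as wr  # imported lazily so the helper above works offline

    return wr.athena.read_sql_query(**kwargs)


# Example usage (placeholders, requires AWS access):
# df = run_query(build_read_kwargs(
#     "SELECT * FROM my_table", "my_database", "s3://my-bucket/unload/"
# ))
```

With these arguments I would expect a second identical call within max_cache_seconds to hit the cache, which is the behaviour that does not seem to happen with the unload approach.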