aws / aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
https://aws-sdk-pandas.readthedocs.io
Apache License 2.0
3.94k stars 701 forks source link

Support Ray 2.38 #3004

Closed allemp closed 4 weeks ago

allemp commented 4 weeks ago

Describe the bug

Ray has removed ray.data.datasource._default_metadata_providers in a recent commit 12e1325 which awswrangler relies on and gives an error.

Ray 2.36 works fine.

How to Reproduce

Run wr.athena.read_sql_query() with ray@2.38.0 backend.

Expected behavior

I expected awswrangler to work with ray 2.38 with no errors.

Your project

No response

Screenshots

No response

OS

linux

Python version

3.11

AWS SDK for pandas version

3.9.1

Additional context

No response

jaidisido commented 4 weeks ago

Thanks, it has been fixed in #3007 and will be available in the next release