Closed shawnzam closed 4 months ago
I appreciate your thorough investigation into the issue with the Athena Loader in the LangChain Python framework. It's great to see that you've identified a potential solution and even provided the modified code. Your proposed modification seems reasonable, and I encourage you to go ahead and create a pull request with your suggested fix. Your contribution will be valuable in improving the LangChain framework for the community. Thank you for your dedication to resolving this issue!
@dosu-bot here is the PR: https://github.com/langchain-ai/langchain/pull/17526
Hi, @shawnzam,
I'm helping the LangChain team manage their backlog and am marking this issue as stale. It looks like you've provided a pull request with a suggested fix for the "Athena Loader encountering error with missing S3 bucket URI prefix" issue.
Could you please confirm if this issue is still relevant to the latest version of the LangChain repository? If it is, please let the LangChain team know by commenting on the issue. Otherwise, feel free to close the issue yourself, or it will be automatically closed in 7 days.
Thank you for your contribution!
Checked other resources
Example Code
Error Message and Stack Trace (if applicable)
NoSuchKey: An error occurred (NoSuchKey) when calling the GetObject operation: The specified key does not exist
Description
The Athena Loader fails when the result S3 bucket URI has no prefix, that is, when `s3_output_path` points at the bucket root. Calling the Loader instance raises "NoSuchKey: An error occurred (NoSuchKey) when calling the GetObject operation: The specified key does not exist."
If s3_output_path contains a prefix like:
Execution works without an error.
Suggested solution
Modify:
to
https://github.com/langchain-ai/langchain/blob/9e8a3fc4fff8e20ab5d1f113515ded14906eb6f3/libs/community/langchain_community/document_loaders/athena.py#L128
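The code snippets from the original report did not survive in this copy, so the following is only a minimal sketch of how this kind of failure can arise and how it might be avoided. It assumes the object key is built by joining the path segments after the bucket name with the query execution ID. The function names (`result_key_naive`, `result_key_fixed`) and the bucket name are hypothetical and are not taken from the LangChain source.

```python
from typing import List, Tuple


def _split_uri(s3_output_uri: str) -> Tuple[str, List[str]]:
    """Split an S3 URI into (bucket, prefix segments). Hypothetical helper."""
    path = s3_output_uri
    if path.startswith("s3://"):
        path = path[len("s3://"):]
    tokens = path.rstrip("/").split("/")
    return tokens[0], tokens[1:]


def result_key_naive(s3_output_uri: str, query_execution_id: str) -> Tuple[str, str]:
    """Always inserts a "/" before the ID, even when there is no prefix."""
    bucket, prefix = _split_uri(s3_output_uri)
    key = "/".join(prefix) + "/" + query_execution_id + ".csv"
    return bucket, key


def result_key_fixed(s3_output_uri: str, query_execution_id: str) -> Tuple[str, str]:
    """Joins prefix segments and the ID together, so an empty prefix adds no "/"."""
    bucket, prefix = _split_uri(s3_output_uri)
    key = "/".join(prefix + [query_execution_id]) + ".csv"
    return bucket, key


# Bucket root, no prefix: the naive key starts with "/", which S3 treats as
# part of the key name, so the object is not found.
print(result_key_naive("s3://my-bucket/", "abc123"))
print(result_key_fixed("s3://my-bucket/", "abc123"))
# With a prefix, both variants agree.
print(result_key_fixed("s3://my-bucket/results/", "abc123"))
```

Under these assumptions, a stray leading slash in the key would be enough to explain the `NoSuchKey` response from `GetObject`, since S3 keys are matched literally.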
System Info
System Information
Package Information