Closed marielaquino closed 9 months ago
Same issue here, is this because azure open ai sudio has no whisper model deployment?
Hi, @marielaquino! I'm Dosu, and I'm here to help the LangChain team manage their backlog. I wanted to let you know that we are marking this issue as stale.
From what I understand, the issue you reported is related to the GenericLoader
not working on Azure OpenAI. It seems to be throwing an InvalidRequestError: Resource Not Found
when trying to transcribe an audio file. You mentioned that you are unsure why this error is occurring, as none of the Loader methods take in the api_key
or deployment_name
as parameters. Another user, @Daily-Prophet, has also experienced the same issue and wonders if it is because Azure OpenAI Studio has no whisper model deployment.
Before we proceed, we would like to confirm if this issue is still relevant to the latest version of the LangChain repository. If it is, please let us know by commenting on this issue. Otherwise, feel free to close the issue yourself, or the issue will be automatically closed in 7 days.
Thank you for your understanding and cooperation. We look forward to hearing from you soon.
I confirm that this issue is still happening on langchain v0.0.320, where you still get the error Attempt 1 failed. Exception: Resource not found
when you try to use the GenericLoader
with Azure OpenAI credentials, as in the snippet below:
loader = GenericLoader(
YoutubeAudioLoader([url], save_dir),
OpenAIWhisperParser()
)
@baskaryan Could you please help @marielaquino with this issue? They have confirmed that the issue is still occurring in the latest version of LangChain (v0.0.320). They are getting the error Attempt 1 failed. Exception: Resource not found
when using the GenericLoader
with Azure OpenAI credentials. Thank you!
Hi, @marielaquino,
I'm helping the LangChain team manage their backlog and am marking this issue as stale.
It seems like the issue you reported regarding the GenericLoader not working on Azure OpenAI, resulting in an "InvalidRequestError: Resource Not Found" when attempting to transcribe an audio file from a public YouTube video, is still unresolved. Another user, @Daily-Prophet, also experienced the same issue and questioned if it's due to Azure OpenAI Studio lacking a whisper model deployment. The issue has been confirmed to persist in the latest version of the LangChain repository (v0.0.320) by @sergiofgonzalez. The LangChain team has been asked to assist with resolving the issue.
Could you please confirm if this issue is still relevant to the latest version of the LangChain repository? If it is, please let the LangChain team know by commenting on the issue. Otherwise, feel free to close the issue yourself, or the issue will be automatically closed in 7 days.
Thank you for your understanding and cooperation. If you have any further questions or updates, feel free to reach out.
I'm Dosu
System Info
langchain=0.0.225, python=3.9.17, openai=0.27.8 openai.api_type = "azure", openai.api_version = "2023-05-15" api_base, api_key, deployment_name environment variables all configured.
Who can help?
No response
Information
Related Components
Reproduction
Steps to reproduce the behavior: Note: This code is pulled directly from document loaders chapter of Langchain Chat With Your Data course with Harrison Chase and Andrew Ng. It downloads an audio file of a public youtube video and generates a transcript.
Successfully executes the following steps:
Usually, with "resource not found" errors, the message will tell you to input api_key or deployment_name. I'm not sure what this means, as none of the Loader methods take in these as params.
Expected behavior
Expected behavior is to finish four parts of transcription and "load" as doc in docs variable.