Azure-Samples / chat-with-your-data-solution-accelerator

A Solution Accelerator for the RAG pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences. This includes most common requirements and best practices.
https://azure.microsoft.com/products/search
MIT License
835 stars, 431 forks

Continued issue with backend docker deployment #1266

Closed atockar closed 1 month ago

atockar commented 2 months ago

Hi, I am still getting this issue. I tried the one-click deployment and it fails in the same place. [Screenshot 2024-08-30 at 4 35 42 pm]

I can see the bicep file has been updated in the code, and I added the storage blob data contributor role for the storage account to the function app:

Add the storage blob data contributor and storage data reader roles to the managed identity of the function app, and ensure that when deploying the solution the bicep file contains: @allowed(['Enabled', 'Disabled']) param publicNetworkAccess string = 'Enabled'

However, when I redeploy this section, it asks for the clientKey, which I thought was generated. Any ideas?

Originally posted by @atockar in https://github.com/Azure-Samples/chat-with-your-data-solution-accelerator/issues/1195#issuecomment-2320236507
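
The role assignment and network parameter described in the quoted advice could be sketched in Bicep roughly as follows. This is a minimal sketch, not the accelerator's actual template: the parameter names storageAccountName and functionAppPrincipalId are hypothetical, and only the Storage Blob Data Contributor assignment is shown because the thread does not name the exact built-in role meant by "storage data reader".

```bicep
// Parameter quoted in the thread; note the decorator is lowercase.
@allowed([
  'Enabled'
  'Disabled'
])
param publicNetworkAccess string = 'Enabled'

// Hypothetical parameters, introduced only for this sketch.
param storageAccountName string
param functionAppPrincipalId string

// Built-in role definition ID for "Storage Blob Data Contributor".
var storageBlobDataContributorRoleId = 'ba92f5b4-2d11-453d-a403-e96b0029c9fe'

// Reference the storage account that already exists in the resource group.
resource storageAccount 'Microsoft.Storage/storageAccounts@2023-01-01' existing = {
  name: storageAccountName
}

// Grant the function app's managed identity blob data access on that account.
resource blobContributorAssignment 'Microsoft.Authorization/roleAssignments@2022-04-01' = {
  name: guid(storageAccount.id, functionAppPrincipalId, storageBlobDataContributorRoleId)
  scope: storageAccount
  properties: {
    roleDefinitionId: subscriptionResourceId('Microsoft.Authorization/roleDefinitions', storageBlobDataContributorRoleId)
    principalId: functionAppPrincipalId
    principalType: 'ServicePrincipal'
  }
}
```

In this sketch publicNetworkAccess is only declared; a real template would pass it on to the resources that expose that property.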

Roopan-Microsoft commented 2 months ago

Thanks, @atockar, for raising the issue. We are looking into this and will keep you posted.

Roopan-Microsoft commented 2 months ago

Hey @atockar,

Could you please provide us with more details about the exception you’re encountering?

We’ve been able to deploy the accelerator successfully on our end, so we’d like to investigate further if you could share more information about the error you're experiencing.

atockar commented 2 months ago

[Screenshot 2024-09-06 at 9 04 50 am]

I believe I found the source of the issue - it fails when I use a B1 hosting plan. See if you're able to replicate, thanks

Roopan-Microsoft commented 2 months ago

Hey @atockar, thanks for tracking that down! I'll try to replicate the issue with a B1 hosting plan and get back to you soon with what I find.

Roopan-Microsoft commented 2 months ago

Yes @atockar, you are right! Choosing this SKU plan throws an error. We are looking into it. Thanks.

In the meantime, you can proceed with the default SKU plans. We will update you on the root cause.

TBunkley commented 1 month ago

I'm having issues with this. What are the default settings? Basic pops up first for me.

Roopan-Microsoft commented 1 month ago

Hey @TBunkley, Thanks for reaching out to us.

As you may know, the B1 plan includes one core, whereas the B3 plan offers four cores. Additionally, as shown in the screenshot below, the B1 plan has less RAM, which contributes to slower function app deployment. The default timeout is set to 300 seconds, so the deployment fails before it can finish.

We are currently looking into potential solutions for this issue.

[image]

Roopan-Microsoft commented 1 month ago

Hey @atockar / @TBunkley,

When you select the deployment link, the hosting plan defaults to B3. Changing this to B1 will not work as expected, which is why a scale-down operation does not resolve the issue.

Moreover, the solution you are implementing involves several services, including Azure App Service and Azure Speech Service, which require substantial memory. The B1 plan provides a small worker with 1.75 GB of RAM, whereas B3 offers a large worker with 7 GB of RAM.

Please don’t hesitate to reach out if you have any questions or concerns.

I'm closing this issue for now. Please feel free to reopen it if any problems arise. Happy to help!

TBunkley commented 1 month ago

Thanks @Roopan-Microsoft,

Would it make sense to remove it as an option if it is a known case that fails? Maybe add a TODO for it?

Roopan-Microsoft commented 1 month ago

Yes @TBunkley, you are right! We will work on that. Thanks for your feedback.
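
One way the suggestion above might be implemented, assuming the hosting plan SKU is exposed as a Bicep parameter with an allowed list. The parameter name hostingPlanSku and the list of values are hypothetical. B2 appears only for illustration, since the thread does not say whether it deploys successfully.

```bicep
// Hypothetical parameter name and SKU list, shown only to illustrate the idea.
// B1 is left out because deployments on it were reported to time out in this thread.
@description('App Service plan SKU used for the deployment.')
@allowed([
  'B2' // illustrative only; not confirmed to work in this thread
  'B3' // the default reported by the maintainers
])
param hostingPlanSku string = 'B3'
```

With a restriction like this, selecting B1 would be rejected when the template is validated, rather than failing partway through the function app deployment.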