Azure / azure-sdk-for-python

This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://docs.microsoft.com/python/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-python.
MIT License
4.38k stars 2.71k forks source link

Azure SDK V2-Execution of concurrent Drift and Data quality monitors #34413

Open ukhan67 opened 4 months ago

ukhan67 commented 4 months ago

Describe the bug We are required to execute 150 plus data quality and data drift monitors. There are multiple errors that occur on different Azure managed components. Few errors are mentioned below.

  1. rejected, too many sessions are being created.(error1 image uploaded)
  2. something took too long to load while initial spark context was being setup.(error2 image uploaded)
  3. MountPointCreationError(Some(1)).(error3 image uploaded)
  4. No module named 'azure.storage.filedatalake.(error4 image uploaded)
  5. Cant get attribute '_execute' on module . (error5 image uploaded)

To Reproduce Steps to reproduce the behavior:

  1. schedule multiple drift and data quality monitors like 100 at a same time.
  2. errors will show up in random Azure managed components.

Expected behavior I expected multiple monitors to be executed concurrently.

Screenshots error1 error2 error3 error4 error5

swathipil commented 4 months ago

Hi @ukhan67 - Thanks for opening an issue! Directing this to the right people.

@azureml-github - Can you take a look?

yunjie-hub commented 4 months ago

Thanks for reporting this issue, I know this customer run into several issues, can we connect to the customer to have the faster support?