Azure / Enterprise-Scale

The Azure Landing Zones (Enterprise-Scale) architecture provides prescriptive guidance coupled with Azure best practices, and it follows design principles across the critical design areas for organizations to define their Azure architecture
https://aka.ms/alz
MIT License
1.69k stars 963 forks source link

Deploy-Diagnostics-MlWorkspace is not working for all types of "Microsoft.MachineLearningServices/workspaces" #1583

Closed NucLabs closed 3 months ago

NucLabs commented 7 months ago

Describe the bug Apparently more then one (sub?) types of machine learning workspaces (Microsoft.MachineLearningServices/workspaces) do exist in Azure. In the portal the type is different (Azure AI vs Azure AI project), but the resource type is, confusingly, the same. The policy Deploy-Diagnostics-MlWorkspace tries to deploy diagnostic settings on both types, but only succeeds on the second one (Azure AI project). Diagnostic settings for both types are very different: Azure AI only has metrics, while the logs/metrics of Azure AI project match the ones defined in the policy definition.

The error reported by the policyaction (deployment) is Category 'AmlComputeClusterEvent' is not supported

Screenshots image

image

Springstone commented 7 months ago

@NucLabs thanks for raising your concern. We are aware of the issues in this space and are transitioning to a more comprehensive solution in the near term. This is a general problem with the various resource providers that we have no control over. In the near term you will have to log ALL METRICS to get all the required data for all resource providers (until all the RPs support Audit Logs only category).

NucLabs commented 7 months ago

I did a little dive into the policies that are available now. Are you referring to the "Enable logging by category group*" builtin policies. Will the CAF initiative contain those, instead of the custom policies?

Springstone commented 6 months ago

There are a new set of policies and initiatives coming soon, 412 policies and 6 initiatives, covering 137 Azure services and a set depending on the log target (Log Analytics, Event Hub or Storage).

Springstone commented 3 months ago

Closing this as we've deprecated all our diagnostic settings policies and shifted to the PG owned initiative to do the same. Please review https://aka.ms/alz/whatsnew for details.