Azure / azureml-examples

Official community-driven Azure Machine Learning examples, tested with GitHub Actions.
https://docs.microsoft.com/azure/machine-learning
MIT License
1.76k stars 1.44k forks source link

Data versioning #3437

Open ShakutaiGit opened 2 weeks ago

ShakutaiGit commented 2 weeks ago

Description

This PR adds a new notebook for dataset versioning in Azure Machine Learning. The notebook covers:

Computing a hash for a dataset. Checking if a dataset with the same hash already exists in Azure ML. If the dataset does not exist, uploading it to Azure Blob Storage, registering it as an asset, and tagging the asset with the computed hash. If it exists, retrieving the asset name, version, and tag. The notebook includes a step to assign a tag to the asset in Azure ML using the computed hash, ensuring version tracking and identification.

Checklist

jayesh-tanna commented 10 hours ago

consider to add ownership in v2 notebook file in telemetry