Azure / azureml-examples

Official community-driven Azure Machine Learning examples, tested with GitHub Actions.
https://docs.microsoft.com/azure/machine-learning
MIT License

Dataset versioning #3390

Open ShakutaiGit opened 1 week ago

ShakutaiGit commented 1 week ago

Description

This PR adds a new notebook for dataset versioning in Azure Machine Learning. The notebook covers:

- Computing a hash for a dataset.
- Checking whether a dataset with the same hash already exists in Azure ML.
- If the dataset does not exist: uploading it to Azure Blob Storage, registering it as an asset, and tagging the asset with the computed hash.
- If it exists: retrieving the asset name, version, and tag.

Tagging the asset with the computed hash is what allows each dataset version to be tracked and identified.
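
For reviewers, here is a minimal sketch of the hash-and-lookup steps, not the notebook's actual code. It assumes SHA-256 as the hash and a hypothetical tag key `content_hash`; the PR description does not say which hash function or tag name the notebook uses. `find_asset_by_hash` accepts any iterable of objects exposing a `tags` dict, for example the results of the SDK's data asset listing call (an assumption, not confirmed here).

```python
import hashlib
from pathlib import Path
from typing import Any, Iterable, Optional

# Hypothetical tag key; the notebook may use a different name.
HASH_TAG = "content_hash"


def compute_dataset_hash(path: str, chunk_size: int = 1 << 20) -> str:
    """Return a SHA-256 hex digest for a file or a directory of files."""
    root = Path(path)
    single_file = root.is_file()
    # Sort directory contents so the digest does not depend on listing order.
    files = [root] if single_file else sorted(p for p in root.rglob("*") if p.is_file())
    digest = hashlib.sha256()
    for f in files:
        if not single_file:
            # Mix in the relative path so renamed or moved files change the hash.
            digest.update(f.relative_to(root).as_posix().encode("utf-8"))
        with f.open("rb") as fh:
            # Read in chunks to avoid loading large files into memory.
            for chunk in iter(lambda: fh.read(chunk_size), b""):
                digest.update(chunk)
    return digest.hexdigest()


def find_asset_by_hash(
    assets: Iterable[Any], dataset_hash: str, tag_key: str = HASH_TAG
) -> Optional[Any]:
    """Return the first asset whose tags carry the given hash, else None."""
    for asset in assets:
        tags = getattr(asset, "tags", None) or {}
        if tags.get(tag_key) == dataset_hash:
            return asset
    return None
```

If `find_asset_by_hash` returns `None`, the dataset would be uploaded and registered with `{HASH_TAG: dataset_hash}` in its tags; otherwise the matching asset's name, version, and tag can be read directly from the returned object.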

Checklist

jayesh-tanna commented 1 week ago

Can you also update this README.md file? https://github.com/azure/azureml-examples/blob/main/sdk/python/README.md