Azure / MachineLearningNotebooks

Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft
https://docs.microsoft.com/azure/machine-learning/service/
MIT License

Enforce consistency between datasets API and Models API #1824

Closed edgBR closed 1 year ago

edgBR commented 1 year ago

Currently, neither the AzureML Python SDK nor the CLI supports filtering datasets by tags.

When running:

from azureml.core import Dataset, Workspace
# Load the workspace from the local config file
ws = Workspace.from_config()
# Returns the registered datasets keyed by registration name
Dataset.get_all(ws)

you get only the latest version of each dataset.

When you run:

from azureml.core import Model, Workspace
# Load the workspace from the local config file
ws = Workspace.from_config()
# Lists the registered models in the workspace
Model.list(ws)

you get all versions of every model, and you can also filter by tags with Model.list(ws, tags=['key', ['key2', 'key2 value']]).

The behaviour of the Models API is what I expect; the behaviour of the datasets API is inconsistent with it.
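To make the requested behaviour concrete, here is a minimal client-side sketch of tag filtering over the dictionary that Dataset.get_all(ws) returns. It is a workaround sketch, not part of the SDK: the helper names matches_tags and filter_datasets_by_tags are hypothetical, it assumes each dataset object exposes a tags dictionary, and it assumes that multiple filters combine with AND. The filter format mirrors the one shown for Model.list (a bare 'key', or a ['key', 'value'] pair). Stand-in objects are used so the example runs without a workspace.

```python
from types import SimpleNamespace


def matches_tags(item_tags, filters):
    """Return True if item_tags satisfies every filter (AND is an assumption).

    Each filter is either a bare key ('key') or a [key, value] pair,
    mirroring the list format shown for Model.list(ws, tags=...).
    """
    for f in filters:
        if isinstance(f, str):
            # Bare key: the tag only has to exist, whatever its value.
            if f not in item_tags:
                return False
        else:
            # [key, value] pair: the tag must exist with exactly this value.
            key, value = f
            if key not in item_tags or item_tags[key] != value:
                return False
    return True


def filter_datasets_by_tags(datasets, filters):
    """Keep the name -> dataset entries whose .tags dict matches the filters."""
    return {
        name: ds
        for name, ds in datasets.items()
        if matches_tags(getattr(ds, "tags", None) or {}, filters)
    }


# Stand-ins for registered datasets (hypothetical names and tags).
demo = {
    "train": SimpleNamespace(tags={"key": "a", "key2": "key2 value"}),
    "test": SimpleNamespace(tags={"key": "b"}),
    "raw": SimpleNamespace(tags={}),
}

selected = filter_datasets_by_tags(demo, ["key", ["key2", "key2 value"]])
print(sorted(selected))  # ['train']
```

With a real workspace, the same helper could presumably be applied as filter_datasets_by_tags(Dataset.get_all(ws), [...]), but that would still only see the latest version of each dataset, which is the gap described above.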

In fact, the REST API already makes it possible to use the tags property:

https://learn.microsoft.com/en-us/rest/api/azureml/2022-06-01-preview/data-containers/list?tabs=HTTP

BR E

edgBR commented 1 year ago

Moved to https://github.com/Azure/azure-sdk-for-python/issues/26329