Azure / MachineLearningNotebooks

Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft
https://docs.microsoft.com/azure/machine-learning/service/
MIT License
4.07k stars 2.51k forks source link

`training_data` in AutoMLConfig can not be a pandas data frame #1665

Open JCZuurmond opened 2 years ago

JCZuurmond commented 2 years ago

I see the following exception when pandas data frame is passed as training_data in the AutoMLConfig.

ConfigException: ConfigException: Message: Input of type '<class 'pandas.core.frame.DataFrame'>' is not supported. Supported types: [azureml.data.tabular_dataset.TabularDataset]Please refer to documentation for converting to Supported types: https://docs.microsoft.com/en-us/python/api/azureml-core/azureml.core.dataset.dataset?view=azure-ml-py InnerException: None ErrorResponse { "error": { "code": "UserError", "message": "Input of type '<class 'pandas.core.frame.DataFrame'>' is not supported. Supported types: [azureml.data.tabular_dataset.TabularDataset]Please refer to documentation for converting to Supported types: https://docs.microsoft.com/en-us/python/api/azureml-core/azureml.core.dataset.dataset?view=azure-ml-py", "details_uri": "https://aka.ms/AutoMLConfig", "target": "training_data", "inner_error": { "code": "BadArgument", "inner_error": { "code": "ArgumentInvalid", "inner_error": { "code": "InvalidInputDatatype" } } } } }

The documentation says that training_data can be a pandas data frame.

Versions:


Document Details

Do not edit this section. It is required for docs.microsoft.com ➟ GitHub issue linking.

slsu0424 commented 2 years ago

Hello - I have been running into the exact same issue as @JCZuurmond. Is there any update on this?

JCZuurmond commented 2 years ago

Nope, no response on this PR yet. I don't remember what I did with this issue

majdoulinamakbal commented 2 years ago

Any update ? running into same problem ..

kryukovaeks commented 2 years ago

Any update ? running into same problem ..

majdoulinamakbal commented 2 years ago

Hello, so here's how i solved the issue:

  1. Register your pandas dataframe as a tabular dataset using Dataset.Tabular.register_pandas_dataframe(your_dataframe,your_datastore,'a description')
  2. use the registered tabular dataset in your automl config
kryukovaeks commented 2 years ago

Thank you!

Le 20 juil. 2022 à 17:09, Majdoulina Makbal @.***> a écrit :

 Hello, so here's how i solved the issue:

Register your pandas dataframe as a tabular dataset using Dataset.Tabular.register_pandas_dataframe(your_dataframe,your_datastore,'a description') use the registered tabular dataset in your automl config — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you commented.