openml / OpenML

Open Machine Learning
https://openml.org
BSD 3-Clause "New" or "Revised" License
664 stars 90 forks source link

Cannot delete dataset associated with flow created automatically #1050

Closed marcoslbueno closed 4 years ago

marcoslbueno commented 4 years ago

Description

Hi,

I uploaded a dataset via the python API (https://www.openml.org/d/42453) and after a few days a task (https://www.openml.org/t/233078) was associated to it. I guess the task was created automatically (by a bot, for example). When I try to delete this dataset I get the error 354 'Dataset is in use by other content. Can not be deleted'. Then, I tried to delete the associated task, but I cannot do so (no remove button is shown to me).

Not sure this is a bug, or just the way I'm trying to delete the dataset.

Steps/Code to Reproduce

Expected Results

Actual Results

Versions

mfeurer commented 4 years ago

Hi, the issue is that a dataset on which content was created cannot be deleted any more, as the other content would then be meaningless.

I'm moving this to the OpenML issue tracker as this is an issue with OpenML an not the python API.

joaquinvanschoren commented 4 years ago

@marcoslbueno: Were you logged in? I could simply remove the task (just did).

@mfeurer: Is it possible that there is no task deletion function in the Python API? I couldn't find it?

marcoslbueno commented 4 years ago

@joaquinvanschoren Yes, I was logged in. The way I was trying to delete the task was probably wrong I guess (via browser)? Thanks anyway, I could now delete the dataset.

LucaUrbinati44 commented 1 year ago

@marcoslbueno How did you manage to delete a dataset? I don't find any instruction about it. Thank you

joaquinvanschoren commented 1 year ago

@PGijsbers is this supported by the Python API now?

PGijsbers commented 1 year ago

I am currently working on the PR to allow people to delete any of their uploaded OpenML entities, including datasets. I hope to finish it next week.

LucaUrbinati44 commented 1 year ago

I found this function by looking at the source code of OpenML (in the documentation there is nothing about it) and it works:

# Remove dataset
openml.utils._delete_entity("data", <your_dataset_id>)
PGijsbers commented 1 year ago

Yes, the PR is just to make documented functions and better error messaging surrounding that function: #1060