Hi guys,
I'd like to share my practical experience from trying to use openml over the last few days. Maybe it helps to improve the package :)
I started by uploading a CSV file as a dataset using the web interface at https://www.openml.org/new/data. No matter what I entered or how many fields I filled out, I always got the error below. The upload did work using uploadOMLDataSet().
"Error 131: Problem validating uploaded description file - XML does not correspond to XSD schema. Please fill in all required (red) fields, upload a file or give a URL (not both), and avoid spaces in the dataset name."
I then tried the command-line function uploadOMLDataSet(). Two points here: I was expecting to upload a dataset, which for me is a CSV-like file. However, the function actually expects what is a "task" object in mlr. If one is used to working with mlr, it is not logical to upload an mlr task as a "dataset" to openml. But maybe the openML logic makes more sense here and the mlr logic is lacking.
The help page for creating an openML task with makeOMLTask could really be improved. What should the lists for the parameters and input arguments look like?
Once I have created a task using makeOMLTask, how can I upload it? There are upload functions for datasets, flows and runs, but none for tasks?
Is it possible to use resampling strategies in the web interface other than the preselected ones? E.g. I would like to use "repeated spatial CV" from mlr or just "5-fold CV".
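To be concrete about what I mean by "5-fold CV" and its repeated variant, here is a minimal sketch in plain Python. It is not OpenML or mlr code: the function names k_fold_splits and repeated_k_fold_splits are made up for illustration, and the spatial variant (which would group observations by location before splitting) is not shown.

```python
import random


def k_fold_splits(n_samples, k=5, seed=0):
    """Return k (train, test) index pairs that partition range(n_samples)."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices round-robin into k folds of near-equal size.
    folds = [indices[i::k] for i in range(k)]
    splits = []
    for i, test in enumerate(folds):
        # Every fold except the i-th one goes into the training set.
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        splits.append((train, test))
    return splits


def repeated_k_fold_splits(n_samples, k=5, reps=2, seed=0):
    """Repeat k-fold CV `reps` times, reshuffling for each repetition."""
    splits = []
    for r in range(reps):
        splits.extend(k_fold_splits(n_samples, k, seed + r))
    return splits


splits = k_fold_splits(20, k=5)
print(len(splits), [len(test) for _, test in splits])  # 5 [4, 4, 4, 4, 4]
```

Each observation lands in exactly one test fold per repetition, which is the property I would want a custom resampling option in the web interface to preserve.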
How can I "activate" my dataset? E.g. https://www.openml.org/d/41110