openml / OpenML

Open Machine Learning
https://openml.org
BSD 3-Clause "New" or "Revised" License

Test Server Synchronisation #386

Open janvanrijn opened 7 years ago

janvanrijn commented 7 years ago

I have the impression that the Test Server is hopelessly out of sync with the live server. For example, most of the UCI datasets are either unavailable or not linked correctly, e.g.:
https://test.openml.org/data/download/2/anneal
https://test.openml.org/data/download/24/mushroom
https://test.openml.org/data/download/42/soybean

This makes large-scale testing of new features pretty much impossible. Would it be possible to
a) correct this (at least for most of the UCI datasets), and
b) synchronize the servers at regular intervals, so that, for example, every week the test server is guaranteed to hold a direct copy of the live server?
(In this case I do not mean for backup purposes, but for testing purposes.)

I can imagine that @giuseppec and @mfeurer would also benefit greatly from this.
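
To make the problem reproducible, the availability check on the download links above could be sketched roughly as follows. This is only an illustration using the Python standard library: the helper names `http_status` and `find_broken` are hypothetical, and treating any non-200 response as "broken" is an assumption (a server could also answer 200 with an error page, which this sketch would not catch).

```python
from urllib.error import HTTPError
from urllib.request import urlopen

# Download URLs quoted in this issue.
URLS = [
    "https://test.openml.org/data/download/2/anneal",
    "https://test.openml.org/data/download/24/mushroom",
    "https://test.openml.org/data/download/42/soybean",
]


def http_status(url, timeout=10):
    """Return the HTTP status code for url, or None if no response arrives."""
    try:
        with urlopen(url, timeout=timeout) as response:
            return response.status
    except HTTPError as err:
        # The server answered, but with an error status such as 404 or 500.
        return err.code
    except OSError:
        # DNS failure, refused connection, timeout, and similar network errors.
        return None


def find_broken(urls, fetch=http_status):
    """Return (url, status) pairs for every URL that does not answer 200.

    `fetch` is injectable so the logic can be exercised without network access.
    """
    results = [(url, fetch(url)) for url in urls]
    return [(url, status) for url, status in results if status != 200]
```

Calling `find_broken(URLS)` would return the links that currently fail together with their status codes. Running the same check against the corresponding live-server links would show which datasets are missing only on the test server.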