automl / TabPFN

Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.
http://priorlabs.ai
Apache License 2.0

Can you give an index list of the 18 datasets used for Table 1? #65

Closed: yxzwang closed this issue 10 months ago

yxzwang commented 10 months ago

Hello, I'm running your code and would like to know which 18 numerical datasets from OpenML-CC18 you used for Table 1 in your paper. The 'cc_test_datasets_multiclass' list used in RunFullDatasetAnalyses.ipynb contains all 30 datasets. Can you provide an index list of the 18 used for Table 1? Thank you.

noahho commented 10 months ago

I think you can get the list by running the "Datasets" section of this notebook: https://github.com/automl/TabPFN/blob/main/tabpfn/RunFullDatasetAnalyses.ipynb
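
For readers who want an index list directly, here is a minimal sketch of such a filter. It is not the notebook's own code. It assumes that "numerical" means a dataset has no categorical features and no missing values, and that each entry of 'cc_test_datasets_multiclass' is laid out as [name, X, y, categorical_feats, ...]. Both assumptions should be checked against the notebook. The function name `select_numerical_datasets` and the toy data are hypothetical.

```python
import math


def select_numerical_datasets(datasets):
    """Return (index, name) pairs for datasets that look purely numerical.

    Assumption: each entry is laid out as [name, X, y, categorical_feats, ...],
    where X is a 2D collection of numbers and categorical_feats lists the
    column indices of categorical features.
    """
    selected = []
    for idx, ds in enumerate(datasets):
        name, X, categorical_feats = ds[0], ds[1], ds[3]
        # A dataset with any categorical column is not purely numerical.
        has_categorical = len(categorical_feats) > 0
        # Scan every cell for NaN; float() also accepts array/tensor scalars.
        has_missing = any(math.isnan(float(v)) for row in X for v in row)
        if not has_categorical and not has_missing:
            selected.append((idx, name))
    return selected


# Toy stand-in for cc_test_datasets_multiclass (hypothetical data).
toy = [
    ["all_numeric", [[1.0, 2.0], [3.0, 4.0]], [0, 1], []],
    ["has_categorical", [[1.0, 0.0], [3.0, 1.0]], [0, 1], [1]],
    ["has_missing", [[1.0, float("nan")], [3.0, 4.0]], [0, 1], []],
]
print(select_numerical_datasets(toy))
```

If the assumed layout holds, calling `select_numerical_datasets(cc_test_datasets_multiclass)` in the notebook would give candidate indices to compare against the 18 datasets reported in Table 1.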