automl / TabPFN

Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.
http://priorlabs.ai
Apache License 2.0
1.19k stars 107 forks source link

EnsembleTabPFN for bigger than 1000 rows dataset #87

Closed ericleonardo closed 5 months ago

ericleonardo commented 5 months ago

Please, check the EnsembleTabPFN if you want to use TabPFN with bigger dataset: https://github.com/ersilia-os/chempfn

There is no 1000 rows or 100 features limitation like original TabPFN. The authors could build a new version for dealing with Imbalanced class dataset also.

Thank you!

SamuelGabriel commented 5 months ago

Hey, this looks great. This codebase neither has the limits strictly enforced anymore either. You can set config flags to overwrite both, features are subsampled in that case.

We are revamping this project a lot in a few months, though, and will update with a new version which deals with this, too.