alan-turing-institute / ARC-LoCoMoSeT

Low-Cost Model Selection for Transformers
MIT License
1 stars 0 forks source link

Renggli implementation does not stratify by label when splitting data #101

Closed jack89roberts closed 8 months ago

jack89roberts commented 8 months ago

May be relevant for small dataset sizes with imbalanced classes, i.e. if there are very few samples of some classes in the training set