huggingface / setfit

Efficient few-shot learning with Sentence Transformers
https://hf.co/docs/setfit
Apache License 2.0
2.24k stars, 222 forks

Is it possible to train `setfit` models with the Hugging Face Trainer? #568

Open umarbutler opened 3 weeks ago

umarbutler commented 3 weeks ago

Is it possible to train `setfit` models with the Hugging Face Trainer, so that we have control over hyperparameters such as `bf16`, the optimiser (e.g., I prefer to use an 8-bit implementation of AdEMAMix), optimiser hyperparameters (e.g., Adam beta1 and beta2, AdEMAMix T and beta3), the data collator, etc.?