neuralmagic / sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Apache License 2.0
2.08k stars 148 forks source link

TRL SFTTrainer Examples #2211

Closed Satrat closed 7 months ago

Satrat commented 8 months ago

Asana ticket: https://app.asana.com/0/1201735099598270/1206486351032763/f

Testing

See examples in integrations/huggingface-transformers/tutorials/text-generation/trl_mixin

robertgshaw2-neuralmagic commented 8 months ago

Thanks Sara - this looks really nice

Are there any other features we should flex? I am thinking we might want to look at:

Satrat commented 8 months ago

Thanks Sara - this looks really nice

Are there any other features we should flex? I am thinking we might want to look at:

  • FSDP
  • Distillation

Sure I'll test both of these scenarios, but if it ends up being more than tweaking to get FSDP working I'm going to leave that for another ticket :)

Edit: both worked with some minor tweaks!