axolotl-ai-cloud / axolotl

Go ahead and axolotl questions
https://axolotl-ai-cloud.github.io/axolotl/
Apache License 2.0
7.66k stars 839 forks source link

documentation - flan-t5 #211

Open winglian opened 1 year ago

winglian commented 1 year ago

regular sft and lora sft are supported , but only on the q and k modules.

NanoCode012 commented 1 year ago

What does sft stand for?

winglian commented 1 year ago

supervised fine tuning

Nima-Nilchian commented 5 months ago

is or will be the aya model which comes from mt5, supported for finetuning? @winglian

SoshyHayami commented 3 months ago

i'm very interested in this. but can't find any scripts in the 'examples'. enc-dec models have huge untapped potential.