Open winglian opened 1 year ago
What does sft stand for?
supervised fine tuning
is or will be the aya model which comes from mt5, supported for finetuning? @winglian
i'm very interested in this. but can't find any scripts in the 'examples'. enc-dec models have huge untapped potential.
regular sft and lora sft are supported , but only on the
q
andk
modules.