kbressem / medAlpaca

LLM finetuned for medical question answering
GNU General Public License v3.0

is this script compatible with flan-t5? #16

Closed by 2533245542 1 year ago

2533245542 commented 1 year ago

Is the training script compatible with flan-t5? For example, if I change the model name to google/flan-t5-xxl, will it work?

On my task, I found that flan-t5 models work much better than LLaMA, and I want to see whether fine-tuning can improve them further.

Thanks

kbressem commented 1 year ago

This depends on what you want to do. Training in fp16/bf16 will probably work. For LoRA, I don't know; you may need to specify different target layers in the model to make it work.
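
To illustrate the point about target layers: in the Hugging Face implementations, LLaMA names its attention projections `q_proj` and `v_proj`, while T5 and flan-t5 name them `q` and `v`, and T5 is a sequence-to-sequence model rather than a causal one. The following is a minimal sketch using `peft` and `transformers`, not the repository's actual training script. The helper names `lora_target_modules` and `build_lora_model`, the name-based family detection, and the LoRA hyperparameters are all assumptions made for illustration.

```python
from typing import List

# Attention projection names per model family in the Hugging Face implementations.
# The mapping and the substring-based detection are illustrative assumptions.
LORA_TARGETS = {
    "llama": ["q_proj", "v_proj"],
    "t5": ["q", "v"],  # also matches flan-t5 checkpoints
}


def lora_target_modules(model_name: str) -> List[str]:
    """Return the LoRA target module names for a known model family."""
    name = model_name.lower()
    for family, targets in LORA_TARGETS.items():
        if family in name:
            return targets
    raise ValueError(f"No LoRA target modules known for {model_name!r}")


def build_lora_model(model_name: str):
    """Load a base model and wrap it with LoRA adapters (hypothetical helper)."""
    # Imported here so the lookup helper above works without these packages.
    from peft import LoraConfig, TaskType, get_peft_model
    from transformers import AutoModelForCausalLM, AutoModelForSeq2SeqLM

    is_t5 = "t5" in model_name.lower()
    # T5 is an encoder-decoder model, so it needs the seq2seq model class.
    model_cls = AutoModelForSeq2SeqLM if is_t5 else AutoModelForCausalLM
    model = model_cls.from_pretrained(model_name)

    config = LoraConfig(
        r=8,                # illustrative values, not tuned
        lora_alpha=16,
        lora_dropout=0.05,
        bias="none",
        target_modules=lora_target_modules(model_name),
        task_type=TaskType.SEQ_2_SEQ_LM if is_t5 else TaskType.CAUSAL_LM,
    )
    return get_peft_model(model, config)
```

Calling `build_lora_model("google/flan-t5-xxl")` would download the full checkpoint, so a smaller flan-t5 variant is a more practical first test. The data collation and prompt format would probably also need to change for an encoder-decoder model, which this sketch does not cover.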