Closed ChuxiJ closed 7 months ago
Thanks for your question @ChuxiJ , and I'd answer them here:
Flan-t5-large
is the base model. You can checkout the first section and the experimental results for details. All these details are already in the paper.
LoraHub is a really great idea, similar to a few ideas I thought of yesterday.