Problems when changing t5 model to other Causal Models

Hi,

I am trying to use this framework on causal models such as llama based models and other LLMs. For my case, I use Tinyllama and Pythia to replace the T5 model in the original pipeline (TinyLlama-1.1B-Chat-v1.0 and EleutherAI/pythia-1.4b).

However, after I replace the model and run through all the steps provided in the code, which is, using the reasoning from GPT to fine-tune a smaller model (in this case Tinyllama and Pythia) and also use the external knowledge from KB. The response of the fine-tuned model is not readable and performs badly. For example, in medqa dataset, using wikipedia KB, and the provided reranker, the distilled model response text like

"A correct: that5). ( answer5C is the to of answer is the A is root ( A:). C, - A

also: for:. C, :) is of: isC.: 1 C a ThereforeC =:),,, C"

which is not a readable sentence and definitely fail in this task. I want to know why this happen and hope you can give me some possible explanations.

Other details:

To make sure I reproduce the code correctly, I also perform the pipeline on FlanT5-base, which works fine
When distilling causal models, I use the "run_clm_deepspeed.py" code instead of the seq2seq one
The reproduce process is based on medqa dataset and using wikipedia as the knowledge base

Nardien / KARD

Problems when changing t5 model to other Causal Models #1