microsoft / LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs
https://aka.ms/GeneralAI
MIT License
3.56k stars 268 forks

[MiniLLM] teacher generated responses `gen_answer` not used in seqKD #250

Closed: hieuchi911 closed this issue 3 days ago

hieuchi911 commented 1 month ago

I'm running sequence-level KD (SeqKD) on LLaMA. In the first step, generating responses with the teacher:

Is this a bug?

t1101675 commented 1 month ago

We checked our original code. We set the "output" field to the value of "gen_answer" before processing the generated data for SeqKD. Thanks for pointing this out. We will clarify this in the README.
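
For anyone reproducing this step, here is a minimal sketch of that field swap. It is not the repository's actual script. It assumes the teacher generations are stored as JSON Lines with one record per line, and the names `use_teacher_answers` and `rewrite_jsonl` are hypothetical; only the "output" and "gen_answer" field names come from this thread.

```python
import json


def use_teacher_answers(records):
    """Return copies of the records with "output" set to "gen_answer".

    Records without a "gen_answer" field are copied unchanged.
    """
    updated = []
    for rec in records:
        new_rec = dict(rec)  # shallow copy so the input list is not mutated
        if "gen_answer" in new_rec:
            new_rec["output"] = new_rec["gen_answer"]
        updated.append(new_rec)
    return updated


def rewrite_jsonl(src_path, dst_path):
    """Read a JSON Lines file, apply the swap, and write the result.

    Assumption: one JSON object per line; blank lines are skipped.
    """
    with open(src_path, encoding="utf-8") as src:
        records = [json.loads(line) for line in src if line.strip()]
    with open(dst_path, "w", encoding="utf-8") as dst:
        for rec in use_teacher_answers(records):
            dst.write(json.dumps(rec, ensure_ascii=False) + "\n")
```

Running something like `rewrite_jsonl` on the teacher-generated file before the SeqKD data processing step would make the student train on the teacher's responses rather than the original references. Check the file paths and format against your own setup.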

hieuchi911 commented 1 month ago

Gotcha, thanks for the clarification.