What is the role of Mistral Inference during Natural-Instruction dataset preparation?

First, I would like to congratulate you on winning first place in the NeurIPS2023 llm-efficiency challenge! 🥳🥳

I wrote an issue since I've got a question while reading Repo's README introduced Birbal. I understand that in the process of collecting training data, the Natural-Instruction dataset goes through 'relevant task selection' and inference is performed with the Mistral model. My question is why inference was performed? I could not fully understand by just referring README file 😭

I would appreciate it if you let me know!

Upaya07 / NeurIPS-llm-efficiency-challenge

What is the role of Mistral Inference during Natural-Instruction dataset preparation? #1