Hi,
Cool work! I'm wondering if there are any details about the data used to train MobileLLM from scratch -- I seem to have missed it in the paper. Is it similar to the data mixture used to train LLaMa-2?
Hello, thank you for your question. We did not use any in-house data to train MobileLLM. We are still working with our legal team on the dataset disclosure.
Hi, Cool work! I'm wondering if there are any details about the data used to train MobileLLM from scratch -- I seem to have missed it in the paper. Is it similar to the data mixture used to train LLaMa-2?