Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.
Apache License 2.0
562
stars
49
forks
source link
Eval dataset is hard coded to be "openwebtext_ppl" #121
Shouldn't be doing that. Related to #112