huggingface / cosmopedia

Apache License 2.0

Educational scoring dataset #20

Open peregilk opened 5 months ago

peregilk commented 5 months ago

@anton-l Are you also going to release the dataset with the scores used for training the classifier, i.e., the dataset referred to as "HuggingFaceTB/LLM_juries_fineweb_430k_annotations"?

It would be a great set for trying to replicate your results.