centre-for-humanities-computing / danish-foundation-models

A project for training foundational Danish language model
https://foundationmodels.dk
MIT License
68 stars 4 forks source link

Convert colossal oscar da #250

Closed peterbjorgensen closed 2 months ago

peterbjorgensen commented 3 months ago

Add colossal oscar dataset (da) part. The script can easily be extended to all the other languages.