ltgoslo / talk-of-norway

This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every speech is richly annotated with metadata pulled from different sources, and augmented with sentence, token, lemma, part-of-speech and morphological feature annotations.

Huggingface #8

Open simeneide opened 5 months ago

simeneide commented 5 months ago

Great dataset, I think it will do exactly what I want!

Do you have any plans to publish it as a Hugging Face dataset?

martigso commented 5 months ago

Thanks for the interest! Unfortunately, we have no plans to publish the data on Hugging Face at the moment.

@erikve has mentioned the possibility of updating the data in the future(?)

simeneide commented 5 months ago

OK. I just wanted to add that comment here, as I almost didn't find this dataset. It's not too hard to do, and it's great for discoverability :) I'd be happy to help out.