larsyencken / wide-language-index

An index of public broadcasts tagged by their primary language.

Add 268 new samples from RSS feeds. #42

Closed by larsyencken 8 years ago

larsyencken commented 8 years ago

This brings the total size of the dataset to about 32GB. Because of the cap of 50 samples per language, the new samples all go to languages that had not yet reached that cap.
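
For illustration, the per-language cap could be applied along these lines. This is a minimal sketch, not the code in this repository: the function name `select_under_cap`, the `(language, url)` tuple layout, and the language codes in the example are all hypothetical; only the cap of 50 comes from the description above.

```python
from collections import Counter

MAX_SAMPLES_PER_LANGUAGE = 50  # cap stated in the PR description


def select_under_cap(existing_counts, candidates, cap=MAX_SAMPLES_PER_LANGUAGE):
    """Return the candidates that still fit under the per-language cap.

    existing_counts: mapping of language code -> samples already indexed
    candidates: iterable of (language, url) pairs, in priority order
    (both shapes are assumptions made for this sketch)
    """
    counts = Counter(existing_counts)
    accepted = []
    for language, url in candidates:
        # Skip languages that have already reached the cap.
        if counts[language] < cap:
            accepted.append((language, url))
            counts[language] += 1
    return accepted


# Hypothetical example: "fra" is already full, "swa" has room for two more,
# and "yor" has no samples yet.
existing = {"fra": 50, "swa": 48}
candidates = [
    ("fra", "a"), ("swa", "b"), ("swa", "c"), ("swa", "d"), ("yor", "e"),
]
accepted = select_under_cap(existing, candidates)
```

Here `accepted` keeps only the two `swa` candidates that fit and the single `yor` candidate; the `fra` candidate and the third `swa` candidate are dropped.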