spraakbanken / sparv-pipeline

Språkbanken's text analysis tool
https://spraakbanken.gu.se/sparv
MIT License
25 stars 6 forks source link

Example corpora with example configs #69

Closed anne17 closed 4 years ago

anne17 commented 4 years ago

At the moment we only have test corpora which could probably work as example data, but when installing Sparv via pypi the user won't have access to those. It would be nice to be able to point out a URL in the user manual to a downloadable zip file with some test data. But where do we upload it? And can we automatically keep this data synced with this repository (i.e. when the test data changes, the example data changes as well)?

anne17 commented 4 years ago

We can zip the test corpora and attach them as downloadable assets to the release. Not sure if this can be automated somehow (maybe with GitHub workflows?) but it's really not that big a job to do it manually once for every release.