-
Add another route that provides information about the different Sparv export formats. This can be used by the frontend. Right now it is hard for a user to tell the formats apart.
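A minimal sketch of what such a route could return, assuming the backend keeps a static table of exporter names and descriptions. The names `EXPORT_FORMATS` and `export_formats_response` are hypothetical, and the description strings are placeholders rather than text taken from Sparv. Wiring the function to an actual route is left out.

```python
import json

# Hypothetical table: the keys follow Sparv's `module:function` naming pattern,
# but the entries and their descriptions are placeholders for illustration.
EXPORT_FORMATS = {
    "xml_export:pretty": "Placeholder description of the pretty-printed XML export.",
    "csv_export:csv": "Placeholder description of the CSV export.",
}


def export_formats_response() -> str:
    """Return the format table as a JSON string that a frontend could render."""
    payload = [
        {"exporter": name, "description": text}
        for name, text in sorted(EXPORT_FORMATS.items())
    ]
    return json.dumps({"export_formats": payload}, ensure_ascii=False)
```

Returning a list of objects rather than a bare mapping leaves room for more fields per format later (for example a file extension) without changing the response shape.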
-
What happens if the input data contains an annotation with the same name as a Sparv annotation (including namespace)? Should we build in a check for that? We could prefix the Sparv annotation with `sparv` in that case…
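One way such a check could look, as a sketch only: the function `resolve_name_clashes` is hypothetical, and joining the prefix with a dot is an assumption, since the issue does not say how the prefix would be attached.

```python
def resolve_name_clashes(input_names, sparv_names, prefix="sparv"):
    """Map each Sparv annotation name to a name that does not clash with the input.

    `input_names` are annotation names found in the source data and
    `sparv_names` are the names Sparv would produce. Both are plain strings.
    """
    taken = set(input_names)
    resolved = {}
    for name in sparv_names:
        # Rename only when the input data already uses the exact same name.
        # The "." separator is an assumption made for this sketch.
        resolved[name] = f"{prefix}.{name}" if name in taken else name
    return resolved


# Example: "sentence" exists in the input, so only that name gets the prefix.
mapping = resolve_name_clashes(["sentence", "text"], ["sentence", "token"])
```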
-
E.g. by typing `sparv plugins`. We would need to have some registry somewhere for this to work.
-
Image URL:
https://tidningar.kb.se/2656330/1853-02-12/edition/146134/part/1/page/1_thumb.jpg
Link to the edition:
https://tidningar.kb.se/2656330/1853-02-12/edition/146134/part/1/page/1/
F…
-
Right now the different options for the available segmenters can only be found in the code:
```
whitespace=nltk.WhitespaceTokenizer
linebreaks=LinebreakTokenizer
blanklines=nltk.BlanklineTokeniz…
-
Add the possibility to upload compressed files, or whole zip/tar archives:
```
curl -X PUT -u peter:XXX -F corpus_id=press -F "files[0]=@/absolute/path/to/localfile1.txt.gz" -F "files[0]=@/absolute/…
-
We want to use the `sparv schema` output to generate a config form in Mink, but some details needed to properly generate form components seem to be missing.
I pasted the schema at the [react-js…
-
A user should be able to upload a resource of type "metadata". This should be a YAML file matching one of the formats specified [in these templates](https://github.com/spraakbanken/metadata/tree/main/…
-
It would be nice to be able to use HTML files as the corpus. Here is code for converting HTML markup to plain text, which could possibly be turned into an HTML parser in Sparv:
```
from bs4 impo…
-
This version of hunpos behaves differently than the original compiled binary:
```
$ hunpos-tag suc3_suc-tags_default-setting_utf8.model < example.txt
jag PN.UTR.SIN.DEF.SUB
och UO
du PN.UTR.SIN…