hipster-philology / pyrrha

A language-independent post-correction app for POS-tagging and lemmatization
https://pyrrha.huma-num.fr
MIT License

Loading a large corpus (> 1 million tokens) #98

Closed: Jean-Baptiste-Camps closed this issue 2 years ago

Jean-Baptiste-Camps commented 5 years ago

I have a corpus of c. 2,500,000 tokens that I would like to add to Pyrrha for some batch corrections and verification, but copy/paste does not really work (38 MB of data). For such cases, an option to upload a file would be nice.

PonteIneptique commented 2 years ago

"Unfixable" more or less.