CentreForDigitalHumanities / tscan

T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf
GNU Affero General Public License v3.0
18 stars 6 forks source link

Feature/alpino lookup #71

Closed oktaal closed 1 year ago

oktaal commented 1 year ago

This allows uploading a pre-parsed Alpino treebank to skip parsing. It will also use a lookup to prevent having to re-parse the same sentence twice. (It will also store the parses in the INPUT folder, this is deliberate, because it will really speed things up when restarting a project).

Any generated Alpino-parses can be saved for download.

Furthermore, I've updated the tests to skip Alpino parsing (uses a pre-parsed treebank) and checked that it works with the latest version of all the dependencies.