harmonydata / harmony

The Harmony Python library: a research tool for psychologists to harmonise data and questionnaire items. Open source.
https://harmonydata.ac.uk
MIT License
8 stars 18 forks source link

Refactor PDF extraction to not use Spacy #11

Closed woodthom2 closed 4 months ago

woodthom2 commented 10 months ago

See training data in https://github.com/harmonydata/pdf-questionnaire-extraction

woodthom2 commented 5 months ago

See #39, this has partly been done

woodthom2 commented 4 months ago

Switched to Sklearn CRFSuite