o2r-project / o2r-meta

Metadata toolsuite for an extract-map-validate workflow supporting reproducible research
Apache License 2.0

Add NLP to extract structure and discovery metadata, including hypotheses #97

Open nuest opened 6 years ago

nuest commented 6 years ago

(original idea by @chriskray and @simonscheider - just cleaning up our internal ideas list!)

The extractor could use Natural Language Processing to find structural information and discovery metadata in the text part of a workspace, most importantly in the main document.

DKPro: https://github.com/DARIAH-DE/DARIAH-DKPro-Wrapper and https://code.google.com/archive/p/dkpro-tutorials/
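To make the idea concrete, here is a rough sketch of what a first extraction pass could look like. It is only an illustration under loud assumptions: it uses plain regular expressions from the Python standard library instead of a real NLP pipeline such as DKPro, and the cue phrases and the function names `split_sentences` and `find_hypotheses` are hypothetical, not part of o2r-meta.

```python
import re

# Hypothetical cue phrases (an assumption for illustration only);
# a real extractor would rely on a proper NLP pipeline instead.
HYPOTHESIS_CUES = re.compile(
    r"\b(we hypothesi[sz]e|hypothesis|we expect that)\b",
    re.IGNORECASE,
)


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def find_hypotheses(text):
    """Return the sentences that contain one of the hypothesis cue phrases."""
    return [s for s in split_sentences(text) if HYPOTHESIS_CUES.search(s)]


if __name__ == "__main__":
    sample = (
        "We study rainfall. "
        "We hypothesize that rainfall increases runoff. "
        "Data are from 2010."
    )
    for sentence in find_hypotheses(sample):
        print(sentence)
```

A keyword heuristic like this would miss most real phrasing, so it is probably only useful as a baseline to compare a proper NLP approach against.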

Potential targeted content: