severinsimmler / figur

Character recognition (Figurenerkennung) for German literary texts
MIT License

An important step in the quantitative analysis of narrative texts is the automatic recognition of references to characters (German: Figuren), a special case of the general NLP task of named entity recognition (NER).

NER models are usually not designed for literary texts, which results in poor recall on them. This easy-to-use package continues the work of Jannidis et al. using deep learning techniques.

Installation

$ pip install figur

Example

>>> import figur
>>> text = "Der Gärtner entfernte sich eilig, und Eduard folgte bald."
>>> figur.tag(text)
   SentenceId      Token      Tag
0           0        Der        _
1           0    Gärtner  AppTdfW
2           0  entfernte        _
3           0       sich     Pron
4           0     eilig,        _
5           0        und        _
6           0     Eduard     Core
7           0     folgte        _
8           0      bald.        _
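
The tagged output can be post-processed to collect only the character references. The following is a minimal sketch, not part of the package: it assumes that `_` marks tokens that are not character references (as the example output suggests) and that the result of `figur.tag` is a pandas DataFrame that could be converted with `to_dict("records")`. The helper `character_references` is hypothetical, and the rows are copied by hand from the example so that the snippet runs without the package installed.

```python
from collections import defaultdict

# Rows copied by hand from the example output above. In practice they would
# come from figur.tag(text), assumed here to be a pandas DataFrame that can
# be turned into this shape with df.to_dict("records").
rows = [
    {"SentenceId": 0, "Token": "Der", "Tag": "_"},
    {"SentenceId": 0, "Token": "Gärtner", "Tag": "AppTdfW"},
    {"SentenceId": 0, "Token": "entfernte", "Tag": "_"},
    {"SentenceId": 0, "Token": "sich", "Tag": "Pron"},
    {"SentenceId": 0, "Token": "eilig,", "Tag": "_"},
    {"SentenceId": 0, "Token": "und", "Tag": "_"},
    {"SentenceId": 0, "Token": "Eduard", "Tag": "Core"},
    {"SentenceId": 0, "Token": "folgte", "Tag": "_"},
    {"SentenceId": 0, "Token": "bald.", "Tag": "_"},
]


def character_references(rows, outside_tag="_"):
    """Group tokens by tag, skipping tokens that carry the outside tag.

    Hypothetical helper for illustration; `outside_tag` is an assumption
    based on the example output, not a documented constant of the package.
    """
    grouped = defaultdict(list)
    for row in rows:
        if row["Tag"] != outside_tag:
            grouped[row["Tag"]].append(row["Token"])
    return dict(grouped)


references = character_references(rows)
print(references)
# {'AppTdfW': ['Gärtner'], 'Pron': ['sich'], 'Core': ['Eduard']}
```

Grouping by tag keeps the different kinds of reference apart, so that, for example, only the tokens tagged `Core` can be counted per text.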

[Figure: character recognition (Figurenerkennung) statistics]

[Figure: confusion matrix]