We should be able to recognize verbs in their different morphological forms in German.
This will require:
[x] Add regular suffixes to the data
[x] Add functionality to add regular suffixes in the default pipeline; regular suffixes need to be added dynamically, taking into account the ending of the stem
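As a rough illustration of what "taking into account the ending of the stem" could mean for regular (weak) verbs, here is a minimal Python sketch. The function name and suffix lists are hypothetical and are not the project's actual data or API. It assumes two well-known rules: stems ending in -t/-d get an extra "e" before consonant-initial suffixes (arbeit + st -> arbeitest), and stems ending in a sibilant take -t instead of -st (tanz + st -> tanzt). Stems ending in consonant + m/n (e.g. "öffn") also need the extra "e" but are not handled here.

```python
# Hypothetical sketch: suffix lists and names are illustrative only.
PRESENT_SUFFIXES = ["e", "st", "t", "en"]
PAST_SUFFIXES = ["te", "test", "ten", "tet"]


def regular_verb_forms(stem: str) -> list:
    """Attach regular verb suffixes to a stem, adapting them to the stem ending."""
    needs_e = stem.endswith(("t", "d"))             # arbeit-, red-
    sibilant = stem.endswith(("s", "ß", "z", "x"))  # tanz-, reis-

    forms = []
    for suffix in PRESENT_SUFFIXES:
        if needs_e and suffix in ("st", "t"):
            suffix = "e" + suffix                   # arbeitest, arbeitet
        elif sibilant and suffix == "st":
            suffix = "t"                            # du tanzt, not *tanzst
        forms.append(stem + suffix)
    for suffix in PAST_SUFFIXES:
        forms.append(stem + ("e" if needs_e else "") + suffix)  # arbeitete

    # Remove duplicates (e.g. "tanzt" twice) while keeping the order.
    return list(dict.fromkeys(forms))
```

For example, `regular_verb_forms("arbeit")` contains "arbeitest" and "arbeitete", while `regular_verb_forms("mach")` contains "machst" and "machte".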
The following exceptions need to be implemented:
[ ] -el/-le stem change verbs
[x] Strong verbs
[ ] Irregular mixed verbs (make sure to check how the stemmer stems these irregular forms)
[ ] sein/haben: make sure that all forms of these verbs are in the list of function words, so that we don't need to take them into account when creating word forms
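For the -el/-le item, one possible approach is to generate a second stem variant with the "e" dropped, so that both "sammele" and "sammle" can be built from the same suffix list. The sketch below is only an assumption about how this could look; `el_stem_variants` is a hypothetical helper, not an existing function.

```python
def el_stem_variants(stem: str) -> list:
    """Return the stem plus its e-dropped variant for stems ending in -el.

    Example: sammeln has the stem "sammel", but the first person singular
    is usually "sammle", which is built on the variant stem "samml".
    """
    if stem.endswith("el") and len(stem) > 2:
        return [stem, stem[:-2] + "l"]
    return [stem]
```

Suffixes would then be attached to every variant: `[v + "e" for v in el_stem_variants("sammel")]` gives "sammele" and "sammle".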
Past participles:
[ ] If past participle occurs in keyphrase, reconstruct the stem from the participle form
[x] Construct a participle form from the stem found [partly implemented; some exception cases still require implementation]
[ ] Separable verbs: construct all single-word forms, i.e. construct the infinitive from the past participle and vice versa; possibly also construct subordinate-clause forms, where the verb is written as one word (e.g. "losgeht").
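To make the participle items more concrete, here is a hedged Python sketch for weak verbs only. All names and the prefix list are hypothetical. It assumes the usual pattern ge + stem + (e)t, with no "ge" for stems that start with an inseparable prefix or end in -ier, and with "ge" placed after a separable prefix. The prefix check is a crude heuristic (it requires at least three letters after the prefix) and will misclassify some stems, e.g. "beicht". Strong verbs and stems like "öffn" are out of scope.

```python
# Hypothetical, partial list of inseparable prefixes.
INSEPARABLE_PREFIXES = ("be", "emp", "ent", "er", "ge", "miss", "ver", "zer")


def has_inseparable_prefix(stem: str) -> bool:
    """Heuristic: a known prefix followed by at least three more letters."""
    return any(
        stem.startswith(p) and len(stem) - len(p) >= 3
        for p in INSEPARABLE_PREFIXES
    )


def weak_participle(stem: str, separable_prefix: str = "") -> str:
    """Build the past participle of a weak verb from its stem."""
    ending = "et" if stem.endswith(("t", "d")) else "t"
    takes_ge = not (has_inseparable_prefix(stem) or stem.endswith("ier"))
    return separable_prefix + ("ge" if takes_ge else "") + stem + ending


def stems_from_weak_participle(participle: str) -> list:
    """Return candidate stems for a weak participle (may be ambiguous)."""
    if participle.endswith(("tet", "det")):
        base = participle[:-2]          # gearbeitet -> gearbeit
    elif participle.endswith("t"):
        base = participle[:-1]          # gemacht -> gemach
    else:
        return []                       # not a weak participle
    candidates = [base]
    if base.startswith("ge") and len(base) > 4:
        candidates.append(base[2:])     # gemach -> mach
    return candidates
```

The reverse direction returns several candidates on purpose: "gehört" can belong to "hören" or to "gehören", so the caller would have to keep both stems or check them against a word list.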
In getForms:
[x] Apply regular suffixes
Stretch goals:
Recognize separable verbs, e.g. "losgehen" -> "geht los". Splitting these words is the easier part; the harder part is changing the overall morphology pipeline. At the moment, we build forms word by word, whereas this functionality needs to take the whole keyphrase into account. When matching, a match should occur only when both parts of a separable verb are found.
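A possible shape for the stretch goal, sketched in Python under several assumptions: the prefix list is a small hypothetical sample, the forms of the base verb are supplied by the caller (e.g. from the strong verb data), and word order and clause boundaries are ignored. It only shows the "both parts must be found" rule.

```python
# Hypothetical, partial list of separable prefixes.
SEPARABLE_PREFIXES = ("ab", "an", "auf", "aus", "ein", "los", "mit", "vor", "zu")


def split_separable(infinitive: str):
    """Split an infinitive into (prefix, base verb), or return None.

    Heuristic only: it will also split some non-separable verbs
    that happen to start with one of the prefixes.
    """
    for prefix in sorted(SEPARABLE_PREFIXES, key=len, reverse=True):
        if infinitive.startswith(prefix) and len(infinitive) - len(prefix) >= 4:
            return prefix, infinitive[len(prefix):]
    return None


def matches_separable(tokens, prefix, base_forms) -> bool:
    """Match a separable verb in a tokenized sentence.

    A match is either a joined form ("losgeht") or a form of the base
    verb together with the detached prefix ("geht ... los").
    """
    lowered = [token.lower() for token in tokens]
    joined_forms = {prefix + form for form in base_forms}
    if any(token in joined_forms for token in lowered):
        return True
    has_base = any(token in base_forms for token in lowered)
    return has_base and prefix in lowered
```

With `prefix, base = split_separable("losgehen")` and the forms `{"gehe", "gehst", "geht", "gehen"}`, the sentence "Er geht jetzt los" matches, while "Er geht nach Hause" does not, because the prefix is missing.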
Parent #2095