Closed kanayamah closed 4 years ago
Why don't you rely on the original PoS tag (XPOS) PPOSAT/PPOSS to consistently tag them DET and PRON?
In this corpus, XPOS is not "original". It was added later, together with the morphological features and lemmas, and it was predicted automatically.
But otherwise I agree that German possessives should be tagged DET
.
@dan-zeman thank you for explanation. Waiting for your fix!
Fixed in the dev branch. It was done by a script, so certain cases may be still unresolved. For example, ihr is ambiguous between non-possessive 2nd person plural pronoun, 3rd person singular feminine possessive determiner, 3rd person plural possessive determiner and (if upper/lowercase cannot be trusted) 2nd person honorific possessive determiner.
Many of possessive pronouns (
mein
,dein
, ...) havePRON
UPOS even though they work asdet
as in (1) below, but sometimes tagged asDET
as in (2). It is reasonable to tagPRON
for the real pronoun cases as (3) - though they are rare - and only those cases they have the lemmaich
insated ofmein
. Why don't you rely on the original PoS tag (XPOS)PPOSAT
/PPOSS
to consistently tag themDET
andPRON
?(1)
(2)
(3)