codeaudit / dkpro-core-asl

Automatically exported from code.google.com/p/dkpro-core-asl
0 stars 0 forks source link

POS tagset map for BNC maps common nouns to N #192

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
The tagset map for the BNC maps common nouns to N rather than to NN, see

dkpro/core/api/lexmorph/tagset/en-c5-pos.map

Is there a particular reason for that?

All the other tagset mappings in api.lexmorph map common nouns to NN

Original issue reported on code.google.com by eckle.kohler on 5 Aug 2013 at 7:27

GoogleCodeExporter commented 9 years ago
There is only "N" (Noun) in the 12 universal pos tags - cf. issue 76.

Original comment by richard.eckart on 5 Aug 2013 at 7:42

GoogleCodeExporter commented 9 years ago
I see, so that means all the other tagset mappings (besides BNC) should be 
updated then (someday) to the N of the universal tagset?

Original comment by eckle.kohler on 5 Aug 2013 at 7:44

GoogleCodeExporter commented 9 years ago
Either that or we reconsider on the universal tags if we figure that a more 
fine-grained distinction on nouns should be made. Strictly speaking, having NN 
and NP doesn't even contradict the 12 universal tags, since NN and NP are 
currently modeled as subtypes of N.

Still, the reason that there is only N in the BNC mapping is most likely that 
the mapping was written with the 12 universal tags in mind. I think it can just 
be updated.

Original comment by richard.eckart on 5 Aug 2013 at 7:51

GoogleCodeExporter commented 9 years ago
This issue was closed by revision r1755.

Original comment by eckle.kohler on 13 Aug 2013 at 7:56