hltfbk / Excitement-Open-Platform

Excitement Open Platform for Recognizing Textual Entailments
http://hltfbk.github.io/Excitement-Open-Platform/

[distsim] LIN-PROX has no nouns (German, CONLL corpus) #291

Open gilnoh opened 10 years ago

gilnoh commented 10 years ago

After successful generation and Redis conversion, the lexical resource based on Lin proximity works for German.

However, the resource has no (or almost no) nouns. I tried with various common German nouns, but couldn't generate any match.

For the moment, I do not know how to iterate over all rules (or all entries), so this is only a suspicion. But it is quite likely that LIN-PROX contains only ADV, V and ADJ entries.

Is this normal for LIN-PROX, or has something gone wrong?

gilnoh commented 10 years ago

You can reproduce this with the following intermediate-size corpus (1/30th of SDEWAC):
http://www.cl.uni-heidelberg.de/~noh/sdewac_part01.mstparsed.utf8.conll.gz
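
One quick sanity check before blaming the resource builder is to confirm that the input corpus itself carries noun tags in the column the builder reads. The following is a minimal sketch, not part of EOP: it assumes the file is tab-separated CoNLL-X style with the coarse POS tag in the fourth column (index 3), and the function names `count_pos_tags` and `count_pos_tags_in_file` are made up for illustration. If the parsed file uses a different column layout, adjust `pos_column` accordingly.

```python
import gzip
from collections import Counter


def count_pos_tags(lines, pos_column=3):
    """Count POS tags in an iterable of CoNLL-style lines.

    Assumption: fields are tab-separated and the POS tag sits at
    index `pos_column` (3 is the coarse POS column in CoNLL-X).
    """
    counts = Counter()
    for line in lines:
        line = line.rstrip("\n")
        # Blank lines separate sentences; '#' lines are treated as comments.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        # Skip malformed lines that are too short to hold a POS column.
        if len(fields) <= pos_column:
            continue
        counts[fields[pos_column]] += 1
    return counts


def count_pos_tags_in_file(path, pos_column=3):
    """Same as count_pos_tags, reading a gzip-compressed UTF-8 file."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        return count_pos_tags(handle, pos_column)
```

Running `count_pos_tags_in_file` on a local copy of the corpus above and printing `counts.most_common()` would show whether noun tags (for example `NN` or `NE`, if the corpus uses the STTS tagset) appear at all. If they do, the nouns are being lost somewhere after corpus reading rather than being absent from the input.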