dhowe / RiTaV1

RiTa: the generative language toolkit
http://rednoise.org/rita
GNU General Public License v3.0

Incorrect pluralizations in randomWord( "nns" ) #487

Open dhowe opened 6 years ago

dhowe commented 6 years ago

Some of these result from pluralizing mass nouns; some are simply incorrect pluralizations; some are not pluralized at all; and some may be mistakes in the dict (I've fixed a number of these already):

feedbacks, clouts, slowness (still singular), shortness (still singular), steepness (still singular), roach (still singular), sadisms, refrigerations, polices, braveries, wakes, distastes, upstairses, rooves (should be roofs), toothpastes, plutoniums

You can see the full list with the following Java code:


JSONLexicon lex = JSONLexicon.getInstance();
System.out.println(lex.getWordsWithPos("nns"));

cqx931 commented 6 years ago

Step1:

dhowe commented 6 years ago

Have we discussed removing 'vbg' from the dict?

dhowe commented 6 years ago

status @cqx931 ?

cqx931 commented 6 years ago

We have discussed removing 'vb*' in this ticket before: https://github.com/dhowe/RiTa/issues/357

Questions:

  1. Should getWordsWithPos("nns") return words ending in 'ness'/'ism' at all?
  2. I was thinking about ignoring words that are both 'nn' and 'vbg' if the request is getWordsWithPos("nns"). What are your thoughts on 'vbg' for this issue?

dhowe commented 6 years ago

  1. Should getWordsWithPos("nns") return words ending in 'ness'/'ism' at all?

No. These are not considered plurals in this case.

  2. I was thinking about ignoring words that are both 'nn' and 'vbg' if the request is getWordsWithPos("nns"). What are your thoughts on 'vbg' for this issue?

That makes sense to me; please proceed.
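
The two rules agreed on above could be sketched roughly as follows. This is only an illustration and not RiTa code: the class `NnsCandidateFilter`, its methods, and the word-to-tags map are all hypothetical, and it assumes that "nns" results are produced by pluralizing 'nn' entries. It skips any 'nn' entry that is also tagged 'vbg' and any entry ending in 'ness' or 'ism'.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical helper, NOT part of RiTa: selects the 'nn' entries that
// may be pluralized when a caller asks for "nns".
final class NnsCandidateFilter {

    // Suffixes named in this thread as not yielding plurals.
    private static final List<String> EXCLUDED_SUFFIXES = List.of("ness", "ism");

    // True if the word is tagged 'nn', is not also tagged 'vbg',
    // and does not end in one of the excluded suffixes.
    static boolean isPluralizable(String word, Set<String> tags) {
        if (!tags.contains("nn")) return false;
        if (tags.contains("vbg")) return false;
        for (String suffix : EXCLUDED_SUFFIXES) {
            if (word.endsWith(suffix)) return false;
        }
        return true;
    }

    // Returns the words that pass the filter, in the map's iteration order.
    // The lexicon shape (word -> set of POS tags) is an assumption.
    static List<String> candidates(Map<String, Set<String>> lexicon) {
        List<String> result = new ArrayList<>();
        for (Map.Entry<String, Set<String>> entry : lexicon.entrySet()) {
            if (isPluralizable(entry.getKey(), entry.getValue())) {
                result.add(entry.getKey());
            }
        }
        return result;
    }
}
```

A plain suffix check is deliberately crude: it would also drop countable nouns such as "prism" or "witness", so a real fix would probably need an exception list or a mass-noun flag in the dict.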