plurals / pluralize

Pluralize or singularize any word based on a count
MIT License
2.14k stars 180 forks source link

Singularize english words (R pluralize) #177

Open francoisfauteux opened 3 years ago

francoisfauteux commented 3 years ago

Testing with R pluralize on vwr english.words from CELEX (~66K) returns ~600 inconsistencies:

library(pluralize) library(vwr) dat<-data.frame(word=english.words,sing=singularize(english.words)) dat<-dat[which(dat$word!=dat$sing & !dat$sing %in% english.words),]

Examples:

abdomen: abdoman always: alway amaryllis: amarylli appendicitis: appendiciti asbestos: asbesto axis: axi

(...) and so on.

One possible workaround is to filter singulars that are not in a selected dictionary or lexicon.