amperser / proselint

A linter for prose.
http://proselint.com
BSD 3-Clause "New" or "Revised" License
4.36k stars 179 forks source link

Add check for using the right name for groups of things #132

Open suchow opened 9 years ago

suchow commented 9 years ago

e.g., a group of cats is a "glaring".

http://www.oxforddictionaries.com/us/words/what-do-you-call-a-group-of

j10sanders commented 7 years ago

Do you really want these all implemented? Most are humorous/antiquated terms that wouldn't be used seriously in modern prose.

suchow commented 7 years ago

Yes! It's true that some venery terms are so rare that they might come across as humor, but plenty are still in modern use and better than generic terms such as "bunch" — e.g., a swarm of bees, a herd of deer, a flock of sheep.

j10sanders commented 7 years ago

Ok!

I could do this with a list structure of the various terms and their recommended replacements (like most other proselint modules), but I think this would be a good place to start using pattern http://www.clips.ua.ac.be/pages/pattern-en#quantify

@suchow, do you agree?

Edit: nevermind. pattern doesn't appear to have support for this.

suchow commented 7 years ago

Do you think spaCy could fill the same role? It's something I've been interested in integrating with Proselint for some time.

j10sanders commented 7 years ago

I've been messing around with spaCy for the last hour... it's very cool. The main drawback for this project is that the package uses about 1.5 GB of disk space.

j10sanders commented 7 years ago

I'm going to do this the manual way

j10sanders commented 7 years ago

@suchow, do you have a moment to review my pull request? Thanks!