first20hours / google-10000-english

This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of Google's Trillion Word Corpus.

Add swear-free lists and lists grouped by word length #12

Closed jakebathman closed 7 years ago

jakebathman commented 7 years ago

I've adapted these lists to add a bit more, and wanted to contribute the changes back if you're interested.

This adds two filtered lists with common swear words removed, and three additional lists derived from the US English 10k list and grouped by word length (1-4, 5-8, and 9+ characters).
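
For anyone who wants to build similar lists themselves, here is a minimal sketch of the two steps described above. It is not the script used for this PR. The function names, the bucket labels, and the sample words are all hypothetical, and it assumes the input is already in frequency order with one word per entry.

```python
from typing import Dict, Iterable, List


def remove_swears(words: Iterable[str], swears: Iterable[str]) -> List[str]:
    """Return words in their original order, dropping any found in swears.

    Matching is case-insensitive and exact (whole word only).
    """
    blocked = {s.lower() for s in swears}
    return [w for w in words if w.lower() not in blocked]


def group_by_length(words: Iterable[str]) -> Dict[str, List[str]]:
    """Split words into three buckets: 1-4, 5-8, and 9+ characters.

    The bucket names ("short", "medium", "long") are hypothetical labels.
    Frequency order is preserved inside each bucket.
    """
    groups: Dict[str, List[str]] = {"short": [], "medium": [], "long": []}
    for w in words:
        n = len(w)
        if n <= 4:
            groups["short"].append(w)
        elif n <= 8:
            groups["medium"].append(w)
        else:
            groups["long"].append(w)
    return groups


# Illustrative input only, not taken from the actual lists.
sample = ["the", "of", "damn", "people", "information"]
clean = remove_swears(sample, swears=["damn"])
buckets = group_by_length(clean)
```

Exact whole-word matching keeps words that merely contain a blocked string, at the cost of missing inflected forms unless they are listed explicitly.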