first20hours/google-10000-english
This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of Google's Trillion Word Corpus.
Other
3.93k stars
1.93k forks
issues
#47 | Minor issue: duplicated words | Phil-bits | opened 4 weeks ago | 0 comments
#46 | why is "usergroupsusergroups" in the 20k list?? | ivan006 | opened 5 months ago | 1 comment
#45 | UPdated testing | pratik22dahikar | opened 1 year ago | 1 comment
#43 | Several single character words are not words | barrybriggs | opened 2 years ago | 6 comments
#42 | Hangman | 27-04-2009 | closed 2 years ago | 0 comments
#41 | divx is not an english word | frankh | opened 2 years ago | 0 comments
#40 | Create sample | Sashank-B | opened 2 years ago | 1 comment
#39 | Eng-words | stepkar | opened 2 years ago | 0 comments
#38 | there is no word 'adept' ? | pencilCool | opened 3 years ago | 4 comments
#36 | Small fixes to the USA lists | obar | opened 3 years ago | 0 comments
#35 | Removing the non-word "sublimedirectory" | AllenDowney | opened 3 years ago | 0 comments
#34 | Update README.md | AmanSharma123456 | opened 4 years ago | 0 comments
#33 | Spelling corrections. | ghost | opened 4 years ago | 0 comments
#32 | Could we get a SFW 20k word list? | Anonymus1 | opened 4 years ago | 0 comments
#31 | 'voyuer' is not a real word | vkalantar | opened 4 years ago | 1 comment
#30 | Various brand names are included | vkalantar | opened 4 years ago | 0 comments
#29 | "foto" listed, although does not appear to be a word | ghost | closed 4 years ago | 0 comments
#28 | "profileprofile" word (duplicated) | Idan503 | opened 4 years ago | 0 comments
#27 | How do you get it to work? | jw4wellness | opened 4 years ago | 1 comment
#26 | I don't think "mai" is an English word. It might be French. | whitten | opened 5 years ago | 2 comments
#25 | Removed string 'sbjct' | jbalcorn | opened 5 years ago | 0 comments
#24 | Remove more NSFW words from no-swears files | nickvollmar | closed 5 years ago | 1 comment
#23 | Contractions? | brandonchinn178 | opened 5 years ago | 0 comments
#22 | Fixed broken URLs and updated all to https | hingston | closed 5 years ago | 0 comments
#21 | Clearer copyright | HubKing | closed 3 years ago | 4 comments
#20 | Masturbating? | ghost | closed 5 years ago | 0 comments
#19 | Porn website | DGoedtkindt | closed 6 years ago | 0 comments
#18 | All words are lowercased, even Proper Nouns | giorgio79 | opened 6 years ago | 0 comments
#17 | Remove more swear words from no swears files | Elizafox | closed 7 years ago | 0 comments
#16 | Some bad words not filtered from clean versions | Elizafox | closed 7 years ago | 1 comment
#15 | Is there a Spanish version? | BayInternetGroup | opened 7 years ago | 4 comments
#14 | missing very common words | mdtr | opened 7 years ago | 1 comment
#13 | Not In Order by Frequency in English Language | totallyuneekname | opened 7 years ago | 0 comments
#12 | Add swear-free lists and lists grouped by word length | jakebathman | closed 8 years ago | 0 comments
#11 | Unclear license | l0b0 | closed 8 years ago | 1 comment
#10 | top 10k english words that are words? | tedder | closed 8 years ago | 2 comments
#9 | Replace the last half of 20k.txt using count_1w.txt #6 | koseki | closed 8 years ago | 1 comment
#8 | Fixed Peter Norvig's last name. :-) | dmuth | closed 8 years ago | 1 comment
#7 | A | huynhminh020391 | closed 8 years ago | 0 comments
#6 | Why are there ~1500 duplicate words here? | farzher | closed 8 years ago | 5 comments
#5 | Add alternative list with American English spellings | skotzko | closed 9 years ago | 0 comments
#4 | Licence | hugovk | closed 9 years ago | 2 comments
#3 | 20000 words | daviddliu | closed 9 years ago | 0 comments
#2 | Remove trailing \t characters | vgel | closed 10 years ago | 0 comments
#1 | Frequency Fail | trans | closed 12 years ago | 1 comment