first20hours / google-10000-english

This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of the Google's Trillion Word Corpus.
Other
3.88k stars 1.93k forks source link

Several single character words are not words #43

Open barrybriggs opened 1 year ago

barrybriggs commented 1 year ago

81: c 82: e 90: s 98: x

abhradeepde123 commented 7 months ago

yes i noticed too lemme fix it with some python rn

abhradeepde123 commented 7 months ago

got some indexes assign me ill fix it in abt a week

abhradeepde123 commented 7 months ago

nvm deleted most fake words

adelgamer commented 7 months ago

Can you share the link to this project please?

On Wed, Jan 24, 2024, 18:55 Abhradeep De @.***> wrote:

nvm deleted most fake words

— Reply to this email directly, view it on GitHub https://github.com/first20hours/google-10000-english/issues/43#issuecomment-1908645604, or unsubscribe https://github.com/notifications/unsubscribe-auth/AO7IGDWZWHD35TRRSUYVJWDYQFDKLAVCNFSM6AAAAAAR2ZUAKKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSMBYGY2DKNRQGQ . You are receiving this because you are subscribed to this thread.Message ID: @.***>

abhradeepde123 commented 7 months ago

assign me already goddamnit i have the mostly fixed text file

abhradeepde123 commented 7 months ago

oh i didnt see the email comment... whoops but here you go dictionary-en.txt (the whole file not just short words)