dwyl / english-words

:memo: A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion
The Unlicense
10.46k stars 1.83k forks source link

All words lowercase - capitalization is lost #32

Open giorgio79 opened 6 years ago

giorgio79 commented 6 years ago

I noticed abbreviations and proper nouns are all lowercased unfortunately. Eg: iso.

tomprogers commented 6 years ago

I don't think that's entirely accurate. I found 78,947 strings in words.txt that include capital letters, many of which appear to be meaningful. A few samples, as they appear in the file:

When sorted alphabetically (using unix sort command), "ISO" is entry 34,630. I found "iso-" at index 240,130. I suspect that's the entry you're seeing, although I'll admit I don't know which file you're looking in.