Open Crissium opened 6 months ago
Heya,
On Sunday, I will have a look.
Thanks!
Heya,
I have added the ones you placed in normal view:
*** Words suggested by Crissium - START ***
89600) batt + batts + batt's
89601) battels
89602) battement + battements + battement's
89603) batterie + batteries + batterie's
89604) battery-operated
89605) battily
89606) battiness + battiness's
89607) battle-scarred
89608) battlewagon + battlewagons + battlewagon's
89609) bauera + bauera's
89610) bavardage + bavardage's
89611) bavarois + bavaroises + bavarois's
89612) bavian + bavians + bavian's
89613) bawdry + bawdries + bawdry's
89614) bawheid + bawheids + bawheid's
89615) bawley + bawleys + bawley's
89616) bawn + bawns + bawn's
89617) baya + bayas + baya's
89618) bayadère + bayadères + bayadère's
89619) baza + bazas + baza's
89620) bazillionaire + bazillionaires + bazillionaire's
89621) bazoom + bazooms + bazoom's
89622) artisanship + artisanships + artisanship's
89623) freedwoman + freedwoman's
89624) haply
89625) harrowingly
89626) riviera + rivieras + riviera's
89627) topstitch +s + ing +ed
89628) tsardom + tsardoms + tsardom's
89629) uhlan+ uhlans + uhlan's
89630) Brontë + Brontë's
89631) Gaskell + Gaskell's
89632) Turgenev + Turgenev's
89633) Zola + Zola's
*** Words suggested by Crissium - END ***
"bazoo" Oxford claims to be a US word.
Thank you for the wordlist, I have downloaded it and will slowly add them.
It may take a very long time to add so many words as they have to be analysed one by one to see if they accept plurals, etc.. Currently I lack the spare time as I have an ongoing PhD.
It may take years to add them, anyway.
Hi Marco, just found your website and I am impressed the amount of work you put into this. I just found another source of names that might need to be included but hopefully does not need too much checking? https://www.iamnotatypo.org/dear-tech-giants It is the names of people from the UK's office of national statistics or a CSV hosted in the above website.
I made a comment on this bug report as I feel it is a general all languages issue especially in a more global world: https://github.com/hunspell/hunspell/issues/113
My question in your case: how can I help? Both with the list mentioned by Crissum and this new proposed list. I can make pull request for the latter.
How did you add the names of cities for example? I found fairly small towns included like Frimely.
Heya, @amunizp ,
Currently, I can't do much since I have a private PhD presentation in two weeks and I don't know later when I will have a real presentation.
I can't focus much on things that take whole days/weeks/months to do, I am just doing basic stuff, such as creating a rule or two in LanguageTool, updating the website with small enhancements, etc.
Here is my plan: On 1-NOV I will release an update for the dictionary because it is the deadline in Gerrit for the next major release of LibreOffice.
On 1-JAN-2025 I will make another release, and if I already have the PhD, things will go back to normal in 2025.
This is my plan.
Thanks!
I acquired an electronic version of the Oxford Dictionary of English and compared the list of headwords with
en_GB (Marco Pinto)/wordlist_marcoagpinto_20240501_276252w.txt
. I found 72,767 missing words in total but of course this list requires manual proofreading. Here's a sample:Among these 'bawdry' seems like a serious omission, 'bbl.' actually already exists as 'bbl' and 'be-' should not be included, but others are probably too rare to be of any interest…
I browsed through the list and did find some other words that at least I don't have to consult a dictionary to understand:
nonexisting_words.txt
By the way, I'd like to suggest a couple of proper nouns: Brontë, Gaskell, Turgenev, Zola. Actually I removed all headwords with white space. I could upload another version with proper nouns included.