barrust / pyspellchecker

Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/
MIT License
691 stars 101 forks source link

Added Dutch language #162

Closed cast42 closed 7 months ago

cast42 commented 7 months ago

Hi, i used your python script to count frequencies from the subtitles in Dutch language. Next, I cleaned the obtained dictionary using the list of word from OpenTaal. The list contains correct spelled words in Dutch but no frequencies. I manually reviewed all words that were not in the OpenTaal list and added the correct spelling and it's variants.

I hope this PR helps to add Dutch ('nl') as a language to the package pyspellchecker.

If there's anything I can do to further help, please let me know.

Thank you for making and sharing pyspellchecker!

Kind regards, Lode

codecov-commenter commented 7 months ago

Codecov Report

Merging #162 (1847ea0) into master (282eb15) will not change coverage. The diff coverage is 100.00%.

Additional details and impacted files [![Impacted file tree graph](https://app.codecov.io/gh/barrust/pyspellchecker/pull/162/graphs/tree.svg?width=650&height=150&src=pr&token=4a1cJEFN28&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=Tyler+Barrus)](https://app.codecov.io/gh/barrust/pyspellchecker/pull/162?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=Tyler+Barrus) ```diff @@ Coverage Diff @@ ## master #162 +/- ## ======================================= Coverage 92.20% 92.20% ======================================= Files 4 4 Lines 295 295 ======================================= Hits 272 272 Misses 23 23 ``` | [Files](https://app.codecov.io/gh/barrust/pyspellchecker/pull/162?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=Tyler+Barrus) | Coverage Δ | | |---|---|---| | [spellchecker/spellchecker.py](https://app.codecov.io/gh/barrust/pyspellchecker/pull/162?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=Tyler+Barrus#diff-c3BlbGxjaGVja2VyL3NwZWxsY2hlY2tlci5weQ==) | `99.11% <100.00%> (ø)` | |