Open tiff opened 5 years ago
I think the same should apply to file names:
List of common file extensions: https://www.computerhope.com/issues/ch001789.htm
List of most common TLDs https://www.lifewire.com/most-common-tlds-internet-domain-extensions-817511
Hi @tiff
In my opinion this starts to enter in a tricky line in sand. While the extension in itself are already covered in some languages, and can be easily ported to others (code appended), what preceds it - i.e. the website name, or the filename - may sometimes benefit from being spellcheck. I understand the relation to emails and full URLs, but those use to be copy pasted into the texts, so they hardly contain errors. However, filenames with full filepaths, should be ignored for the same reasons emails are, so I will add this to my list of things to do, although it won't come anytime soon.
<rule name="Ignore spelling of file names" id="IGNORE_SPELLING_OF_FILE_NAMES">
<!-- Localized from English by Tiago F. Santos, 2018-09-15 -->
<pattern>
<token/>
<token spacebefore="no">.</token>
<token spacebefore="no" regexp="yes">&extensoes_de_ficheiros;</token><!-- For more extensions, refer to https://fileinfo.com -->
</pattern>
<disambig action="ignore_spelling"/>
</rule>
<rule name="Ignore spelling of @user mentions" id="IGNORE_USER_MENTION">
<pattern>
<token regexp="yes">@.+</token>
</pattern>
<disambig action="ignore_spelling"/>
</rule>
<rulegroup name="Ignore spelling of #hashtags" id="IGNORE_HASHTAG">
<rule>
<pattern>
<token spacebefore="yes">#</token>
<token spacebefore="no"/>
</pattern>
<disambig action="ignore_spelling"/>
</rule>
I fixed the problem in English for file names that contain underscores (https://github.com/languagetool-org/languagetool/commit/bb50b97c995eb02919842507920b7536ab37d4c4).
I noticed that LanguageTool finds spelling errors in domain names without
www.
I think we should ignore them if they end with common tlds (com, org, gov, edu, net. info, eu, biz, com.mx, de, es, fr, nl, pt, co.uk, uk).
German:
English:
Spanish:
Portuguese:
French:
Polish:
Dutch and Russian are fine.