siteinspector / siteinspector

A tool for catching spelling errors, grammatical errors, broken links, and other errors on websites.
https://www.triplechecker.com
GNU Affero General Public License v3.0
417 stars 42 forks source link

Multi lang support #11

Open ntimo opened 3 years ago

ntimo commented 3 years ago

Hello, first off I would like to thank you for making this awesome web app.

I would like to suggest adding support for multiple languages. Maybe different languages could then be enabled per website.

Thanks, Timo

omohokcoj commented 3 years ago

Hi @ntimo, thanks for the feedback!

SiteInspector uses https://github.com/omohokcoj/ruby-spellchecker/ library for English spellchecking - that library contains a large list of common English spelling mistakes. Adding spellchecking for other languages is quite hard and requires a decent knowledge of the language - unfortunately, I don't know other languages well enough to be able to build a spellchecker for them.

So I think adding new languages is not possible at the moment unless there are some native speakers willing to work on adding new languages directly into the ruby-spellchecker library.

kosssi commented 3 years ago

@omohokcoj Maybe this repository can help for this question https://github.com/ONLYOFFICE/dictionaries ;)

Or http://hunspell.github.io/ with Ruby support (https://rubygems.org/gems/hunspell-ffi)

omohokcoj commented 3 years ago

@kosssi hunspell dictionary spellchecker treats all unknown words as mistakes and it produces too many false positives when checking website pages. Also, I used to try languagetool and it produced too many false positives as well - that's how I ended up developing a ruby spellchecker library from scratch :)

tldev-de commented 2 years ago

Perhaps it would be a good solution to be able to configure the language per website. In addition to the languages available in Hunspell, one could then also offer the error detection you developed as less noisy optimization for english language.

I would love to use this tool, but most (for me important) websites are only available in german language, which makes siteinspector a lot less effective!

RafaelKr commented 2 years ago

Hi @omohokcoj, I just found your tool and it's awesome!

I have a multi-lingual site and it would be great if I could configure that spell checking should only be applied to pages which have set a lang="en-*" (lang="en-US", lang="en-GB", ...) attribute on the html-tag.
So I would have spell- and grammar-checking only for the english area of the page and broken link checking for english and others.

brunoamaral commented 1 year ago

This would be useful for Portuguese as well, so I am just raising my hand up in case I can help move the multilanguage support forward.

mediaarchitekt commented 1 year ago

Hi,

thank you very much for this great application. There is nothing else comparable to this.

Only one request: if multi-language support is not so easy to implement, it would be helpful to completely disable the reference to spell checking and grammar in the reports. So it is written that there are no mistakes, although grammar and spelling were just not checked. This is misleading and raises expectations to viewers, which cannot be met for all languages except English.

Have a good time Matthias

ecollart2 commented 12 months ago

Hello,

that's also a bit frustrating for me having my sites in French; I cannot use your spelling/grammar check but the broken link check part is already a treasure !

Eric Collart

WeterPeter commented 8 months ago

Still no multi language support? Planned?