languagetool-org / languagetool

Style and Grammar Checker for 25+ Languages
https://languagetool.org
GNU Lesser General Public License v2.1

[pt-BR] accented word "memoria" #4057

Open tiff opened 3 years ago

tiff commented 3 years ago

It did not catch a previously accented word: "memoria".

marcoagpinto commented 3 years ago

@tiff

Because without an accent, it is a verb.

I will fix it tomorrow.

marcoagpinto commented 3 years ago

Could you tell me the exact sentence for testing tomorrow?

jaumeortola commented 3 years ago

These errors (memoria/memória) can be detected with good precision in some contexts (e.g. after a preposition). Outside these contexts, some words may need greedier rules, which are harder to write and require a lot of testing. The first step is to look into the contexts where "memória" is used.

marcoagpinto commented 3 years ago

@jaumeortola @tiff

It seems to work in this case:

"Isto é a memoria que encontramos."

jaumeortola commented 3 years ago

An example where it doesn't work: "Ele tem memoria de elefante." You could take a corpus, extract the sentences containing "memória", replace memória → memoria, and check whether the errors are detected. Then try writing more rules for the cases that are missed.
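The corpus check described above could be sketched roughly as follows. This is only an illustration, not part of LanguageTool: the names `inject_error`, `find_missed`, and `languagetool_check` are hypothetical, and the local server URL `http://localhost:8081/v2/check` is an assumption about how a LanguageTool HTTP server might be running. The checker is passed in as a function, so any way of obtaining match offsets would do.

```python
import json
import re
import urllib.parse
import urllib.request
from typing import Callable, Iterable, List, Tuple

Span = Tuple[int, int]  # (offset, length) of one reported match


def inject_error(sentence: str, accented: str = "memória",
                 plain: str = "memoria") -> Tuple[str, List[int]]:
    """Replace each whole-word `accented` with `plain`.

    Returns the modified sentence and the offsets (in the modified
    sentence) where the unaccented form was injected. Case-sensitive.
    """
    pattern = re.compile(r"\b" + re.escape(accented) + r"\b")
    parts: List[str] = []
    offsets: List[int] = []
    last = 0   # end of the previous match in the original sentence
    pos = 0    # length of the output built so far
    for m in pattern.finditer(sentence):
        chunk = sentence[last:m.start()]
        parts.append(chunk)
        pos += len(chunk)
        offsets.append(pos)
        parts.append(plain)
        pos += len(plain)
        last = m.end()
    parts.append(sentence[last:])
    return "".join(parts), offsets


def find_missed(sentences: Iterable[str],
                check: Callable[[str], List[Span]],
                accented: str = "memória",
                plain: str = "memoria") -> List[str]:
    """Return modified sentences where an injected error was not flagged."""
    missed: List[str] = []
    for sentence in sentences:
        modified, offsets = inject_error(sentence, accented, plain)
        if not offsets:
            continue  # sentence does not contain the accented word
        spans = check(modified)
        for off in offsets:
            # An injected error counts as detected if any match overlaps it.
            detected = any(start < off + len(plain) and off < start + length
                           for start, length in spans)
            if not detected:
                missed.append(modified)
                break
    return missed


def languagetool_check(text: str, language: str = "pt-BR",
                       url: str = "http://localhost:8081/v2/check") -> List[Span]:
    """Query a LanguageTool HTTP server (assumed to be running at `url`)."""
    data = urllib.parse.urlencode({"text": text, "language": language}).encode("utf-8")
    with urllib.request.urlopen(url, data=data) as resp:
        payload = json.load(resp)
    return [(m["offset"], m["length"]) for m in payload.get("matches", [])]
```

With a server available, `find_missed(corpus_sentences, languagetool_check)` would list the sentences that still need a rule. Any match overlapping the injected word is counted as a detection, so a stricter version might also filter on the rule that fired.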

marcoagpinto commented 3 years ago

@jaumeortola

Yes, after the official release next week.

Right now I am too stressed. :-)

I just wanted to create as many comma rules as possible before the release, but I am worried that something might go wrong, although I still have a few days to check and fix things.