LR-POR / MorphoBr

Resources for morphological analysis of Portuguese
Apache License 2.0
spurious masculine nouns #60

Closed leoalenc closed 4 years ago

leoalenc commented 4 years ago

The following masculine nouns don't seem to exist in Portuguese:

apanha+N+M+SG
aventurança+N+M+SG
chega+N+M+SG
finança+N+M+SG
liga+N+M+SG
manda+N+M+SG

Excepting the last two, which were derived from DELAF-PB, all were inherited from GFL. Some of these spurious entries also have plural forms. The corresponding feminine entries are correct.
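A minimal sketch of how such candidates could be checked automatically, assuming each analysis has the `lemma+N+GENDER+NUMBER` shape shown above. The helper names and the small feminine sample are hypothetical and only illustrate the idea of flagging masculine entries whose lemma also has a feminine entry; they are not part of the MorphoBr tooling.

```python
from typing import NamedTuple


class Analysis(NamedTuple):
    lemma: str
    pos: str
    gender: str
    number: str


def parse_analysis(tag_string: str) -> Analysis:
    """Split a string such as 'chega+N+M+SG' into lemma and features."""
    lemma, pos, gender, number = tag_string.split("+")
    return Analysis(lemma, pos, gender, number)


# The six entries reported in this issue.
REPORTED = [
    "apanha+N+M+SG", "aventurança+N+M+SG", "chega+N+M+SG",
    "finança+N+M+SG", "liga+N+M+SG", "manda+N+M+SG",
]

# Hypothetical sample of feminine entries, for illustration only.
FEMININE_SAMPLE = ["apanha+N+F+SG", "liga+N+F+SG"]


def masculine_with_feminine_twin(masculine, feminine):
    """Return lemmas of masculine nouns that also have a feminine entry."""
    fem_lemmas = {
        a.lemma for a in map(parse_analysis, feminine) if a.gender == "F"
    }
    return [
        a.lemma
        for a in map(parse_analysis, masculine)
        if a.gender == "M" and a.lemma in fem_lemmas
    ]


candidates = masculine_with_feminine_twin(REPORTED, FEMININE_SAMPLE)
```

A list produced this way would only be a starting point for manual review against a reliable dictionary, not a criterion for deletion on its own.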

arademaker commented 4 years ago

OK, what are the criteria for excluding them? This is the same point I raised in #61.

arademaker commented 4 years ago

For chega, according to https://www.dicio.com.br/chega/ it is both M and F.

arademaker commented 4 years ago

For finança, according to https://www.dicio.com.br/financa/ it is F, but the semantics of finanças does not seem to be related to the semantics of finança. Should we relate them morphologically?

arademaker commented 4 years ago

@leoalenc, I want to abandon the current split of data into files named after the original source of the data. The maintenance of the resource will be harder with the current split. Do you agree?

leoalenc commented 4 years ago

> For chega, according to https://www.dicio.com.br/chega/ it is both M and F.

Ok.

leoalenc commented 4 years ago

> @leoalenc, I want to abandon the current split of data into files named after the original source of the data. The maintenance of the resource will be harder with the current split. Do you agree?

Yes.

leoalenc commented 4 years ago

> OK, what are the criteria for excluding them? It is the same point I wrote about in #61

See my answer there.

vcvpaiva commented 4 years ago

Hello, I don't agree. I only know "chega" (as in giving someone a "chega-pra-lá") in the masculine. I don't know of a feminine "uma chega", and the examples at https://www.dicio.com.br/chega/ are all of the verb! What is the pedigree of dicio.com? Who makes it? I find it strange that the examples in the noun entry are of the verb! Regards, Valeria

On Wed, Feb 5, 2020 at 5:41 PM Leonel Figueiredo de Alencar <notifications@github.com> wrote:

> > For chega, according to https://www.dicio.com.br/chega/ it is both M and F.
>
> Ok.

-- Valeria de Paiva http://vcvpaiva.github.io/ http://www.cs.bham.ac.uk/~vdp/

leoalenc commented 4 years ago

> For finança, according to https://www.dicio.com.br/financa/ it is F but it seems that the semantics of finanças is not related to the semantics of finança... Should we relate them morphologically?

I think that one of the main purposes of morphological analysis is to provide inflected forms with a lemma, which is a pointer to dictionary entries. In the Houaiss Dictionary, for example, the keyword of the entry is “finança”. In the explanation associated with this keyword, one finds the plural form with its particular semantics. So I think there is no need to include an additional entry with the plural form as a lemma.
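To illustrate the point about lemmas as pointers to dictionary entries, here is a small sketch. The two analyses and the function name are hypothetical, chosen only to show the plural form resolving to the singular keyword; they are not quoted from the resource.

```python
# Hypothetical form-to-analysis table, for illustration only.
ANALYSES = {
    "finança": "finança+N+F+SG",
    "finanças": "finança+N+F+PL",
}


def lemma_of(form: str) -> str:
    """Return the lemma (the part before the first '+') for a known form."""
    return ANALYSES[form].split("+", 1)[0]


# Both the singular and the plural point to the same dictionary keyword.
plural_lemma = lemma_of("finanças")
singular_lemma = lemma_of("finança")
```

Under this design, the special meaning of the plural is left to the dictionary entry for the lemma rather than encoded as a separate lemma.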

leoalenc commented 4 years ago

> Hello, I don't agree. I only know "chega" (as in giving someone a "chega-pra-lá") in the masculine. I don't know of a feminine "uma chega", and the examples at https://www.dicio.com.br/chega/ are all of the verb! What is the pedigree of dicio.com? Who makes it? I find it strange that the examples in the noun entry are of the verb! Regards, Valeria

@vcvpaiva raises an important point: indeed, dicio does not seem to have a pedigree and should be used with caution, cross-checking against reliable sources. In the printed Houaiss dictionary, I find confirmation that the two-gender noun chega exists, with the sense of "reprimand".

vcvpaiva commented 4 years ago

OK, thank you! Houaiss is Houaiss. We trust it.

arademaker commented 4 years ago

OK. Since @leoalenc has already accepted the PR and merged it into master (see cad4992), I will close this issue.