legal-nlp / lei-8906

We are applying NLP to http://www.planalto.gov.br/ccivil_03/Leis/L8906.htm

constitucionalmente #45

Open vcvpaiva opened 6 years ago

vcvpaiva commented 6 years ago

"constitucionalmente", in line 7227 of https://github.com/own-pt/lei-8906-ownpt/blob/c53f645658f5f215fee5dc13587f996aaa6d0455/freeling/lei-8906.out, is still being lemmatized to "constitucional". Is this because the suffix fix was not applied in general?

we have both "constitucionalmente" in synset 00122124-r and "constitucional" in 3 synsets:

  1. 02702332-a constitutional | constitucional (of benefit to or intended to benefit your physical makeup; "constitutional walk")
  2. 00179925-a constitutional | constitucional (sanctioned by or consistent with or operating under the law determining the fundamental political principles of a government; "the constitutional right of free speech"; "constitutional government"; "constitutional guarantees")
  3. 00901345-a organic, constitutive, constituent, constitutional | orgânico, constituinte, constitucional (constitutional in the structure of something (especially your physical makeup))

but "constitucional" is in the list of missing adverbs that Bruno produced

vcvpaiva commented 6 years ago

the same thing seems to occur with "habitualmente": "habitualmente habitual RG 1", line 2657 of https://github.com/own-pt/lei-8906-ownpt/blob/c53f645658f5f215fee5dc13587f996aaa6d0455/freeling/lei-8906.out

vcvpaiva commented 6 years ago

same for "regularmente", line 5393 of the law. same for "exclusivamente", line 4733. same for "verbalmente", line 933. same for "respectivomente", line 2102. same for "trienalmente", line 10149, etc.

in the same case: abusivo, consciente, cumulativo, efetivo (line 8741)
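
A rough way to collect all such cases at once, rather than spotting them line by line, is sketched below. It assumes each line of lei-8906.out is whitespace-separated as "form lemma tag probability", which is what the quoted line "habitualmente habitual RG 1" suggests. The names `parse_line` and `suspicious_adverbs` are hypothetical, and the third sample line is made up for contrast.

```python
from typing import Iterable, List, NamedTuple, Optional, Tuple


class Token(NamedTuple):
    form: str
    lemma: str
    tag: str
    prob: float


def parse_line(line: str) -> Optional[Token]:
    """Parse one 'form lemma tag prob' line; return None if it does not fit."""
    parts = line.split()
    if len(parts) < 4:
        return None
    form, lemma, tag, prob = parts[:4]
    try:
        return Token(form, lemma, tag, float(prob))
    except ValueError:
        return None


def suspicious_adverbs(lines: Iterable[str]) -> List[Tuple[int, Token]]:
    """Return (line number, token) for RG tokens in -mente whose lemma differs from the form."""
    hits = []
    for number, line in enumerate(lines, start=1):
        token = parse_line(line)
        if token is None:
            continue
        is_adverb = token.tag.startswith("RG")
        is_mente = token.form.lower().endswith("mente")
        if is_adverb and is_mente and token.lemma.lower() != token.form.lower():
            hits.append((number, token))
    return hits


sample = [
    "habitualmente habitual RG 1",                   # quoted in this issue
    "arts. arts. RG 0.333333",                       # quoted in this issue
    "constitucionalmente constitucionalmente RG 1",  # made-up line for contrast
]

for number, token in suspicious_adverbs(sample):
    print(number, token.form, "->", token.lemma)
```

Running it over the real output file should list every adverb that is being reduced to its adjective, which could then be compared with the list of missing adverbs.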

vcvpaiva commented 6 years ago

By contrast, the following are not in OWN-PT:

  1. subsidiariamente
  2. solidariamente
  3. originariamente
  4. juridicamente

vcvpaiva commented 6 years ago

"arts. arts. RG 0.333333": "arts." should be tagged as the abbreviation of a noun, not as an adverb.

vcvpaiva commented 6 years ago

We have 4 kinds of problem:

  1. I don't understand the code, but it seems that Freeling always tries to map an adverb to its corresponding adjective, as in "respectivamente ==> respectivo", and then somehow finds neither the adjective nor the adverb. In particular, this gives us "respectivomente R???". (Note that the acute accent causes problems too: for "juridicamente ==> juridico" we need to restore the accent, "jurídico", otherwise the word doesn't exist.)
  2. It is true that OWN-PT, and PWN as well, do not have all the adverbs associated with their adjectives; this is a morphology issue.
  3. "when", "where" and others are (sometimes) considered adverbs in PT when they are not in English, so PWN does not have "when" or "where". We also get other things, such as "enquanto" and "senão", that do not look like adverbs to me but rather conjunctions or prepositions.
  4. The usual problems with prefixes apply: extrajudicial, injustificado, ilimitado.
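
To make problem 1 concrete, here is a minimal sketch of how an adjective could be recovered from a "-mente" adverb, including the missing accent. This is not Freeling's actual affix code, which I have not read; it is only an illustration under simple assumptions: strip "mente", try the masculine form when the stem ends in "a", and fall back to an accent-insensitive lookup in a word list. The function names and the tiny `lexicon` are hypothetical.

```python
import unicodedata
from typing import List, Optional, Set


def strip_accents(word: str) -> str:
    """Remove combining marks, e.g. 'jurídico' -> 'juridico'."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(c for c in decomposed if unicodedata.category(c) != "Mn")


def adjective_candidates(adverb: str) -> List[str]:
    """Candidate adjectives for a -mente adverb: the bare stem, then its masculine form."""
    suffix = "mente"
    if not adverb.endswith(suffix) or len(adverb) <= len(suffix):
        return []
    stem = adverb[: -len(suffix)]
    candidates = [stem]
    if stem.endswith("a"):
        # 'respectiva' -> 'respectivo'
        candidates.append(stem[:-1] + "o")
    return candidates


def resolve_adjective(adverb: str, lexicon: Set[str]) -> Optional[str]:
    """Return the lexicon entry matching a candidate, ignoring accents if needed."""
    by_plain = {}
    for word in sorted(lexicon):  # sorted so the result is deterministic
        by_plain.setdefault(strip_accents(word), word)
    for candidate in adjective_candidates(adverb):
        if candidate in lexicon:
            return candidate
        plain = strip_accents(candidate)
        if plain in by_plain:
            return by_plain[plain]
    return None


# Hypothetical stand-in for the adjectives available in a lexicon.
lexicon = {"respectivo", "jurídico", "habitual", "constitucional"}

for adverb in ["respectivamente", "juridicamente", "habitualmente", "subsidiariamente"]:
    print(adverb, "->", resolve_adjective(adverb, lexicon))
```

With this toy lexicon, "juridicamente" resolves to "jurídico" through the accent-insensitive lookup, while "subsidiariamente" resolves to nothing because no matching adjective is listed.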