debatable compounds #18

vcvpaiva closed 3 years ago

vcvpaiva commented 3 years ago
  1. Primeira Dama below.

'Dama' is a noun, not X, and 'primeira' here looks like an ADJ to me, not a NOUN. The whole thing does not look like a "compound" at all.

    newdoc id = n01061
    sent_id = n01061016
    text = Mas, à medida que a crítica contra Clinton como Primeira Dama se tornou mais ácida, Karel demonstrou mais empatia.
    text_en = But, as the criticism of Clinton as First Lady became more caustic, Karel became more empathetic.
    [..]
    12 Primeira NOUN NN Gender=Fem|Number=Sing 10 nmod
    13 Dama X GW 12 compound

  1. 'porta voz' is not a compound here, as far as I can see

    sent_id = n01133022
    text = Um porta voz da Harley-Davidson disse que os títulos que têm sobrevoado os meios de comunicação - 2017 ou 2020 - não são precisos.
    text_en = A Harley-Davidson spokeswoman said timelines that have been floated in the media -- 2017 or 2020 -- aren't accurate.
    1 Um DET DT Gender=Masc|Number=Sing 2 det
    2 porta NOUN NN Gender=Masc|Number=Sing 9 nsubj
    3 voz X GW 2 compound
    4-5 da
    4 de de ADP INDT 6 case
    5 a o DET Gender=Fem|Number=Sing 6 det
    6 Harley PROPN NNP Gender=Fem|Number=Sing 2 nmod SpaceAfter=No
    7 - PUNCT - 8 punct Proper=True|SpaceAfter=No
    8 Davidson PROPN NNP Foreign=Yes|Gender=Fem|Number=Sing 6 flat
    9 disse VERB VBC Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past 0 root
    10 que ADP IN 27 mark
    11 os DET DT Gender=Masc|Number=Plur 12 det
    12 títulos NOUN NN Gender=Masc|Number=Plur 27 nsubj
    13 que PRON WP 15 nsubj
    14 têm VERB VBC Aspect=Imp|Mood=Ind|Number=Plur|Person=3|Tense=Pres 15 aux
    15 sobrevoado VERB VBN Aspect=Perf 12 acl:relcl
    16 os DET DT Gender=Masc|Number=Plur 17 det
    17 meios NOUN NN Gender=Masc|Number=Plur 15 obj
    18 de ADP IN 19 case
    19 comunicação NOUN NN Gender=Fem|Number=Sing 17 nmod
    20 - PUNCT - 21 punct
    21 2017 NUM CD Gender=Masc 12 appos
    22 ou CCONJ CC 23 cc
    23 2020 NUM CD Gender=Masc 21 conj
    24 - PUNCT - 21 punct
    25 não ADV RB Polarity=Neg 27 advmod
    26 são AUX VBC Aspect=Imp|Mood=Ind|Number=Plur|Person=3|Tense=Pres 27 cop
    27 precisos ADJ JJ Gender=Masc|Number=Plur 9 ccomp SpaceAfter=No
    28 . PUNCT . 9 punct _

'voz' is definitely a noun, and "porta-voz" as a whole is a noun ('porta' is a conjugated form of the verb 'portar', to carry). Also, the translation turned the spokeswoman into a man, and "títulos" for "timelines" is lame!

vcvpaiva commented 3 years ago
  1. "norte coreano"

    newdoc id = n01143
    sent_id = n01143003
    text = Pair fez uma pegadinha ao entrar no torneio de golfe norte coreano, saiu nas notícias do Brisbane's Nine.
    text_en = "Pair prank their way into North Korean golf tournament," read Brisbane's Nine News.

    1 Pair PROPN NNP Gender=Masc|Number=Sing 2 nsubj
    2 fez VERB VBC Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past 0 root
    3 uma DET DT Gender=Fem|Number=Sing 4 det
    4 pegadinha NOUN NN Gender=Fem|Number=Sing 2 obj
    5-6 ao
    5 a a ADP INDT 7 case
    6 o o DET Gender=Masc|Number=Sing 7 det
    7 entrar VERB VB 2 xcomp
    8-9 no
    8 em em ADP INDT 10 case
    9 o o DET Gender=Masc|Number=Sing 10 det
    10 torneio NOUN NN Gender=Masc|Number=Sing 7 obl
    11 de ADP IN 12 case
    12 golfe NOUN NN Gender=Masc|Number=Sing 10 nmod
    13 norte ADJ JJ Gender=Masc|Number=Sing 10 amod
    14 coreano X GW 13 compound SpaceAfter=No
    15 , PUNCT , 16 punct
    16 saiu VERB VBC Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past 2 parataxis
    17-18 nas
    17 em em ADP INDT 19 case
    18 as o DET Gender=Fem|Number=Plur 19 det
    19 notícias NOUN NN Gender=Fem|Number=Plur 16 obl
    20-21 do
    20 de de ADP INDT 22 case
    21 o o DET Gender=Masc|Number=Sing 22 det
    22 Brisbane's PROPN NNP Gender=Masc|Number=Sing 19 nmod
    23 Nine PROPN NNP Foreign=Yes|Gender=Masc|Number=Sing 22 flat SpaceAfter=No
    24 . PUNCT . 2 punct _

This misrepresents the story: there was a pair of Australians, not someone called "Pair". A better translation: "Dupla faz pegadinha para entrar no torneio de golfe norte coreano," saiu no Brisbane's Nine News.

norte coreano is ADJ not X!

vcvpaiva commented 3 years ago

"bem-sucedidas" in

    sent_id = n01148035
    text = Nem todas as transformações na região têm sido bem sucedidas
    text_en = Not all transformations in the region have been successful.
    1 Nem ADV RB Polarity=Neg 2 advmod
    2 todas DET PDT Gender=Fem|Number=Plur 4 det:predet
    3 as DET DT Gender=Fem|Number=Plur 4 det
    4 transformações NOUN NN Gender=Fem|Number=Plur 9 nsubj
    5-6 na
    5 em em ADP INDT 7 case
    6 a o DET Gender=Fem|Number=Sing 7 det
    7 região NOUN NN Gender=Fem|Number=Sing 4 nmod
    8 têm VERB VBC Aspect=Imp|Mood=Ind|Number=Plur|Person=3|Tense=Pres 9 aux
    9 sido VERB VBN Aspect=Perf 0 root
    10 bem ADV RB 9 advmod
    11 sucedidas ADJ JJ Gender=Fem|Number=Plur 10 compound _
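The cases reported so far share a pattern: a token tagged X, or a token attached with the compound relation. A minimal sketch of a scan for that pattern follows, assuming standard 10-column, tab-separated CoNLL-U input. The function name `debatable_compounds` is hypothetical, and the sample rows pad the columns missing from the excerpts above with `_`, which is an assumption about the original file.

```python
# Sketch: list tokens tagged UPOS=X or attached with deprel=compound.
# Assumes standard 10-column, tab-separated CoNLL-U (an assumption; the
# rows quoted in this thread have lost their placeholder columns).

def debatable_compounds(conllu_text):
    """Yield (sent_id, token_id, form, upos, deprel) for candidate tokens."""
    sent_id = None
    for line in conllu_text.splitlines():
        if line.startswith("# sent_id"):
            sent_id = line.split("=", 1)[1].strip()
            continue
        if not line or line.startswith("#"):
            continue  # blank separator or other comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (e.g. 4-5) and malformed rows
        tok_id, form, upos, deprel = cols[0], cols[1], cols[3], cols[7]
        if upos == "X" or deprel == "compound":
            yield sent_id, tok_id, form, upos, deprel


# Two rows from sentence n01061016, padded with "_" (hypothetical padding).
SAMPLE = "\n".join([
    "# sent_id = n01061016",
    "\t".join(["12", "Primeira", "_", "NOUN", "NN",
               "Gender=Fem|Number=Sing", "10", "nmod", "_", "_"]),
    "\t".join(["13", "Dama", "_", "X", "GW", "_", "12", "compound", "_", "_"]),
])

hits = list(debatable_compounds(SAMPLE))
for hit in hits:
    print(*hit)
```

Matching on either condition rather than both is deliberate: the "bem sucedidas" case has an ADJ dependent, so requiring X would miss it.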

vcvpaiva commented 3 years ago

I've checked both the French and the Spanish PUDs. Primeira Dama/First Lady is not a compound in either. "portavoz" is written like so in Spanish, so it is clearly a noun. Both Spanish and French write a "tournament in North Korea", instead of making it an adjective.

vcvpaiva commented 3 years ago

Improving my translation: "Dupla faz pegadinha para entrar no torneio de golfe norte-coreano," saiu no Nine News de Brisbane.

arademaker commented 3 years ago

I believe 1c6a2c6 solves everything reported here. For norte-coreano, I kept the compound analysis.

vcvpaiva commented 3 years ago

There's no reason to keep the compound for "norte-coreano". The Spanish and French PUDs instead use "North Korea tournament", which bypasses the fact that in Portuguese we don't say "norte-coreano"; we say "da Coreia do Norte" or "coreano do norte".

this is a bad Anglicism.

arademaker commented 3 years ago

@vcvpaiva I didn't understand your argument. I believe versions 1 and 2 below are perfectly fine:

  1. torneio de golfe norte-coreano
  2. torneio de golfe da Coreia do Norte
  3. *torneio de golfe coreano do norte

The one I would call invalid is 3.

Regarding the use of compound, we have some alternatives:

  1. consider norte-coreano as a single word
  2. *keep the norte-coreano multi-word and make amod(torneio,norte) and amod(torneio,coreano) both as ADJ
  3. keep the norte-coreano multi-word and make amod(coreano,norte) and amod(torneio,coreano) both ADJ
  4. keep the norte-coreano multi-word and make compound(coreano,norte) and amod(torneio,coreano) both ADJ

I like the way it is now. But I am open to considering alternatives if we have arguments to change.
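The four analysis alternatives listed above can be written out as dependency arcs to make the differences easier to compare. This is only a sketch: the tuple layout and the helper names `arcs` and `uses_compound` are hypothetical, and for alternative 1 the ADJ tag and the amod relation are assumptions, since the list only says "single word".

```python
# Each alternative as rows of (dependent, UPOS, head, deprel).
# Alternative 1's tag and relation are assumptions, not stated in the thread.
ALTERNATIVES = {
    1: [("norte-coreano", "ADJ", "torneio", "amod")],
    2: [("norte", "ADJ", "torneio", "amod"),
        ("coreano", "ADJ", "torneio", "amod")],
    3: [("norte", "ADJ", "coreano", "amod"),
        ("coreano", "ADJ", "torneio", "amod")],
    4: [("norte", "ADJ", "coreano", "compound"),
        ("coreano", "ADJ", "torneio", "amod")],
}


def arcs(alt):
    """Render one alternative in the deprel(head,dependent) notation used above."""
    return [f"{rel}({head},{dep})" for dep, _upos, head, rel in ALTERNATIVES[alt]]


def uses_compound(alt):
    """True if the alternative keeps a compound relation."""
    return any(rel == "compound" for _dep, _upos, _head, rel in ALTERNATIVES[alt])


for number in sorted(ALTERNATIVES):
    print(number, ", ".join(arcs(number)))
```

Written this way, alternatives 3 and 4 differ only in the label on the arc from "coreano" to "norte", while alternative 2 attaches both words directly to "torneio".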

vcvpaiva commented 3 years ago

Spanish translation: La jugarreta de la pareja les dio acceso al torneo de golf en Corea del Norte, informó la cadena Nine News de Brisbane.

Suggested Portuguese translation: "Dupla faz pegadinha para entrar no torneio de golfe da Coreia do Norte," noticiou o Nine News de Brisbane.

or "Dupla faz pegadinha para entrar no torneio de golfe norte-coreano," noticiou o Nine News de Brisbane.

Both work for me, as long as "norte-coreano" is a single word. However, "norte coreano" as a compound does not work for me.