UniversalDependencies / UD_Portuguese-Bosque

This is the Universal Dependencies (UD) Portuguese treebank.

cases of `no` that should be a contraction `em+o` #288

arademaker closed this issue 4 years ago

arademaker commented 4 years ago

This is a follow-up to #280.

The remaining cases are:

```
% awk '$0 ~ /sent_id/ {sent=$0} $1 ~ /^[0-9]+$/ && $3 ~ /^no$/ {print sent,$0}' *.conllu
# sent_id = CP41-1 1    No  no  ADP PRP|@ADVL>  _   13  cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP76-3 1    No  no  ADP PRP|@ADVL>  _   7   case    _   MWE=No_que_se_refere
# sent_id = CP77-7 1    No  no  ADP <sam->|PRP|@ADVL>   _   2   case    _   MWE=No_caso_de
# sent_id = CP94-4 1    No  no  ADP PRP|@ADVL>  _   11  cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP115-5 1   No  no  ADP PRP|@ADVL>  _   6   cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP123-7 1   No  no  ADP PRP|@ADVL>  _   2   case    _   MWE=No_caso_de
# sent_id = CP181-4 1   No  no  ADP PRP|@ADVL>  _   5   cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP209-8 1   No  no  ADP PRP|@ADVL>  _   7   cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP211-2 1   No  no  ADP PRP|@ADVL>  _   15  cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP322-1 1   No  no  ADP PRP|@ADVL>  _   4   cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP357-3 1   No  no  ADP PRP|@ADVL>  _   6   cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP369-1 1   No  no  ADP PRP|@ADVL>  _   20  cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP493-6 1   No  no  ADP <sam->|PRP|@ADVL>   _   2   case    _   MWE=No_caso_de
# sent_id = CP584-10 1  No  no  CCONJ   PRP|@ADVL>  _   4   cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP713-3 1   No  no  ADP PRP|@ADVL>  _   12  cc  _   MWEPOS=CCONJ
# sent_id = CP813-5 1   No  no  CCONJ   PRP|@ADVL>  _   6   cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP813-9 1   No  no  CCONJ   PRP|@ADVL>  _   6   cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP826-2 1   No  no  CCONJ   PRP|@ADVL>  _   15  cc  _   MWE=No_entanto
# sent_id = CP849-2 1   No  no  CCONJ   PRP|@ADVL>  _   6   cc  _   MWE=No_entanto
# sent_id = CP910-3 1   No  no  CCONJ   PRP|@ADVL>  _   9   cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP943-2 1   No  no  ADP PRP|@ADVL>  _   41  cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP968-3 1   No  no  ADP PRP|@ADVL>  _   16  cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP971-1 1   No  no  ADP <sam->|PRP|@ADVL>   _   2   case    _   MWE=No_decurso_de
# sent_id = CP973-1 1   No  no  ADP PRP|@ADVL>  _   7   cc  _   MWE=No_entanto|MWEPOS=CCONJ
# sent_id = CP993-3 1   No  no  ADP PRP|@ADVL>  _   4   case    _   MWE=No_que_se_refere
# sent_id = CP1002-4 1  No  no  CCONJ   PRP|@ADVL>  _   4   cc  _   MWE=No_entanto|MWEPOS=CCONJ
```

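
For anyone who wants to group these hits rather than read them row by row, here is a minimal Python sketch of the same filter as the awk command above: it remembers the last `sent_id` comment and reports every basic token (integer ID) whose LEMMA is exactly `no`. The function names `find_no_tokens`, `mwe_of` and `tally_mwe` are made up for this illustration, the sketch assumes tab-separated CoNLL-U with ten columns, and the `EXAMPLE-1` sentence in the sample is hypothetical (it shows an already-split `em+o` contraction, which the filter skips). The other sample rows are copied from the listing above.

```python
from collections import Counter

def find_no_tokens(lines):
    """Yield (sent_id, columns) for each basic token whose LEMMA is exactly 'no'."""
    sent_id = None
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("# sent_id"):
            sent_id = line.split("=", 1)[1].strip()
            continue
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[2] == "no":
            yield sent_id, cols

def mwe_of(cols):
    """Return the MWE=... value from the MISC column, or None if it is absent."""
    pairs = dict(item.split("=", 1) for item in cols[9].split("|") if "=" in item)
    return pairs.get("MWE")

def tally_mwe(hits):
    """Count hits per MWE value, so the remaining cases can be reviewed by group."""
    return Counter(mwe_of(cols) for _, cols in hits)

# Three rows from the listing above, plus one hypothetical split contraction.
SAMPLE = """\
# sent_id = CP41-1
1\tNo\tno\tADP\tPRP|@ADVL>\t_\t13\tcc\t_\tMWE=No_entanto|MWEPOS=CCONJ

# sent_id = CP77-7
1\tNo\tno\tADP\t<sam->|PRP|@ADVL>\t_\t2\tcase\t_\tMWE=No_caso_de

# sent_id = CP713-3
1\tNo\tno\tADP\tPRP|@ADVL>\t_\t12\tcc\t_\tMWEPOS=CCONJ

# sent_id = EXAMPLE-1
1-2\tNo\t_\t_\t_\t_\t_\t_\t_\t_
1\tEm\tem\tADP\t_\t_\t3\tcase\t_\t_
2\to\to\tDET\t_\t_\t3\tdet\t_\t_
""".splitlines()

if __name__ == "__main__":
    hits = list(find_no_tokens(SAMPLE))
    for mwe, count in tally_mwe(hits).items():
        print(mwe, count)
```

To run it over the treebank, pass an open `.conllu` file instead of `SAMPLE`. Rows such as CP713-3, which carry `MWEPOS` but no `MWE` key, end up under `None`, which makes that kind of inconsistency easy to spot.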
arademaker commented 4 years ago

All errors resolved.