Open arademaker opened 3 years ago
% awk '$0 ~ /^[0-9]/ && $8 ~ /case/ {print $4}' *.conllu | sort | uniq -c | sort -nr 32675 ADP 88 ADV <- http://match.grew.fr/?corpus=UD_Portuguese-Bosque@dev&custom=6170cd3adde6e 3 DET 2 SCONJ <- http://match.grew.fr/?corpus=UD_Portuguese-Bosque@dev&custom=6170c9fce1534 % awk '$0 ~ /^[0-9]/ && $8 ~ /mark/ {print $4}' *.conllu | sort | uniq -c | sort -nr 5161 SCONJ 169 ADV 68 ADP <- http://match.grew.fr/?corpus=UD_Portuguese-Bosque@dev&custom=6170c9959ff7a 10 DET <- 'uma vez que' http://match.grew.fr/?corpus=UD_Portuguese-Bosque@dev&custom=6170cc76e7eb1 3 VERB <- http://match.grew.fr/?corpus=UD_Portuguese-Bosque@dev&custom=6170ca763c7a3
Originally posted by @arademaker in https://github.com/LR-POR/tools/issues/18#issuecomment-948178645
Originally posted by @arademaker in https://github.com/LR-POR/tools/issues/18#issuecomment-948178645