amir-zeldes / gum

Repository for the Georgetown University Multilayer Corpus (GUM)
https://gucorpling.org/gum/
Other
89 stars 50 forks source link

numeric lemmas should be NUM #147

Closed nschneid closed 11 months ago

nschneid commented 1 year ago
WARN: numeric lemma '9/11' is not NUM in GUM_speech_austria-3 @ token 4 (years -> 9/11) en_gum-ud-test.conllu
WARN: numeric lemma '8' is not NUM in GUM_bio_holt-29 @ token 12 (District -> 8) en_gum-ud-train.conllu
WARN: numeric lemma '8' is not NUM in GUM_bio_holt-30 @ token 10 (inclusion -> 8) en_gum-ud-train.conllu
WARN: numeric lemma '6' is not NUM in GUM_bio_holt-31 @ token 14 (District -> 6) en_gum-ud-train.conllu
WARN: numeric lemma '9/11' is not NUM in GUM_interview_messina-29 @ token 8 (experienced -> 9/11) en_gum-ud-train.conllu
WARN: numeric lemma '30' is not NUM in GUM_interview_onion-57 @ token 10 (Studio -> 30) en_gum-ud-train.conllu
WARN: numeric lemma '430' is not NUM in GUM_voyage_isfahan-9 @ token 4 (km -> 430) en_gum-ud-train.conllu
WARN: numeric lemma '#13' is not NUM in GUM_voyage_phoenix-46 @ token 3 (goes -> #13) en_gum-ud-train.conllu
WARN: numeric lemma '#1' is not NUM in GUM_voyage_phoenix-47 @ token 20 (catch -> #1) en_gum-ud-train.conllu
WARN: numeric lemma '#44' is not NUM in GUM_voyage_phoenix-47 @ token 42 (bus -> #44) en_gum-ud-train.conllu
WARN: numeric lemma '169' is not NUM in GUM_voyage_tulsa-45 @ token 26 (US -> 169) en_gum-ud-train.conllu
WARN: numeric lemma '75' is not NUM in GUM_voyage_tulsa-45 @ token 42 (US -> 75) en_gum-ud-train.conllu
WARN: numeric lemma '51' is not NUM in GUM_voyage_tulsa-45 @ token 45 (Hwy -> 51) en_gum-ud-train.conllu
amir-zeldes commented 1 year ago

Thanks, will fix