UniversalDependencies / UD_English-EWT

English data
Creative Commons Attribution Share Alike 4.0 International
197 stars 41 forks source link

Missing Abbr=Yes on initialisms #490

Closed rhdunn closed 7 months ago

rhdunn commented 7 months ago

The following initialisms are missing Abbr=Yes annotations:

ERROR: Sentence email-enronsent28_02-0009 token 10 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence answers-20111108111112AAAjhoy_ans-0008 token 8 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence email-enronsent30_01-0030 token 8 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence email-enronsent08_02-0055 token 4 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence email-enronsent24_01-0038 token 7 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence email-enronsent24_01-0095 token 7 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence email-enronsent16_01-0062 token 6 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence email-enronsent10_01-0030 token 13 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence email-enronsent10_01-0036 token 6 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'asap', expected 'asap'
ERROR: Sentence email-enronsent43_01-0128 token 10 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence answers-20111108105520AA73Axw_ans-0058 token 4 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence reviews-071650-0006 token 8 -- RB lemma 'ASAP' does not match lowercase-form applied to form 'ASAP', expected 'asap'
ERROR: Sentence email-enronsent13_01-0016 token 4 -- VBN lemma 'OK' does not match past-participle-verb applied to form 'OK'd', expected 'ok'

The following is incorrectly lemmatized as well:

ERROR: Sentence newsgroup-groups.google.com_magicworld_04c89d43ff4fd6ea_ENG_20050104_152000-0101 token 1 -- VB lemma 'sm' does not match lowercase-form applied to form 'SMS', expected 'sms'