magarw / limit

LIMIT: Language Identification, Misidentification, and Translation
4 stars 1 forks source link

gcld3 maps #1

Open kargaranamir opened 1 year ago

kargaranamir commented 1 year ago

It just happened to catch my eye: https://github.com/magarw/limit/blob/main/data/language-id/mappings_gcld3.json

"kk": "kax", in the cld3 maps i think is not correct. source: https://iso639-3.sil.org/code/kaz

kargaranamir commented 1 year ago

You may also want to add az and be. I suggested here (https://github.com/google/cld3/issues/86), it seems it's not on the readme, but CLD3 supports it.

magarw commented 10 months ago

Hi @kargaranamir, thank you for pointing this out. We saw your message when you posted it but thought it best to not respond since the paper was still under review at EMNLP'23 and we were under the anonymity period.

I'll rectify this and also check the rest of the maps for any inconsistencies that we may have missed. Thanks again! I'll leave the issue open till I push the fix.

kargaranamir commented 10 months ago

Hi, thank you so much for your response. I apologize for not being aware that the paper was under review. Congratulations on the acceptance of your paper.