google-research / url-nlp

195 stars 23 forks source link

Kurdish #1

Open Sarchia opened 2 years ago

Sarchia commented 2 years ago

Hello,

Kurdish should be defined as a macro language on the list. Also, Northern Kurdish (kmr) should be included on the list, spoken in Turkey, Iran, Iraq, and Syria by 15-20 million speakers.

And better name representations for Gurani and Laki are Gorani Kurdish and Laki Kurdish.

icaswell commented 1 year ago

Hi Sarchia!

Currently we only have Sorani Kurdish and Zazaki in this list. We do not yet have Kurmanji, Gorani, or Laki, but we are steadily expanding coverage of this dataset. Since we have these specific languages rather than the macrolanguage, we list the languages individually, not as a macrolanguage.