Helsinki-NLP / OPUS-ingest

add Kurdish BLARK dataset #35

Open jorgtied opened 2 months ago

jorgtied commented 2 months ago

Would you please add that dataset for Central Kurdish (ckb) and Northern Kurdish (kmr) - English? Thanks.

https://github.com/KurdishBLARK/InterdialectCorpus/tree/master

This project is openly available. Please consider adding it to OPUS. It contains the following data:

- Kurmanji (or Northern Kurdish) - English: https://github.com/KurdishBLARK/InterdialectCorpus/tree/master/KMR-ENG
- Sorani (or Central Kurdish) - English: https://github.com/KurdishBLARK/InterdialectCorpus/tree/master/CKB-ENG
- Sorani (or Central Kurdish) - Kurmanji (Northern Kurdish): https://github.com/KurdishBLARK/InterdialectCorpus/tree/master/CKB-KMR

Here is the article: https://arxiv.org/abs/2010.01554