IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
261 stars 61 forks source link

Create dataset loader for Cross-lingual Outline- based Dialogue (COD) #304

Closed SamuelCahyawijaya closed 1 year ago

SamuelCahyawijaya commented 1 year ago

NusaCatalogue: https://indonlp.github.io/nusa-catalogue/card.html?cod_id

Dataset cod_id
Description Cross-lingual Outline-based Dialogue dataset (termed COD) enables natural language under- standing, dialogue state tracking, and end-to-end dialogue modelling and evaluation in 4 diverse languages: Arabic, Indonesian, Russian, and Kiswahili. The data covers multi domain instances, e.g., bank, travel, weather, movies, music.
License Unknown
madenindya commented 1 year ago

self-assign