SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0

Create dataset loader for TrueVoice Intent #680

Open SamuelCahyawijaya opened 1 month ago

Dataloader name: truevoice_intent/truevoice_intent.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?truevoice_intent

Dataset: truevoice_intent
Description: TrueVoice Intent contains transcripts of TrueVoice customer service phone calls, each labelled with an intent, including billing and payment, promotions, internet, other queries, international dialing, true money, and lost and stolen. This benchmark dataset is part of the PyThaiNLP benchmarks. However, the respective GitHub repository seems to have been made non-public.
Subsets: -
Languages: tha
Tasks: Intent Classification
License: Apache License 2.0 (apache-2.0)
Homepage: https://github.com/PyThaiNLP/classification-benchmarks
HF URL: -
Paper URL: https://aclanthology.org/2023.nlposs-1.4/
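
As a starting point for whoever picks this up, here is a minimal sketch of how the intent labels from the description might be declared and attached to a transcript. It is not the actual loader: the label strings are copied from the description above and may be neither exhaustive nor spelled as in the data files, and `INTENT_LABELS`, `LABEL2ID`, `to_text_example`, and the `{id, text, label}` layout are hypothetical names chosen for illustration.

```python
# Intent names copied from the issue description. Their exact spelling,
# order, and completeness in the real dataset files are assumptions.
INTENT_LABELS = [
    "billing and payment",
    "promotions",
    "internet",
    "other queries",
    "international dialing",
    "true money",
    "lost and stolen",
]

# Map each intent name to an integer id, in list order.
LABEL2ID = {label: idx for idx, label in enumerate(INTENT_LABELS)}


def to_text_example(idx, transcript, intent):
    """Build one example in a hypothetical {id, text, label} layout."""
    if intent not in LABEL2ID:
        raise ValueError(f"unknown intent: {intent!r}")
    return {"id": str(idx), "text": transcript, "label": intent}
```

Rejecting unknown intents early should make it easier to notice if the real files use label strings that differ from the ones listed in the description.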