Closed SamuelCahyawijaya closed 2 years ago
Hi @SamuelCahyawijaya 👋
I need your guidance on this one. What's the suitable nusantara schema/task for this dataset? Is it SELF_SUPERVISED_PRETRAINING
?
I have only used this dataset for training GPT here.
Hi @ilhamfp 👋. Thanks for contributing!
Yeah, I agree, since the data is unlabelled, I think the most suitable one is to use it for SELF_SUPERVISED_PRETRAINING
.
https://indonlp.github.io/nusa-catalogue/card.html?indopuisi