Closed yana-xuyan closed 2 years ago
@SamuelCahyawijaya I paste the chat history below: @yana-xuyan : Thank you for your contribution! For "tiociu" and "khek", since there is no ISO code for those two, let's just keep it in this way.
On the other hand, there seems to be some conflicting files on your commit, can you merge the latest update from the master branch to your branch and push again? Thank you!
/test dataset=korpus_nusantara subset_id=korpus_nusantara_ind_day_nusantara_t2t
Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071573252
Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071574522
/test dataset=korpus_nusantara subset_id=korpus_nusantara_ind_day
Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071611779
/test dataset=korpus_nusantara subset_id=korpus_nusantara_khek_ind
Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071623556
@yana-xuyan : Thanks for contributing, the dataset looks good to me! Approving this PR!
Please name your PR after the issue it closes. You can use the following line: "Closes #ISSUE-NUMBER" where you replace the ISSUE-NUMBER with the one corresponding to your dataset.
Checkbox
nusantara/nusa_datasets/my_dataset/my_dataset.py
(please use only lowercase and underscore for dataset naming)._CITATION
,_DATASETNAME
,_DESCRIPTION
,_HOMEPAGE
,_LICENSE
,_URLs
,_SUPPORTED_TASKS
,_SOURCE_VERSION
, and_NUSANTARA_VERSION
variables._info()
,_split_generators()
and_generate_examples()
in dataloader script.BUILDER_CONFIGS
class attribute is a list with at least oneNusantaraConfig
for the source schema and one for a nusantara schema.datasets.load_dataset
function.python -m tests.test_nusantara --path=nusantara/nusa_datasets/my_dataset/my_dataset.py
.