Closed IvanHalimP closed 2 years ago
/test dataset=toxicity_200 subset_id=toxicity_200_jav
Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071201693
Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071203988
/test dataset=toxicity_200 subset_id=toxicity_200_bjn
Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071303745
/test dataset=toxicity_200 subset_id=toxicity_200_jav
Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071951339
…urce only scheme)
Please name your PR after the issue it closes. You can use the following line: "Closes #ISSUE-NUMBER" where you replace the ISSUE-NUMBER with the one corresponding to your dataset.
Checkbox
nusantara/nusa_datasets/my_dataset/my_dataset.py
(please use only lowercase and underscore for dataset naming)._CITATION
,_DATASETNAME
,_DESCRIPTION
,_HOMEPAGE
,_LICENSE
,_URLs
,_SUPPORTED_TASKS
,_SOURCE_VERSION
, and_NUSANTARA_VERSION
variables._info()
,_split_generators()
and_generate_examples()
in dataloader script.BUILDER_CONFIGS
class attribute is a list with at least oneNusantaraConfig
for the source schema and one for a nusantara schema.datasets.load_dataset
function.python -m tests.test_nusantara --path=nusantara/nusa_datasets/my_dataset/my_dataset.py
.