[x] Confirm that this PR is linked to the dataset issue.
[x] Create the dataloader script seacrowd/sea_datasets/ind_proner/ind_proner.py (please use only lowercase and underscore for dataset naming).
[x] Provide values for the _CITATION, _DATASETNAME, _DESCRIPTION, _HOMEPAGE, _LICENSE, _URLs, _SUPPORTED_TASKS, _SOURCE_VERSION, and _SEACROWD_VERSION variables.
[x] Implement _info(), _split_generators() and _generate_examples() in dataloader script.
[x] Make sure that the BUILDER_CONFIGS class attribute is a list with at least one SEACrowdConfig for the source schema and one for a seacrowd schema.
[x] Confirm dataloader script works with datasets.load_dataset function.
[ ] Confirm that your dataloader script passes the test suite run with python -m tests.test_seacrowd seacrowd/sea_datasets/ind_proner/ind_proner.py.
[x] If my dataset is local, I have provided an output of the unit-tests in the PR (please copy paste). This is OPTIONAL for public datasets, as we can test these without access to the data files.
The dataloader script does not pass the test suite because this uses a modified naming convention for loading subsets: ind_proner_automatic_source, ind_proner_manual_source, ind_proner_automatic_l1_seacrowd_seq_label, ind_proner_manual_l1_seacrowd_seq_label, ind_proner_automatic_l2_seacrowd_seq_label, ind_proner_manual_l2_seacrowd_seq_label. This was previously discussed in the #350 issue ticket.
Closes #350.
Checkbox
seacrowd/sea_datasets/ind_proner/ind_proner.py
(please use only lowercase and underscore for dataset naming)._CITATION
,_DATASETNAME
,_DESCRIPTION
,_HOMEPAGE
,_LICENSE
,_URLs
,_SUPPORTED_TASKS
,_SOURCE_VERSION
, and_SEACROWD_VERSION
variables._info()
,_split_generators()
and_generate_examples()
in dataloader script.BUILDER_CONFIGS
class attribute is a list with at least oneSEACrowdConfig
for the source schema and one for a seacrowd schema.datasets.load_dataset
function.python -m tests.test_seacrowd seacrowd/sea_datasets/ind_proner/ind_proner.py
.The dataloader script does not pass the test suite because this uses a modified naming convention for loading subsets:
ind_proner_automatic_source
,ind_proner_manual_source
,ind_proner_automatic_l1_seacrowd_seq_label
,ind_proner_manual_l1_seacrowd_seq_label
,ind_proner_automatic_l2_seacrowd_seq_label
,ind_proner_manual_l2_seacrowd_seq_label
. This was previously discussed in the #350 issue ticket.