Closed jakobchwastek closed 8 months ago
we should integrate dataset.save() and dataset.load(preprocessed=True) into our integration test
Thank you for creating this bug report!
we should integrate: dataset.save() dataset.load(preprocessed=True) to our integration-test
Actually, these are functional tests you are talking about, and they should go into a new file called dataset_processing_test.py in the functional tests directory.
Of course adding it to the integration tests won't hurt, but remember:
Wouldn't it be necessary to store the database test files under their proper names, as present in the actual dataset? Otherwise I would need to adjust the filename format and thus leave out parsing the attributes in the filenames.
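To illustrate why the test file names matter, here is a minimal sketch of parsing attributes out of a filename with named regex groups. The pattern, the attribute names (subject_id, trial_id) and the helper parse_filename are hypothetical and not taken from the dataset definition discussed here; the point is only that a test file whose name does not follow the format yields no attributes.

```python
import re

# Hypothetical filename format: subject id and trial id are encoded in the name.
FILENAME_PATTERN = re.compile(r"(?P<subject_id>\d+)_(?P<trial_id>\d+)\.csv")


def parse_filename(filename):
    """Return the attributes encoded in a filename as a dict of strings."""
    match = FILENAME_PATTERN.fullmatch(filename)
    if match is None:
        # A file that is not named like the real dataset files cannot be parsed.
        raise ValueError(f"filename does not follow the expected format: {filename!r}")
    return match.groupdict()


print(parse_filename("012_3.csv"))  # {'subject_id': '012', 'trial_id': '3'}
```

A test file named, say, test.csv would raise ValueError here, which is why keeping the real naming scheme lets the tests exercise the attribute parsing too.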
Current Behavior
When loading preprocessed data using the Dataset.load() method with the preprocessed=True flag, I encounter a SchemaFieldNotFoundError related to the trialId column. The preprocessed data already has renamed columns, and the error occurs because the renaming operation is performed again, regardless of whether the data is preprocessed or not.
Expected Behavior
The preprocessed data should be successfully loaded without any errors.
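The failure mode and one possible guard can be sketched in plain Python. This is not the library's implementation: rename_columns and load_columns are hypothetical stand-ins, the target name trial_id and the columns x and y are assumptions, and KeyError stands in for the SchemaFieldNotFoundError reported in this issue.

```python
def rename_columns(columns, column_map):
    """Rename columns strictly: fail if a source column is absent."""
    missing = [old for old in column_map if old not in columns]
    if missing:
        # Stand-in for the schema error raised by a strict rename.
        raise KeyError(f"columns not found: {missing}")
    return [column_map.get(name, name) for name in columns]


def load_columns(columns, column_map, preprocessed=False):
    """Apply the column map only to raw data; preprocessed data is already renamed."""
    if preprocessed:
        return list(columns)
    return rename_columns(columns, column_map)


column_map = {"trialId": "trial_id"}  # hypothetical mapping
raw = ["trialId", "x", "y"]           # hypothetical raw columns

saved = load_columns(raw, column_map)  # first load renames trialId
reloaded = load_columns(saved, column_map, preprocessed=True)  # no second rename
print(reloaded)  # ['trial_id', 'x', 'y']
```

Calling rename_columns(saved, column_map) a second time raises, because trialId no longer exists after the first rename; skipping the rename for preprocessed data avoids that.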
Minimum Acceptance Criteria
dataset.load(preprocessed=True) can correctly load preprocessed data without column renaming issues.
Failure Information (for bugs)
Steps to Reproduce
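1. Preprocess the dataset using pix2deg and pos2vel.
2. Save the preprocessed data using dataset.save().
3. Load the preprocessed data using dataset.load(preprocessed=True).
Code to Reproduce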
Error Log
Relevant Code Snippets
The dataset's DatasetDefinition describes a column_map for renaming columns in the raw input data.
I tested by running the code with the column map removed, and it worked:
Context
Checklist