sensein / b2aiprep

Apache License 2.0
5 stars 6 forks source link

BIDS-like format #52

Closed alistairewj closed 4 months ago

alistairewj commented 4 months ago

This is an initial PR for a BIDS-like format for the data. This PR adds (1) a prepare module which reformats data into BIDS-like data structure, and (2) a BIDSDataset and VBAIDataset class which provide utilities for loading data in from this format.

The conversion can be run with:

b2aiprep-cli redcap2bids bridge2ai_voice_data.csv --outdir output --audiodir audio

Once that's done, the data is in the output folder. See the tutorial.ipynb for an example of how to use the dataset classes for loading in dataframes from this format.

alistairewj commented 4 months ago

Interesting! I didn't expect us to have additional columns in the F2F one. I can make that work. We actually need to have a better idea of the RedCap export in general so that it is somewhat reproducible.

Rahul-Brito commented 4 months ago

Ahhh i see what happened, for the F2F they removed columns that the ethics team flagged as having identifiable information. I expect that these columns could change for each release since that process seems to be ever evolving fyi @alistairewj @ibevers

alistairewj commented 4 months ago

Thanks! I addressed most of your concerns. I haven't added tests yet but I am also not sure we will keep this dataset API yet, it merits further discussion. My use of the word "questionnaire" in the dataset API is probably wrong.