Composed of T2-sag, T2-ax and T2*-ax data per patient.
~54 patients
already under git-annex of data.neuro.polymtl.ca
We can assume each patient has the three data (ie: no missing contrast).
EDIT: I've noticed that in some subjects, the T2-ax and T2*-ax do not overlap (or minimally), yielding effectively 2 contrasts only for MS lesion segmentation. So maybe we should consider 2 contrasts (not 3) per patient? --> TO DISCUSS
Contrasts need to be co-registered. In order to always work in a fixed resolution, we could maybe consider straightening all contrasts, and interpolating them to a 0.5mm iso resolution. This target resolution is a reasonable tradeoff between required resolution (for lesion segmentation) and computation time. Also, the cord segmentation routine will resample the input image to 0.5mm anyway, so we don't introduce additional interpolation errors.
We could also consider registering the data to the PAM50 template, but this will require additional info, which might not be required: ie: vertebral labeling. Moreover, reg to template is a longer procedure than simply straightening the data
cons: a non-linear transformation will be required --> might alter the integrity of the data (in addition to being slower than not doing it)
Proposal
Resample to 0.5mm iso
Segment cords
We should probably deactivate the resampling to 0.5mm (since they are already at 0.5mm)
Dataset
Notable info about the Karolinska dataset:
Creating ground truth
See discussion https://github.com/ivadomed/pipeline-ms-lesion/issues/2
Preprocessing
Discussion
Contrasts need to be co-registered. In order to always work in a fixed resolution, we could maybe consider straightening all contrasts, and interpolating them to a 0.5mm iso resolution. This target resolution is a reasonable tradeoff between required resolution (for lesion segmentation) and computation time. Also, the cord segmentation routine will resample the input image to 0.5mm anyway, so we don't introduce additional interpolation errors.
We could also consider registering the data to the PAM50 template, but this will require additional info, which might not be required: ie: vertebral labeling. Moreover, reg to template is a longer procedure than simply straightening the data
Proposal
Data management
Discussion
Where should we store the preprocessed data?
sct-testing-data/derivatives
Proposal
Go with separate new version-tracked bundled preprocessed dataset
Training