rpomponio / neuroHarmonize

Harmonization tools for multi-site neuroimaging analysis. Implemented as a python package. Harmonization of MRI, sMRI, dMRI, fMRI variables with support for NIFTI images. Complements the work in Neuroimage by Pomponio et al. (2019).
https://pypi.org/project/neuroHarmonize/
MIT License
82 stars 28 forks source link

Add capability to handle categorical covariates #29

Open melhemr opened 1 year ago

rpomponio commented 1 year ago

@AbdulkadirA 's comments are all very relevant. I'm guessing some internal data structure will need to keep track of the original category levels and the encoding (i.e., the way values were mapped to numbers). Then, the apply stage will need to compare the new data to the original data and reconcile differences. The Sci-Kit Learn method preprocessing.OneHotEncoder could be useful here.

Sorry I'm just getting around to seeing this. I greatly appreciate your work!