manifoldai / merf

Mixed Effects Random Forest
MIT License
219 stars, 52 forks

Longitudinal Data #67

Closed: simonprovost closed this issue 1 year ago

simonprovost commented 1 year ago

Dear Authors,

Thank you so much for the MERF approach. I would like to know whether MERF can be used with longitudinal data laid out as described below. I believe it can, since several websites suggest so, but I could not find any examples.

Here is a very simplified description of the data.

The only non-longitudinal feature is the patient's name; the others are recorded longitudinally, with the suffixes _1, _2, and _3 designating waves (time points) one, two, and three (with, e.g., a one-year gap between waves). The death column is the binary class variable for predicting mortality. From what I have seen in #11, classification is not really supported, is that right? The only workaround I can see is rounding the regressor's predicted probability up or down, but I am not sure that is ideal.

The data representation:

patient_name, age_1, biomarkerX_1, smoke_1, age_2, biomarkerX_2, smoke_2, age_3, biomarkerX_3, smoke_3, death
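To make the question concrete: as far as I understand, MERF's `fit` takes a cluster label for every row, so I assume a wide table like this would first have to be reshaped to one row per patient per wave. A minimal sketch of that reshaping with pandas follows; the values are made up for illustration, and using `patient_name` as the cluster and repeating `death` on every wave are my assumptions, not something the MERF documentation confirms.

```python
import pandas as pd

# Made-up values, purely for illustration; column names follow the layout above.
wide = pd.DataFrame({
    "patient_name": ["A", "B"],
    "age_1": [60, 71], "biomarkerX_1": [1.2, 0.8], "smoke_1": [1, 0],
    "age_2": [61, 72], "biomarkerX_2": [1.4, 0.9], "smoke_2": [1, 0],
    "age_3": [62, 73], "biomarkerX_3": [1.7, 0.7], "smoke_3": [0, 0],
    "death": [1, 0],
})

# Reshape to long format: one row per (patient, wave).
# Columns without a wave suffix (here: death) are repeated on every row.
long_df = (
    pd.wide_to_long(
        wide,
        stubnames=["age", "biomarkerX", "smoke"],
        i="patient_name",
        j="wave",
        sep="_",
    )
    .reset_index()
    .sort_values(["patient_name", "wave"])
    .reset_index(drop=True)
)

# Assumed mapping onto a mixed-effects setup (hypothetical, not confirmed):
clusters = long_df["patient_name"]             # one cluster per patient
X = long_df[["age", "biomarkerX", "smoke"]]    # fixed-effect features
y = long_df["death"]                           # outcome, constant within a patient
```

With three waves per patient this gives three rows per patient, each carrying the same `death` value, which is part of why I am unsure whether this is the intended way to use MERF for a per-patient outcome.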

Cheers,

simonprovost commented 1 year ago

Dear Authors, you appear to be busy. I am closing this for now and am happy to discuss it further when you have more time. Cheers.