Closed jackwasey closed 9 years ago
This is now partially implemented (v0.6dev): data with one row per patient can be used for both ICD-9 codes and comorbidities. I'm not sure it would be helpful to support this without a patient identifier, so I'm closing this as complete.
As pointed out by @gforge, the current code relies on a visitId per row, and one row per ICD-9 code. This is the primary structure of the data I have been using.
An alternative layout is one row per visit (with or without an ID field), with multiple ICD-9 codes listed across the columns. This would be presented as a list of lists, or as a data frame with blank (missing) values where a row has fewer than the maximum number of ICD-9 codes. The data I am using caps at 30 codes per visit.
I've already written the code for this in C++, but it needs testing.
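To make the two layouts concrete, here is a minimal sketch of converting the wide layout (one row per visit, codes across columns) into the long layout (one row per ICD-9 code). This is not the C++ mentioned above: `WideVisit`, `LongRow`, and `wideToLong` are hypothetical names for illustration only. It assumes blank strings mark unused code columns, and that a row without an ID can be given its 1-based row position as a stand-in identifier.

```cpp
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical types for illustration; not the package's actual API.
struct WideVisit {
    std::string visitId;             // empty when the data has no ID field
    std::vector<std::string> codes;  // blank strings mark unused columns
};

// One long-format row: (visitId, ICD-9 code).
using LongRow = std::pair<std::string, std::string>;

// Flatten wide rows into one row per (visit, code), skipping blanks.
std::vector<LongRow> wideToLong(const std::vector<WideVisit>& visits) {
    std::vector<LongRow> out;
    for (std::size_t i = 0; i < visits.size(); ++i) {
        // Assumption: fall back to the 1-based row position when no ID is given.
        const std::string id = visits[i].visitId.empty()
                                   ? std::to_string(i + 1)
                                   : visits[i].visitId;
        for (const std::string& code : visits[i].codes) {
            if (code.empty()) continue;  // fewer codes than the column maximum
            out.emplace_back(id, code);
        }
    }
    return out;
}
```

Whether a synthesized row-position ID is useful in practice is an open question; it is only there so the sketch can handle both the with-ID and without-ID cases.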