jaybee84 / ml-in-rd

Manuscript for perspective on machine learning in rare disease
Other
2 stars 1 forks source link

Revisions: A section about combining multiple datasets #187

Closed jaclyn-taroni closed 2 years ago

jaclyn-taroni commented 3 years ago

In the current first paragraph of the Manage complex high-dimensional rare disease data section (https://github.com/jaybee84/ml-in-rd/blob/cd783d5e034838b7aa5a6690ec4d320ef712dbac/content/03.heterogeneity.md#manage-complex-high-dimensional-rare-disease-data), we cover the concepts like the curse of dimensionality, etc.

Now that we are revising the article to help onboard folks that are new to machine learning concepts, we have talked about breaking this paragraph out into its own section, which would be the first one after the intro, and expanding it since this is an important consideration (and some may even say stumbling block) for this context.

jaclyn-taroni commented 3 years ago

I don't think this can be completed (read: totally polished) before #185 is addressed, but I'm happy to get started on moving things around.