GenoML / genoml2

GenoML (genoml2) is an open-source Python package. It is an automated machine learning (autoML) platform for genomics data.
Apache License 2.0

Adjust genotypes for tSNE/UMAP/PC loadings #11

Status: Closed (mikeDTI closed this issue 4 years ago)

mikeDTI commented 4 years ago


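Describe Current Behavior/State and Recommended Feature Request: Adjust genotypes so that they become scores at the munge step, adjusted for tSNE/UMAP/PC loadings. This kind of adjustment is already common when preprocessing RNAseq data. It would provide a way to account for population substructure in the genotypes once, so the adjusted data can be reused across ML runs.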

Will this change the current API? How? Yes, it would add an extra munge option.

Who Will Benefit from this Feature? Anyone getting beat up by population stratification.

Any Additional Information? Dogs do not like thunderstorms.
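For readers unfamiliar with this kind of adjustment, here is a minimal sketch of one common way to do it: regress each genotype column on the covariates (for example PCs or UMAP components) and keep the standardized residuals as the adjusted scores. This is an illustration only, not GenoML's implementation. The function name `adjust_genotypes`, the simulated data, and the choice of ordinary least squares are all assumptions made for the example.

```python
import numpy as np

def adjust_genotypes(genotypes, covariates):
    """Hypothetical helper: residualize genotype columns on covariates.

    genotypes:  (n_samples, n_variants) array of allele dosages (0/1/2)
    covariates: (n_samples, n_components) array, e.g. PCs or UMAP components
    Returns z-scored residuals with the same shape as `genotypes`.
    """
    G = np.asarray(genotypes, dtype=float)
    C = np.asarray(covariates, dtype=float)
    # Design matrix with an intercept so the residuals are mean-centered.
    X = np.column_stack([np.ones(len(C)), C])
    # One least-squares fit covers all variants at once.
    beta, *_ = np.linalg.lstsq(X, G, rcond=None)
    resid = G - X @ beta
    # Scale each variant's residuals to unit variance; leave
    # (near-)constant columns unscaled to avoid dividing by zero.
    sd = resid.std(axis=0, ddof=1)
    sd[sd < 1e-12] = 1.0
    return resid / sd

# Simulated example: 200 samples, 5 variants, 2 ancestry components.
rng = np.random.default_rng(0)
pcs = rng.normal(size=(200, 2))
dosages = rng.integers(0, 3, size=(200, 5)).astype(float)
adjusted = adjust_genotypes(dosages, pcs)
```

The adjusted columns are uncorrelated with the covariates by construction, which is the property that makes them reusable across downstream models.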

mikeDTI commented 4 years ago

I'm also going to add a quick PCs versus outcome module to check for population stratification and whether it needs to be adjusted for.
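A rough sketch of what such a PCs versus outcome check might look like, under stated assumptions: it computes the absolute correlation between each PC and the outcome and estimates a p-value by permuting the outcome. The function name `pc_outcome_check`, the permutation approach, and the simulated data are hypothetical and are not taken from the GenoML codebase.

```python
import numpy as np

def pc_outcome_check(pcs, outcome, n_perm=1000, seed=0):
    """Hypothetical helper: test each PC for association with the outcome.

    Returns (abs_corr, p_values), one entry per PC. A small p-value
    suggests the PC tracks the outcome, i.e. possible stratification.
    """
    rng = np.random.default_rng(seed)
    P = np.asarray(pcs, dtype=float)
    y = np.asarray(outcome, dtype=float)
    # Standardize the PCs once; they do not change across permutations.
    Pz = (P - P.mean(axis=0)) / P.std(axis=0)

    def abs_corr(v):
        vz = (v - v.mean()) / v.std()
        return np.abs(Pz.T @ vz) / len(vz)

    observed = abs_corr(y)
    exceed = np.zeros(P.shape[1])
    for _ in range(n_perm):
        # Shuffling the outcome breaks any real PC-outcome link.
        exceed += abs_corr(rng.permutation(y)) >= observed
    # Add-one correction keeps the estimated p-values above zero.
    return observed, (exceed + 1) / (n_perm + 1)

# Simulated example: case/control status driven partly by the first PC.
rng = np.random.default_rng(42)
pcs = rng.normal(size=(300, 3))
status = (pcs[:, 0] + 0.5 * rng.normal(size=300) > 0).astype(int)
corr, pvals = pc_outcome_check(pcs, status)
```

If any PC shows a clear association with the outcome, that would be a signal to adjust for it before modeling.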

mikeDTI commented 4 years ago

😬😵


m-makarious commented 4 years ago

The new version of the package now has this. See the README for additional help, and update the pip package with: pip install genoml2 --upgrade