mmulvahill closed this issue 6 years ago
via KK
For the imputation method, isn't this done after the summarization step? In that case, you won't have multiple replicates per subject. If so, use the minimum across all subjects that have data for that metabolite (excluding zeros). If you want to keep the steps separate, what do you think of this: take the minimum of the replicates for each subject, then take the minimum of the non-zero values across subjects? I don't think there are hard, established rules.
I didn't remember this BPCA/half-min approach. How often does this happen? Is it frequent? I need to think about what's best in this scenario.
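For reference, the two minimum-based options described above can be sketched as follows. This is a rough illustration only, not the pipeline's code: the function names and the data layout (a list of per-subject values for one metabolite, with `None` for missing) are assumptions made for the example, and the second function reads "minimum of the replicates" literally, so a subject's minimum may be zero and is then dropped in the across-subject step.

```python
from typing import Dict, List, Optional, Sequence

Value = Optional[float]  # None marks a missing measurement (assumed encoding)


def min_nonzero(values: Sequence[Value]) -> Value:
    """Smallest strictly positive, non-missing value, or None if there is none."""
    kept = [v for v in values if v is not None and v > 0]
    return min(kept) if kept else None


def impute_min_across_subjects(values: Sequence[Value]) -> List[Value]:
    """Option 1 (after summarization): one value per subject for a metabolite.

    Zeros and missing values are replaced by the minimum non-zero value
    observed across subjects. If no subject has data, the input is returned
    unchanged.
    """
    floor = min_nonzero(values)
    if floor is None:
        return list(values)
    return [floor if (v is None or v == 0) else v for v in values]


def two_step_minimum(replicates: Dict[str, Sequence[Value]]) -> Value:
    """Option 2 (steps kept separate): replicates per subject for a metabolite.

    Step 1: take the minimum of the observed replicates for each subject.
    Step 2: take the minimum of those per-subject minima that are not zero.
    """
    per_subject = []
    for reps in replicates.values():
        observed = [v for v in reps if v is not None]
        if observed:
            per_subject.append(min(observed))
    return min_nonzero(per_subject)
```

With option 2, a subject whose replicates include a zero contributes nothing to the across-subject minimum, which may or may not be the intended behavior.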
My responses
GET NEW DATASET -- ask Dominick for the raw data for the NGH131 and Emory datasets
The Emory dataset has no replicates, so 'summarization' is not strictly necessary.
Closing issue -- removed the spike from the pipeline (the build is currently broken due to other changes, though) and removed the BPCA+0 approach.
Also got the new dataset from Dominik.
@kechrisk
For reference, from the manuscript: