dereneaton / ipyrad

Interactive assembly and analysis of RAD-seq data sets
http://ipyrad.readthedocs.io
GNU General Public License v3.0

analysis.pca: Impute missing as random value #414

Open isaacovercast opened 4 years ago

isaacovercast commented 4 years ago

Right now the default imputation scheme converts missing data into ancestral alleles, which creates a strong affinity among samples with lots of missingness. What if, instead, missing data were imputed randomly as either ancestral or derived? The idea is that random values at the missing sites are unlikely to create spurious affinities by chance, so the signal in the true shared variation should still be recovered. Just a thought...
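To make the idea concrete, here is a rough sketch of the two schemes side by side. It is not ipyrad code: the 0/1 coding (0 = ancestral, 1 = derived), the `MISSING = 9` sentinel, and the function names `impute_ancestral` and `impute_random` are all assumptions made up for illustration.

```python
import random

# Assumed coding for this sketch only: 0 = ancestral, 1 = derived.
MISSING = 9  # hypothetical sentinel for a missing call


def impute_ancestral(genos, missing=MISSING):
    """Fill every missing call with the ancestral allele (0)."""
    return [[0 if call == missing else call for call in row] for row in genos]


def impute_random(genos, seed=None, missing=MISSING):
    """Fill every missing call with 0 or 1, each chosen with equal probability."""
    rng = random.Random(seed)  # a seed makes the fill reproducible
    return [
        [rng.choice((0, 1)) if call == missing else call for call in row]
        for row in genos
    ]


# Toy matrix: rows are samples, columns are SNPs.
genos = [
    [0, 1, 9, 9, 1],
    [9, 9, 9, 0, 1],
    [1, 9, 0, 9, 9],
]

ancestral_fill = impute_ancestral(genos)
random_fill = impute_random(genos, seed=42)
```

With the ancestral fill, two samples that are both missing at a site end up identical there, so heavily missing samples look alike. With the random fill they agree at such a site only about half the time. Since each random fill is a single draw, it would probably make sense to repeat it over several seeds and check whether the PCA is stable.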