privefl / bigsnpr

R package for the analysis of massive SNP arrays.
https://privefl.github.io/bigsnpr/
183 stars 43 forks source link

Add support for plink2 binary format dosage data (pgen) #253

Open sritchie73 opened 2 years ago

sritchie73 commented 2 years ago

E.g. via a snp_readPgen() function analogous to snp_readBed().

The plink2 binary format (https://www.cog-genomics.org/plink/2.0/input#pgen) has several advantages over the plink1 binary format (bed/bim/fam):

Disavantages:

privefl commented 2 years ago

Please see https://github.com/privefl/bigsnpr/issues/176#issuecomment-791629700.

privefl commented 2 years ago

If anyone is willing to help implementing this, please discuss here.

privefl commented 1 year ago

In the meantime, you can have a look at the last point of https://privefl.github.io/bigsnpr-extdoc/inputs-and-formats.html#getting-FBM for a workaround.