ness-lab / mrt

0 stars 2 forks source link

Parallel conversion of VCFs to LDhat format #6

Closed James-S-Santangelo closed 4 years ago

James-S-Santangelo commented 4 years ago

We need to write a script that converts bgzipped VCFs into LDhat input format. It'll be important to preserve the naming convention of the VCFs to avoid losing the parameter combinations used to generate them (since they're in the filename).

LDhat requires two files as input: a .sites file and a .locs file.

vcftools can do this. HERE is draft version of a previous script that does this. It should provide a good launching-off point.

Note that VCFs need to be sorted first, using a script that is similar to this one here

James-S-Santangelo commented 4 years ago

Closing since no longer using LDhat