bystrogenomics / bystro

Natural Language Search and Analysis of High Dimensional Genomic Data
Mozilla Public License 2.0

Document sites that didn't lift over to hg38 for gnomAD #15

Open cristinaetrv opened 6 years ago

cristinaetrv commented 6 years ago

There should be a list of sites/coordinates where a missing value means the site didn't lift over from hg19 to hg38. For quality control, this would let us separate those sites from missing data that represents private mutations.
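To illustrate how such a list could be used, here is a minimal sketch. It assumes the unlifted sites are available as a set of (chromosome, hg19 position) pairs. The key format, the function name `classify_missing`, and the label strings are all hypothetical and not part of Bystro.

```python
from typing import Optional, Set, Tuple

# Hypothetical site key: (chromosome, 1-based hg19 position).
Site = Tuple[str, int]


def classify_missing(site: Site, gnomad_value: Optional[float], unlifted: Set[Site]) -> str:
    """Label a site by why its gnomAD annotation is present or missing."""
    if gnomad_value is not None:
        return "annotated"
    if site in unlifted:
        # Missing because the coordinate never made it from hg19 to hg38.
        return "failed_liftover"
    # Missing for another reason, e.g. the variant is private to the sample.
    return "absent_from_gnomad"


# Made-up coordinates, for illustration only.
unlifted_sites = {("chr1", 12345)}
print(classify_missing(("chr1", 12345), None, unlifted_sites))
print(classify_missing(("chr1", 99999), None, unlifted_sites))
```

With a lookup like this, only the sites labeled `absent_from_gnomad` would be treated as candidate private mutations.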

akotlar commented 6 years ago

Here are the sites that lifted over, but were dropped for QC reasons besides PASS (or 99% of them).

I should make a repo for the sites that didn't lift over. gnomad_skipped.zip

Leaving this open to discuss whether we should put the sites with discordant reference bases in the main gnomAD databases, or make separate tracks that include them.
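As a rough sketch of the separate-tracks option, the snippet below partitions lifted records by whether their reference base matches the hg38 reference. The record fields (`chrom`, `pos`, `ref`), the `hg38_ref` lookup, and the function name are assumptions made for illustration. They do not reflect how the gnomAD databases are actually built in Bystro.

```python
from typing import Dict, List, Tuple

Site = Tuple[str, int]       # hypothetical key: (chromosome, hg38 position)
Record = Dict[str, object]   # hypothetical record with "chrom", "pos", "ref"


def split_by_ref_concordance(
    records: List[Record], hg38_ref: Dict[Site, str]
) -> Tuple[List[Record], List[Record]]:
    """Return (main_track, discordant_track) based on reference-base agreement."""
    main_track: List[Record] = []
    discordant_track: List[Record] = []
    for rec in records:
        expected = hg38_ref.get((rec["chrom"], rec["pos"]))
        if expected is not None and str(rec["ref"]).upper() == expected.upper():
            main_track.append(rec)
        else:
            # Discordant base, or no reference base known for this site
            # (in this sketch both cases go to the separate track).
            discordant_track.append(rec)
    return main_track, discordant_track


# Made-up example data.
hg38_ref = {("chr1", 100): "A", ("chr1", 200): "G"}
records = [
    {"chrom": "chr1", "pos": 100, "ref": "A"},
    {"chrom": "chr1", "pos": 200, "ref": "T"},
]
main, discordant = split_by_ref_concordance(records, hg38_ref)
print(len(main), len(discordant))
```

Keeping the discordant sites in their own list means they are still queryable without changing what the main track reports.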