PMBio / deeprvat


Regenerate IDs for VEP annotation input VCFs #107

Closed Marcel-Mueck closed 3 weeks ago

Marcel-Mueck commented 1 month ago

What

When VCF files are normalized, a variant sometimes ends up with multiple IDs concatenated with a semicolon (;). This causes problems with the #Uploaded_variation column in VEP. This PR regenerates the IDs and makes sure each variant gets a unique ID. Related to issue #76, specifically this comment: https://github.com/PMBio/deeprvat/issues/76#issuecomment-2139008285
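For illustration, the idea of regenerating IDs could be sketched roughly as below. This is not the code from this PR: the function name `regenerate_ids`, the `CHROM_POS_REF_ALT` ID format, and the numeric suffix for repeated records are all assumptions made for the example. It only relies on the standard VCF layout, where the ID is the third tab-separated column.

```python
from collections import Counter
from typing import Iterable, Iterator


def regenerate_ids(lines: Iterable[str]) -> Iterator[str]:
    """Yield VCF lines with the ID column replaced by a freshly built unique ID.

    Hypothetical helper: the CHROM_POS_REF_ALT format is an assumption,
    not necessarily the scheme used in the pipeline.
    """
    seen: Counter = Counter()
    for line in lines:
        # Meta-information and header lines start with '#'; pass them through.
        if line.startswith("#"):
            yield line
            continue
        fields = line.rstrip("\n").split("\t")
        chrom, pos, _old_id, ref, alt = fields[:5]
        new_id = f"{chrom}_{pos}_{ref}_{alt}"
        # If the same variant appears more than once, append a counter
        # so that every record still gets a distinct ID.
        seen[new_id] += 1
        if seen[new_id] > 1:
            new_id = f"{new_id}_{seen[new_id]}"
        fields[2] = new_id  # the old ID (possibly "rs1;rs2") is discarded
        # Each emitted record line is terminated with a newline.
        yield "\t".join(fields) + "\n"
```

A record whose ID was `rs1;rs2` would then carry a single ID without a semicolon, which is the property the #Uploaded_variation column needs.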

Testing

Tested locally on example data

Test scenarios

Run pipeline locally