vector-engineering / covidcg

A COVID-19 CoV Genetics (CG) browser to inform therapeutics development
https://covidcg.org
MIT License
26 stars 5 forks source link

Remove degenerate NTs from larger DNA mutations #524

Closed atc3 closed 2 years ago

atc3 commented 2 years ago

Degenerate NTs ('N', 'R', 'K', 'M', 'Y', 'W', 'V', 'S', 'D') are somehow making it past our filter - probably because our filter only applies to mutations of length 1.

Intended behavior should just be to ignore these - so apply the filter upstream of the mutation concatenation step