Snitkin-Lab-Umich / scripts

Frequently used Scripts
2 stars 1 forks source link

Separate locus tag and strand information in snp and indel parser #21

Open zenalapp opened 5 years ago

zenalapp commented 5 years ago

https://github.com/Snitkin-Lab-Umich/scripts/blob/32b965c6a25fd8f26de5b8044b37e06597104a42/variant_parser_functions.R#L161

After we added the strand part to the annotation name, the locus_tag now contains the strand information too. We have to parse this out so we have separate locus_tag and strand vectors. (Both in the snp parser and the indel parser.)

image

katiesaund commented 5 years ago

Strands are now different. They look like this now: 1 gene: "CD630_00130 Strand Information: CD630_00130=+"
2 genes: "CD630_00130-CD630_00140 Strand Information: CD630_00130=+/CD630_00140=+/"

How do we want to incorporate into parser?