Closed Microbiology closed 7 years ago
Download script is running now.
Alright reference genome set is downloaded and formatted. Now I need to get a rep contig seq from each OGU and blastn it against the dataset.
Rep set is... set... and now I am running a blast to get a feel of what I am dealing with.
Did this with tblastx. Important to interpret this all with caution though.
To kick things off I think it makes sense to pull the longest contig as the representative sequence and blast it. This means: