NCBI-Hackathons / EndoVir

Discovery of Novel Endogenous Viruses
MIT License

split magicblast and vdb-dump in two separate steps #2

Closed janpb closed 6 years ago

janpb commented 7 years ago

For the analysis steps involving magicblast or rpstblastn, the sequences are pooled, since these tools handle large amounts of data well. The sequences therefore have to be collected in each iteration and then split again. However, the current concatenation of magicblast and vdb-dump loses the information about the aligned query.

ToDo: This step needs to parse the magicblast output prior to the vdb-dump and then dump the new reads for each contig.
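
A minimal sketch of the parsing step, not the code from the eventual fix. It assumes magicblast was run with tabular output, that the first two tab-separated columns hold the read ID and the contig ID, that comment lines start with `#`, and that `-` in the contig column marks an unaligned read. The function name `group_reads_by_contig` and its parameters are hypothetical.

```python
from collections import defaultdict


def group_reads_by_contig(lines, read_col=0, contig_col=1):
    """Map each contig ID to the set of read IDs aligned to it.

    Assumed input: tab-separated alignment lines with the read ID in
    `read_col` and the contig ID in `contig_col` (both assumptions).
    """
    reads_by_contig = defaultdict(set)
    for line in lines:
        line = line.rstrip("\n")
        # Skip blank lines and header/comment lines.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        # Skip malformed lines that lack the expected columns.
        if len(fields) <= max(read_col, contig_col):
            continue
        contig = fields[contig_col]
        # Assumption: "-" marks a read without an alignment.
        if contig == "-":
            continue
        reads_by_contig[contig].add(fields[read_col])
    return dict(reads_by_contig)


# Made-up example lines, for illustration only.
example = [
    "# header line",
    "read1\tcontigA\t98.0",
    "read2\tcontigA\t97.5",
    "read3\tcontigB\t99.1",
    "read4\t-\t0",
]
groups = group_reads_by_contig(example)
```

With a mapping like this, the read IDs for each contig could be passed to a separate vdb-dump call, so the link between a read and the contig it aligned to is kept.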

janpb commented 6 years ago

Fixed with newest commit.