AnantharamanLab / VIBRANT

Virus Identification By iteRative ANnoTation
GNU General Public License v3.0
149 stars 37 forks source link

Problem with gbk format #72

Closed Leytoncito closed 1 year ago

Leytoncito commented 2 years ago

When I run vibrant, everything works fine for me, however, the output gbk files are unusable by other programs.

I have tried several times to fix this problem, installing earlier versions of vibrant and installing vibrant on other computers, but the problem persists.

Perhaps it is due to an update of some dependency, I am not aware of this issue, but before I used vibrant without problems.

Also, I have tried to fix the gbk files manually, apparently it has a problem with tabs and/or spaces, hawever the problems persists.

KrisKieft commented 2 years ago

Hi,

I apologize but I probably will not get to fixing this issue. Please try other methods of building a genbank from the source genomes/proteins.

MrTomRod commented 2 years ago

@Leytoncito

I just had a similar problem (#71). However, as long as the LOCUS line is long enough, BioPython can read the format.

It occurred to me that this could be useful for you: you could write a simple Python script that reads the GenBank and creates a new one, in the hope that BioPython creates more compatible output.

Leytoncito commented 1 year ago

Thaks @MrTomRod.

Besides Biopython, another way out is to simply re-annotate the predicted prophage sequence.