Closed domenico-simone closed 4 years ago
Now this looks like either a big associated with external gene calls or missing exception handling. I'm sure @ekiefl will be happy to see someone has a case to demonstrate it. Can you please send your input files (BAM, BAM.BAI, and CONTIGS.db) to us so we can try to reproduce it?
Hi @meren, thanks! Here's some links:
Contig db: https://bit.ly/2yK4TZg BAM file: https://bit.ly/357Essz BAM index: https://bit.ly/2xhnCLk
To re-create the contig db and the BAM file: Contig fasta: https://bit.ly/3cPA2ck Gene calls imported with anvi-gen-contigs-database: https://bit.ly/2y3expJ FastQ (R1): https://bit.ly/3bKbAJ8 FastQ (R2): https://bit.ly/3eXuw9B
Thank you,
Domenico
Thank you very much for all these files!!
Thanks @domenico-simone. I can reproduce the error.
Hi @domenico-simone. The issue is that you have hundreds of gene calls that have start positions that are negative. Here are 4:
What should have happened, is anvi'o should have complained during contigs database creation. I'll try and update the code now.
@meren I made a branch with my best attempt, but in that branch anvi-self-test --suite mini
fails. Please see for yourself. It seems that prodigal's own gene predictions yields negative values for the start position, therefore I don't know how to proceed.
cbdcba63e7d0314bfe9fef5d625557bb218a4d37
I see, @ekiefl. I can see that the external gene calls file contains all those -1s. I will add a sanity check for this to do the testing much earlier.
Meanwhile, @domenico-simone, I think the script that helped you generate those gene calls needs an update :/
Can you remind us how you generated that the external gene calls file?
@domenico-simone, if you pull from master
anvi'o should now perform a better sanity check on your external gene calls file.
Hi @meren and @ekiefl,
thanks for your prompt reply and test! I have re-run the files I had linked and I see the error (nice error message btw :D). Good to have that sanity check now!
I had used Metaeuk to predict genes on my eukaryotic contigs and I hadn't noticed that the gene starts were already 0-based, so when I converted gene calls for anvi'o I subtracted 1 to the gene start and this caused the -1s to appear.
Thank you,
Domenico
Hi,
Sorry to come up with another issue!
As a follow-up to issue #1416 I have changed the header of my BAM file(s), sorted them and then tried to run
anvi-profile
. But, whileanvi-profile
runs, I get this error multiple times:then the execution hangs; if I stop it with ctrl+C, I get this message:
In case these could be relevant:
aa_sequence
column (because I'm setting up a metagenomics analysis focused on Eukaryota). Everything went smoothly duringanvi-gen-contigs-database
andanvi-run-hmms
;anvi-profile
, but I get the same error.Thanks for your help!
Domenico