sanger-tol / ascc

A Nextflow DSL2 pipeline for the identification of cobiont and contaminating sequences using fasta and pacbio data.
https://pipelines.tol.sanger.ac.uk/ascc
MIT License
3 stars 0 forks source link

Blast can sometimes fail due to the provided taxon id #49

Open DLBPointon opened 3 days ago

DLBPointon commented 3 days ago

Description of the bug

The assemblies we work with have a high likely hood of not existing in the Blast DB and so their TaxID will not be found. Currently this will cause a crash along the lines of:

  Command error:
    BLAST Database error: Taxonomy ID(s) not found.Taxonomy ID(s) not found. This could be because the ID(s) provided are not at or below the species level. Please use get_species_taxids.sh to get taxids for nodes higher than species (see https://www.ncbi.nlm.nih.gov/books/NBK546209/). 

This issue will track the number of times we come across this and depending on how often we will look into a more permenant fix.

Command used and terminal output

No response

Relevant files

No response

System information

No response