genomicsITER / NanoCLUST

NanoCLUST is an analysis pipeline for UMAP-based classification of amplicon-based full-length 16S rRNA nanopore reads
MIT License
106 stars 49 forks source link

Missing headers in consensus classification #40

Open Cynthiavlu opened 3 years ago

Cynthiavlu commented 3 years ago

Hi! Thank you for this tool. There is one thing which is not very clear to me. When you get the consensus_classification file you see the blast classification output. I was wondering whether you could tell me what the different columns stand for?

image

thank you in advance!

genomicsITER commented 3 years ago

Hi!

First, I'm not sure why you are not getting the header. I'll check for bugs and give you a better response.

The BLAST command we use follows the following output structure: 10 "sscinames staxids evalue length pident"

Columns refer to name, NCBI taxid, evalue, hit length and percentage identity. More detailed information is available in https://www.ncbi.nlm.nih.gov/books/NBK279684/