Bracken (Bayesian Reestimation of Abundance with KrakEN) is a highly accurate statistical method that computes the abundance of species in DNA sequences from a metagenomics sample.
This place may not be able to find seqid for sure.
In Kraken, seqid2taxid is the mapping of NCBI sequence IDs to taxonomic IDs, which is known to be in the NCBI format. On the other hand, Kraken output results are encoding IDs from fastq sequencing files. The connection between these two lies in the fact that Kraken uses the NCBI sequence IDs from the input fastq files to identify and classify the taxonomic content of the sequences. Therefore, the one-to-one correspondence is established based on the NCBI sequence IDs present in both the input fastq files and the seqid2taxid mapping provided by Kraken.
https://github.com/jenniferlu717/Bracken/blob/8800e4bfd0c984fec46d3f8bf9fad2129d52d3e9/src/kraken_processing.cpp#L113
seqid2taxid->find(seqid)->second;
This place may not be able to find seqid for sure.
In Kraken,
seqid2taxid
is the mapping of NCBI sequence IDs to taxonomic IDs, which is known to be in the NCBI format. On the other hand, Kraken output results are encoding IDs from fastq sequencing files. The connection between these two lies in the fact that Kraken uses the NCBI sequence IDs from the input fastq files to identify and classify the taxonomic content of the sequences. Therefore, the one-to-one correspondence is established based on the NCBI sequence IDs present in both the input fastq files and theseqid2taxid
mapping provided by Kraken.