DRL / blobtools

Modular command-line solution for visualisation, quality control and taxonomic partitioning of genome datasets
GNU General Public License v3.0
187 stars 44 forks source link

Feature Request: Using Kraken for hits #62

Closed nickp60 closed 6 years ago

nickp60 commented 6 years ago

Hi, I'm not sure if I have missed something, but is there a way to use the output from Kraken as a hits file? The main thing missing is some way of inferring a score for the third column of the blobtools hits file. The scope for the Kraken output files are given here

Any advice would be appreciated!

Thanks,

~Nick

DRL commented 6 years ago

Hi Nick,

If you just want the Kraken classification, you could just pick any number for a score (e.g. 100) and then run something like the line below which will take all contigs which were classified by Kraken (and their taxids) and assigns them a score of 100:

 awk '$1=="C"' $KRAKENFILE | cut -f2,3 | perl -ne  'chomp; print $_."\t100\n"'

Comments:

but should work AFAIK ...

let me know if you have troubles.

cheers,

dom