wdecoster / NanoPlot

Plotting scripts for long read sequencing data
http://nanoplot.bioinf.be
MIT License
401 stars 48 forks source link

Using barcoded BAM files with no summary #356

Closed Ge0rges closed 1 month ago

Ge0rges commented 4 months ago

Hello,

I have a set of unaligned BAMs seperated by barcode given to me by dorado. I was wondering if it would be possible to use these with the --barcoded flag to get a summary of each barcode, and a summary of the all together? This seems to require a summary file currently.

wdecoster commented 4 months ago

So you have individual bams, and for each bam you want a summary? --barcoded does not support that, indeed, but I think you should use all bams at the same with NanoPlot to get the "all together" report and use them in NanoComp to get separate reports/comparison.

lucyintheskyzzz commented 4 months ago

@Ge0rges I wonder if its possible to concatenate all the barcoded .bam files into one file then run NanoPlot to get the total read summary from all the barcodes? I will probably try this with NanoComp as well.

wdecoster commented 4 months ago

NanoPlot will combine multiple bams, no need to concatenate first

lucyintheskyzzz commented 4 months ago

@wdecoster I am actually doing fastq.gz files from dorado and looks like my nanoplot results separated out each barcode into separate directories. Is there a way to have all the results combined?

image
wdecoster commented 4 months ago

You wrote a loop over the fastq directories (in your other issue). Just use all fastqs at the same time.

Ge0rges commented 4 months ago

@wdecoster seems like NanoComp only supports aligned BAM files however, what would be the logical way to use it with the unaligned barcodes given by dorado?

Also what is the difference between using --barcoded with a summary file, and just running each barcode individually, then all together? Surely the tool could produce a per barcode analysis with the latter option as well?