Open chucknordy opened 6 months ago
That's a very interesting observation and I totally did not expect that to be the case. Remarkable. Maybe --only-report should be the default, then. Hrm.
Just FYI, was running into similar computation time issue and included the '--no_static' option, and drastically improved the wall time. Looks like there might be an issue with the kaleido python module.
Yes, that seems to be the case. Hrm, static images can be helpful too... I think. Or maybe everyone only cares about the html report.
Possibly, the most useful output for us is the statistics, but it is nice to have the html files as supplementary.
Thanks for the responses, folks. I can confirm that using the --no_static
option (and not the --only-report
option) also allows very fast runtimes. So indeed, the generation of the static PNG files seems to be the problem. But really, I don't see those as particularly important (at least, not worth waiting for -- with things as-is, it would be much quicker to just create the HTML output and then grab screenshots, if one really wanted static images for a slide deck or something).
For the next release, I'd recommend making the --no_static
behavior the default, getting rid of that option, and instead providing, say, a --static
option for those who really want PNGs and have lots of patience. :-)
Possibly, the most useful output for us is the statistics, but it is nice to have the html files as supplementary.
I don't know which input format you are using, but if it is bam or cram have you considered using cramino?
For the next release, I'd recommend making the --no_static behavior the default, getting rid of that option, and instead providing, say, a --static option for those who really want PNGs and have lots of patience. :-)
Yes, that sounds like a good suggestion :)
For what it is worth when I tested out --only-report
using average 27X of WGS data in a single bam or cram file, I only observed marginal gains (maybe about 5%?) in run time compared to running similar sized data without --only-report
. Although I did not perform extensive benchmarking for this.
Hi, I've recently started using NanoPlot, really liking it, but I'm wondering about one thing.
If I use this command line, with the
--only-report
option, then of course I only get the one HTML output file (along with the stats and the log), but that one file contains the full output, with all of the various sub-plots.If however I use the same command line, but simply omit the
--only-report
option, it then produces a bunch more output files (PNGs and HTML versions of each subplot separately), as expected. But what I didn't expect, is that the run time is much longer for the second command. Is there some obvious reason for this? Seems strange, since presumably the same information is being processed either way.The output log here, for a small FASTQ input of only 594 Nanopore reads, shows how it took 45 minutes to run the latter way; whereas it took only seconds to run the first way. From the timestamps, it looks as if the simple act of saving the PNG files it taking many minutes each, which makes no sense to me. Please let me know if I'm missing something obvious, thank you!