What should we benchmark?

dib-lab / genome-grist

map Illumina metagenomes to genomes!

https://dib-lab.github.io/genome-grist/

Other

36 stars 6 forks source link

What should we benchmark? #263

Open ccbaumler opened 1 year ago

ccbaumler commented 1 year ago

Following the same process as sourmash-bio/sourmash#2410, we will benchmark the genome-grist workflow with a combination of the six sequences listed in the sourmash-bio/sourmash#2410. My instinct is to:

Run each sequences alone
Run a variety of sequences from small to large sets
Run the all six together

We could also include benchmarking different databases across the steps above.

ctb commented 1 year ago

I don't think that genome-grist has any individually expensive steps or computationally complex scripts that are part of it; it's just the workflow overall that involves an awful lot of steps, much like charcoal. That may change your benchmarking strategy.