nanoporetech / dorado

Oxford Nanopore's Basecaller
https://nanoporetech.com/
Other
452 stars 53 forks source link

Unclear interaction between `--modified-bases` `model` and automatic model selection #727

Closed awgymer closed 3 months ago

awgymer commented 3 months ago

Issue Report

Please describe the issue:

The purpose of, and interaction between, the --modified-bases and model params are unclear, particularly as it pertains to the automatic model selection.

Consider the following scenarios:

This inconsistent behaviour regarding the modified-bases flag and whether or not a modified model is downloaded is unclear from any documentation, and indeed it seems that with automatic model selection that parameter is just sliently ignored?

Run environment:

tijyojwad commented 3 months ago

Hi @awgymer - this was a bug in 0.5.3 that's now fixed in 0.6.0. Running hac@v4.2.0 with --modified-bases should also download the modified bases model and run basecalling with methylation.

awgymer commented 3 months ago

I see. I will try to update to 0.6.0 ASAP

But is there a reason to have multiple conflicting ways of specifying the methylation parameter?

It feels like there is some weird interactions possible e.g. what would the expected behaviour be here?

dorado basecaller hac@v4.3.0,5mCG_5hmCG,6mA pod5_files --modified-bases 5mCG_5hmCG
tijyojwad commented 3 months ago
$ dorado basecaller hac@v4.3.0,5mCG_5hmCG,6mA pod5 --modified-bases 5mCG_5hmCG
[2024-04-04 09:30:18.587] [info] Running: "basecaller" "hac@v4.3.0,5mCG_5hmCG,6mA" "pod5" "--modified-bases" "5mCG_5hmCG"
[2024-04-04 09:30:18.588] [error] Only one of --modified-bases, --modified-bases-models, or modified models set via models argument can be used at once
awgymer commented 3 months ago

Okay so that's good. Thanks.