nanoporetech / dorado

Oxford Nanopore's Basecaller
https://nanoporetech.com/
Other
477 stars 59 forks source link

Basecalling model for Direct RNAseq SQK-RNA002 #200

Closed tir-hbdx closed 1 year ago

tir-hbdx commented 1 year ago

I am trying to use Dorado for basecalling of a Direct RNAseq dataset prepared with the SQK-RNA002 kit. Using the only available RNA model for Dorado (rna003_120bps_sup@v3) I get the error: [error] Sample rate for model (4000) and data (3012) don't match.

Is there an RNA model for Dorado compatible with this dataset?

vetmohit89 commented 1 year ago

Hello, I also got the same error Resetting modules to system default [2023-05-26 10:53:13.082] [info] > Creating basecall pipeline [2023-05-26 10:54:57.730] [error] Sample rate for model (4000) and data (3012) don't match.

itslittman commented 1 year ago

@tir-hbdx I've used this model to basecall RNA002 data and had no error messages like that, but I'm not sure the results are reliable since it says 120bps and RNA002 translocation speed is 70bps. That being said, the resulting tracks look good in IGV. I heard there was an RNA003 kit in the works that was scrapped in favor of the upcoming RNA004 kit; maybe this SUP model was intended for RNA003 but can be used with RNA002 data and so was still released here? There's definitely no dorado version of the RNA002 HAC model I use during live basecalling, at least that I can see.

I actually came here to ask if this model produces reliable results with RNA002 and saw this thread/figured I'd jump in. If my question is too tangential I can open a new thread.

Noah

tijyojwad commented 1 year ago

Hi @tir-hbdx @vetmohit89 - you're seeing the error because the RNA003 released models is not technically compatible with the RNA002 dataset. We will be released the correct models in a subsequent release. I'll keep the issue open so I can get back to you when that's ready.

@itslittman - you were probably running a dorado pre-0.3.0 release, right? that might be okay... but the recommendation is to update to latest dorado and use the appropriate models once we release them :)

ChaseWLW commented 1 year ago

Hey @tijyojwad, I tried to run the just released rna002_70bps_hac@v3 model on some RNA002 datasets and got a similar error message: [error] Sample rate for model (3000) and data (3012) don't match.

Any idea how to fix this?

tijyojwad commented 1 year ago

Hi @ChaseWLW - thanks for reporting this! This is a bug, and we're working on a fix.

stegiopast commented 1 year ago

Hi @tijyojwad I am interested in the progress of the bugfix? Is there any release planned soon ?

iiSeymour commented 1 year ago

@stegiopast this fix made it into the v0.3.2 release from last week.