nanoporetech / dorado

Oxford Nanopore's Basecaller
https://nanoporetech.com/
Other
477 stars 59 forks source link

std::runtime_error (core dumped) with Dorado 0.7.2 #923

Open karlijn-doorenspleet opened 2 months ago

karlijn-doorenspleet commented 2 months ago

Issue Report: std::runtime_error (core dumped) with Dorado 0.7.2

The issue:

I have basecalled my data (short read amplicon data) with several versions of Dorado and keep on having the same (or similar) error:

terminate called after throwing an instance of 'std::runtime_error'calling
what(): Empty sequence and qstring provided for read id 5e0ec593-0bf6-4fd2-9fc8-61a20ad2be70 Aborted (core dumped)

It runs for 30-40% or so before the error. the empty string read id (that is is listed in the error) has the same ID when running dorado 0.7.0 and 0.7.2 , but is different when running dorado 0.5.3. (terminate called after throwing an instance of 'std::runtime_error' what(): Empty sequence and qstring provided for read id f76d9321-0dbb-4c6f-88ed-8acae5609c91 Aborted (core dumped)

Steps to reproduce the issue:

My code was as follows: ./dorado basecaller dna_r10.4.1_e8.2_400bps_sup@v4.3.0 /path/to/pod5 --recursive --trim adapters --barcode-sequences /path/to/Sequence_file_1_96_fw.fa --barcode-arrangement /path/to/Dorado_arrangement_file_barcode_01_96.toml --min-qscore 10 > /path/to/Dorado_basecalled.bam

Run environment:

Logs

HalfPhoton commented 2 months ago

Hi @karlijn-doorenspleet This looks like an internal error while handling short reads.

Are you able to find a few reads which demonstrate this issue so that we can look into it?

Kind regards, Rich

karlijn-doorenspleet commented 2 months ago

Hi Rich,

Thanks!

Yeah I have found the .pod5 file that has been causing an issue:

FAY62814_pass_9d451205_cda359de_792.pod5.zip

RByers commented 2 days ago

I'm hitting this as well with Dorado 0.7.3+6e6c45cd on MacOS. I have restarted with --no-trim instead of --trim adapters and will follow up here on whether or not that fixed the issue. Do you need another repro POD5? [Update] Indeed with --no-trim my basecalling is finally working reliably.