nanoporetech / dorado

Oxford Nanopore's Basecaller
https://nanoporetech.com/
Other
522 stars 62 forks source link

order for dorado modified basecalling on a barcoded dataset but starting with fast5 files? #990

Closed hilo1 closed 1 month ago

hilo1 commented 2 months ago

Is there a more correct order to doing this?

I need to do modified basecalling on a barcoded dataset, but I am starting with fast5 files.

I'm looking for a sanity check for the order of doing this.

Does this seem realistic?

  1. Create pod5 files with pod5 convert fast5 (appropriately looping to hit all the files)
  2. Run modified basecalling with dorado basecaller on the pod5 files to make an output.bam
  3. Demultiplex the output.bam with dorado demux on the output.bam

Is there a simpler way of doing this?

HalfPhoton commented 2 months ago

This workflow is sensible yes. You don't have to use pod5 files yet - but it's recommended. FAST5 support is marked as deprecated and will be removed in a future release so I'd suggest getting used to working with pod5s.