Closed katievigil closed 6 months ago
Could you please provide a sample of the lines in your file M02_barcode01_without_blank.fastq.gz
?
(base) [kvigil@qbc2 onr.raw.data]$ head M02_barcode01.fastq @b32f2590-167b-4931-85e3-55a69c3f315b runid=d559ed7f061716f0c08cf01d6d1ad021f008fb27 read=16 ch=814 start_time=2023-12-21T09:04:20.786164-06:00 flow_cell_id=PAS49596 protocol_group_id=ONR122123 sample_id=ONR122123 barcode=barcode01 barcode_alias=barcode01 parent_read_id=b32f2590-167b-4931-85e3-55a69c3f315b basecall_model_version_id=dna_r10.4.1_e8.2_400bps_fast@v4.2.0 GTTTCCAGTTCACAATCCGACAGCCAAATTTTCAAACATTATGCGCAGCTCATCACTATATCGAGACACGAGTTAGCAAGTAGCATTTGACTCTGTTTATCGAAATGAATCAAATTATTTGTTTTTGGGAAATACATTCAATTTGATAATGTCAAAAGTGGATTGACTTACAAATAAATCATCATCGATGCCCAGAATCGCTACTCAAAATTTTTATGTTACGCCAGTGCCTAAGTTTCCTCTACACCAAGACTCATTAGTGACAATGGAATACAAAAATATTATTAAGGCCAAATCAAGTACAAAGTGGAAAGGCTGAAACCAAAATTTTAAAAATTTGGAAAAGCGATAGTGACTGGGAAACCA
So I ended up unzipping the fastq.gz to fastq files and here are the sample lines
Also I want to run all my samples at the same time in medaka, but I know not all the samples will have the same basecaller. I have some older samples that used guppy and newer samples that used dorado? - Can medaka handle this?
Also my older samples were run on mk1b and newer samples using promethion
The header in the fastq shows a "fast" basecaller was used, I believe we have stopped providing medaka models for fast basecallers (only hac and sup).
Medaka processes a single sample, it cannot be used to process multiple samples simultaneously.
@cjw85 thank you for providing this information, maybe I can re-basecall all my samples using dorado HAC or SUP.
I'm curious if the v4.3.0 at the end of the header line is the dorado version?
Hi I am trying to figure out all the models for all my fastq.gz files, but I am getting this error message when I run one sample: