Closed jpn2021 closed 2 years ago
The code is failing to lookup the length of the sequence contig_1 from the provided input file. I would suggest that this is because the incorrect file has been provided, it must be the same file as originally provided as the reference sequence during the initial alignment step.
Thank you. When running mini_align
should the input reference fasta be a reference file or should it be from FLYE? Perhaps I had some confusion on when and how to use the assembly from FLYE. There is a known reference file for this genome.
If your aim is to assess variants in your sample compared to a known sequence then you should use the reference sequence throughout the whole process. You do not need to create an assembly with flye.
The medaka_haploid_variant
helper script automates the process I think you are trying to accomplish.
After successfully running
mini_align
(with -m -f -a -A) andmedaka consensus
(--model r941_min_hac_variant_g507) and getting .bam, .bam.bai, and .hdf files I get the below error from runningmedaka variant
:Sometimes the KeyError is contig_10 instead of contig_1.
Environment (if you do not have a GPU, write No GPU):
Additional context