Open tbrown91 opened 1 year ago
We have a Snakemake pipeline started at the CNAG. It fishes ONT reads with a target mitogenome, then assembles with Flye, selects candidate circular contigs, optionally polishes with Illumina (tricky to select correct Illumina reads, so we're working out some issues with that), then annotates, sets start and orientation based on tRNA annotation and removes duplication if any (sometimes two or more mitogenomes contigged in tandem), then re-annotates. There are still issues with it. It's hard to get alignment parameters that always work, given highly variable input (mitogenome references come from across the tree of life rather than just vertebrates, for example).
Hi Tyler, Do you have the pipeline in github anywhere so that we can test it? Diego also has some good experience using Unicycler and Trycycler with hybrid ONT/Illumina datasets. We would like to test some pipelines and see what is easiest to install and run alongside giving robust results.
I'll talk to Fernando and see where this is at. I know he has a few more rules to implement before it's ready for external testing.
Maybe based on Unicycler after mapping reads to reference