cambiotraining / awd-pathogen-bioinformatics

Materials for "Introduction to Cholera Genomics" course
https://cambiotraining.github.io/awd-pathogen-bioinformatics

Pipeline revisions - kraken, flye, etc #10

Closed by tavareshugo 10 months ago

tavareshugo commented 11 months ago

Update the pipeline to use de novo assembled genomes:

Lower priority is to use kraken2 instead of mash.

tavareshugo commented 11 months ago

We have a script that runs a loop through several samples, based on a samplesheet.
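The samplesheet-driven loop can be sketched as follows. This is only an illustration in Python, not the actual shell script used in the course: the column names `sample` and `fastq`, the helper names `read_samplesheet`, `run_all` and `describe`, and the example file paths are all assumptions made for this sketch. The per-sample step is a placeholder that returns a message instead of running any tool.

```python
import csv
import io


def read_samplesheet(handle):
    """Yield (sample, fastq) pairs from a CSV samplesheet with a header row.

    The column names "sample" and "fastq" are hypothetical.
    """
    for row in csv.DictReader(handle):
        sample = (row.get("sample") or "").strip()
        fastq = (row.get("fastq") or "").strip()
        if not sample:
            # Skip rows that have no sample name
            continue
        yield sample, fastq


def run_all(handle, process_sample):
    """Apply process_sample to every sample and collect results by sample name."""
    results = {}
    for sample, fastq in read_samplesheet(handle):
        results[sample] = process_sample(sample, fastq)
    return results


def describe(sample, fastq):
    """Placeholder per-sample step: a real pipeline would call its tools here."""
    return f"would process {sample} from {fastq}"


if __name__ == "__main__":
    # Hypothetical samplesheet contents, inlined so the sketch is self-contained
    sheet = io.StringIO(
        "sample,fastq\n"
        "isolate01,reads/isolate01.fastq.gz\n"
        "isolate02,reads/isolate02.fastq.gz\n"
    )
    for sample, message in run_all(sheet, describe).items():
        print(f"{sample}: {message}")
```

Passing the per-sample step in as a function keeps the loop separate from the work done on each sample, so the same loop can drive a dry run or the real commands.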

Note about medaka_consensus: if a GPU is detected, TensorFlow will use it automatically. However, because our laptop GPUs are relatively low-spec (4 GB of memory on our NVIDIA cards), the default medaka settings make the pipeline fail. There are two possibilities:

tavareshugo commented 10 months ago

We now have a custom-made shell script that does all these steps.

However, here are some notes on alternative pipelines to consider in the future: