fanglab / nanodisco

nanodisco: a toolbox for discovering and exploiting multiple types of DNA methylation from individual bacteria and microbiomes using nanopore sequencing.
Other
66 stars 7 forks source link

reference metagenome file #6

Closed yfan2012 closed 3 years ago

yfan2012 commented 3 years ago

Hi,

I was wondering how you recommend the reference metagenome file be generated when doing the 'generate current differences' step for metagenomic binning. Can the metagenomic contigs for my sample of interest be used, or is it necessary to combine reference quality genomes perhaps based on the organism abundance in my sample? If contigs can be used, do you have recommendations about using an assembly constructed from the wga run, the native run, or combining the reads from both runs? Thanks in advance!

touala commented 3 years ago

Hello @yfan2012,

Thank you for your interest in nanodisco.

We briefly discuss the choice of genome/metagenome to use in the question #3 of our FAQ. To summarize, you should use the closest matching sequences to your sample. In most cases, a good choice would be using a de novo assembly from native nanopore data, which is further polished either with WGA nanopore reads or Illumina/PacBio reads (correcting methylation-related consensus errors).

For your specific question, if you're working with a mock microbiome or a heavily studied sample, then using matching reference quality genomes could indeed be an option but it's not widely applicable.

For broad applications to microbiome samples, we recommend the following workflow as described in our study:

Please let me know if this answers your question or if you have any additional ones.

Alan

yfan2012 commented 3 years ago

Perfect, thanks very much!