fanglab / nanodisco

nanodisco: a toolbox for discovering and exploiting multiple types of DNA methylation from individual bacteria and microbiomes using nanopore sequencing.
Other
66 stars 7 forks source link

multiplexed libraries with multiple taxa #38

Closed nick-youngblut closed 1 year ago

nick-youngblut commented 2 years ago

It seems that nanodisco assumes that the fast5 files all contain sequence data from the same reference (e.g., nanodisco preprocess -r <reference_genome>). What about instances where multiple strains/species/genera/etc are multiplexed on the same nanopore run? I would think that multiplexing would be common for modified base identification, given that sequencing just one microbial genome per nanopore run is generally a waste of resources.

fanggang commented 2 years ago

Thank you for your interest in our work. Indeed, multiplexing is encouraged for doing bacterial methylation analysis with nanodisco, for both MinION and Flongle. The strains presented in our paper were also sequenced with multiplexing. Regarding your question: the current solution would probably be first de-multiplexing first and then use nanodisco (take de-multiplexed native and WGA data as input). Hope this helps.

nick-youngblut commented 2 years ago

Thanks @fanggang! I don't see anything in the documentation on dealing with multiplexed data. It would be great to have some info in the primary nanodisco docs on how to go about first demultiplex and then use nanodisco on the demultiplexed data. For instance, I'm guessing that many people don't know how to produce "de-multiplexed native" data.

fanggang commented 2 years ago

Great feedback. We will provide this information to make it easier.

Gang

On Mar 5, 2022, at 10:51 AM, Nick Youngblut @.***> wrote:

 Thanks @fanggang! I don't see anything in the documentation on dealing with multiplexed data. It would be great to have some info in the primary nanodisco docs on how to go about first demultiplex and then use nanodisco on the demultiplexed data. For instance, I'm guessing that many people don't know how to produce "de-multiplexed native" data.

— Reply to this email directly, view it on GitHub, or unsubscribe. Triage notifications on the go with GitHub Mobile for iOS or Android. You are receiving this because you were mentioned.

touala commented 2 years ago

Hello @nick-youngblut,

Thank you very much for the suggestion. We added an entry in our FAQ section with some help to process those kind of datasets (please see Q14).

Alan