theiagen / public_health_bioinformatics

Bioinformatics workflows for genomic characterization, submission preparation, and genomic epidemiology of pathogens of public health concern.
GNU General Public License v3.0
34 stars 16 forks source link

[Feature Request] Binning for TheiaMeta if reference is not provided #321

Closed cimendes closed 5 months ago

cimendes commented 6 months ago

:cool:

:pushpin: Explain the Request

As the next step for TheiaMeta development, binning can be implemented.

:books: Context

After an in-house assessment of binning performance, SemiBin2 was selected as the most suitable software for this purpose. It requires sorted bams of the metagenomic assembly files, and produces an array of fastas, one for each bin.

:chart_with_upwards_trend: Desired Behavior

If a reference is not provided, the metagenomic assembly goes through binning after mapping the read files to the assembly (with BWA). As output, an array of fasta files is provided to the Terra data table.

:information_source: Additional Information