Closed zckoo007 closed 3 years ago
Hi @zckoo007,
Thanks for your interest in mOTUs.
The meta-mOTU that you are looking for has 9 marker genes, which are the one listed by your grep
command.
If you look at db_mOTU_DB_CEN.fasta
you will find the fasta sequence of these genes.
Then I went to binning, and I assembled 100 high-quality bins. I want to find out which bins is this unknown bacteria
I would try to find these 9 genes in the 100 bins that you assembled. Maybe vsearch should do the trick. And the percentage identity should be >96%.
Maybe there are better methods than vsearch, like predict the genes from the bins and then compare them to the 9 genes. But maybe it is just extra-work that is not needed.
Also, it might be easier if you first link the bins to create some Metagenome Assembled Genomes (MAGs) and then try to identify the genes. This is important if the bins are relatively small. For example, if your genome is composed of 20 bins, you can only identify 9 with the previous method (since there are only 9 marker genes).
In case you have a MAG (but probably also on contigs it's fine), you can use this tool to extract the 10 marker genes: https://github.com/AlessioMilanese/classify-genomes
Use the command:
classify-genomes <fasta_file> -m marker_gene_seq.fasta
In marker_gene_seq.fasta
there should be the sequences of the marker genes identified in the contig/MAG.
Hope it makes sense?
Yes, very helpful!!!
Does capitalization affect the result?
No, but if you want to be sure you can transform all to uppercase letters. It might be that some tools treat differently uppercase and lower case letters.
Hi all,
I used your excellent software call for taxnomy, and I found that there is an unknown bacteria particularly high in my case samples.(meta_mOTU_v25_12476). Then I went to binning, and I assembled 100 high-quality bins. I want to find out which bins is this unknown bacteria。 when I type the following code, I found more than one CDS, so which CDS is meta_12476? would you please tell me how to match my bins to this CDS?