As discussed at the GSC board meeting, the current MiMAG definition is bacterial centric. However, it is equally possible to capture Eukaryotic MAGs, so it would be really good to modify the definition of the high-quality draft MAGs. I can not find the standard file in this repo, so hope that Ramona's link was correct. I am taking the definition from the paper:
This was the original definition:
High-quality draft (SAG/MAG)
Assembly quality | Multiple fragments where gaps span repetitive regions. Presence of the 23S, 16S, and 5S rRNA genes and at least 18 tRNAs.
Completion | >90%
Contamination | <5%
The reference to 23S, 16S is generally wrong, and should be modified to large subunit (LSU) RNA, small subunit (SSU) and the presence of 5.8S rRNA or 5S rRNA depending on whether it is a eukaryotic or prokaryotic genome, respectively.
This is really important to get fixed as it is blocking the submission of eukaryotes to INSDC.
As discussed at the GSC board meeting, the current MiMAG definition is bacterial centric. However, it is equally possible to capture Eukaryotic MAGs, so it would be really good to modify the definition of the high-quality draft MAGs. I can not find the standard file in this repo, so hope that Ramona's link was correct. I am taking the definition from the paper:
This was the original definition:
High-quality draft (SAG/MAG)
Assembly quality | Multiple fragments where gaps span repetitive regions. Presence of the 23S, 16S, and 5S rRNA genes and at least 18 tRNAs. Completion | >90% Contamination | <5%
The reference to 23S, 16S is generally wrong, and should be modified to large subunit (LSU) RNA, small subunit (SSU) and the presence of 5.8S rRNA or 5S rRNA depending on whether it is a eukaryotic or prokaryotic genome, respectively.
This is really important to get fixed as it is blocking the submission of eukaryotes to INSDC.
Thanks,
Rob