soedinglab / plass

sensitive and precise assembly of short sequencing reads
https://plass.mmseqs.com
GNU General Public License v3.0
145 stars 13 forks source link

Output FASTA header format? #6

Closed sjaenick closed 5 years ago

sjaenick commented 5 years ago

Assembled protein sequences contain additional information, e.g.

[Orf: 39, 242, 18446744073709551615, 1, 1]

What is the meaning of these numbers? Is there any coverage information included? (If not, can it be added?)

Thanks.

martin-steinegger commented 5 years ago

@sjaenick thank you for trying Plass. 👍

The information is current just the open reading frame of the center fragment that got extended. So it is not very useful. You can compute the coverage by mapping back the reads using the mmseqs2 map workflow.

sjaenick commented 5 years ago

Thanks.