FelixErnst / Structstrings

:package: Structstrings: implementation of Biostrings to work with dot bracket annotations
https://felixernst.github.io/Structstrings/
4 stars 0 forks source link

extended dot-bracket notation #3

Open brendanf opened 2 years ago

brendanf commented 2 years ago

I'm working on a BioConductor package to interface with Infernal, which produces RNA alignments with an extended dot-bracket notation called WUSS. In addition to the characters allowed by Structstrings currently, WUSS allows:

for unpaired positions:
_    hairpin loops
-    bulges, internal loops
,    multifurcation loops
:    external residues
~    unaligned residues

for paired positions:
A-Z    left part of pseudoknot
a-z    right part of pseudoknot

At the moment I am just using BString and BStringSet to represent these secondary structure strings, but it seems that it would be useful to modify or extend Structstrings to also support WUSS. Is that something you would be interested in either a) implementing yourself, or b) considering a pull request for?

FelixErnst commented 1 week ago

@brendanf Sorry for the long delay. Certainly a pull request would be much appreciated.

FelixErnst commented 1 week ago

Just had a look at the Infernal userguide. My guess would be that the following things would be on the todo list