gtDMMB / RNAStructViz

Visualization, comparison, and analysis of RNA secondary structures via a cross-platform GUI
https://github.com/gtDMMB/RNAStructViz/wiki
GNU General Public License v3.0
17 stars 5 forks source link

sample 5S_S.cerevisiae_UNAfold.ct has two structures in it #76

Closed ceheitsch closed 4 years ago

ceheitsch commented 4 years ago

File length should be 119, not 238.

maxieds commented 4 years ago

@ceheitsch

It turns out that the structures in the original 5S_S.cerevisiae_UNAfold.ct were not duplicates. I interpreted this to mean that they were both CT file samples generated by UNAfold at some point. For the record, the stochastic sampling utility bundled with RNAstructure outputs a CT file full of concatenated samples in CT format, one right after the other, so I assume the second structure in our file was generated according to a similar scheme in the former program.

To preserve both samples historically bundled with our RNAStructViz distribution, I kept the first structure in 5S_S.cerevisiae_UNAfold.ct and moved the second structure to a new file (also bundled) named 5S_S.cerevisiae_UNAfold_structure2.ct. In principle, given this convention in a number of sample generation applications, we might want to consider allowing CT/NOPCT files contain multiple samples. The current code ignores all but the first in a file.

This fix should be visible in the new v2.2.1-testing release that will be available via brew for Mac OS later today.

maxieds commented 4 years ago

@ceheitsch Please make sure to delete the existing sequences folder in your home directory:

$ rm -rf ~/RNAStructViz

Then copy the structures again using the help (question mark) button at the upper right of the LHS main window pane:

Screenshot at 2019-12-13 15-55-36

ceheitsch commented 4 years ago

All sorted. Opening a feature request for loading multiple structures in a single ct file makes sense. The number should be capped as with the current dot-bracket ones.