Closed kinow closed 6 years ago
Not necessary to really blog about it.
In the end we can write the port of the basic classes without Text :-)
The draft that I started...
The R Biostrings library contains several classes to help manipulating strings
from different alphabets.
`XString` is the base class, with the following implementations:
* BString - used to store any string (B comes from "B"ig String)
* DNAString - used to store DNA sequence strings
* RNAString - used to store RNA sequence strings
* AAString - used to store Amino Acids sequence strings
You can store any string in a BString. But DNAString, RNAString, and the
AAString have a limited alphabet, defining the characters that you may use
when storing sequence strings.
(A, C, G, T, M, R, W, S, Y, K, V, H, D, B, N, -, +, and . for the gap symbol)
*
Using the AlphabetConverter or similar classes would be an overhead
Maybe try to write a Java port of Biostrings using Apache Commons Text.