npgall / concurrent-trees

Concurrent Radix and Suffix Trees for Java
Apache License 2.0

Support 8-bit encoded strings #7

Closed: GoogleCodeExporter closed this issue 9 years ago

GoogleCodeExporter commented 9 years ago
Java's default string encoding, UTF-16 at 2 bytes per character, is inefficient
for strings that could otherwise be encoded with a single byte per character.

It should be possible to represent characters in the trees using only a single
byte per character when working with compatible strings. This may reduce the
memory used for character data by up to 50%.
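
As a rough illustration of the idea, here is a minimal sketch of a `CharSequence` backed by a `byte[]` instead of a `char[]`. It assumes that "compatible" means every character has a code point of at most 0xFF (ISO-8859-1). The class name `ByteBackedCharSequence` and its methods are hypothetical, chosen for this example only; this is not the library's implementation.

```java
// Illustrative sketch only: ByteBackedCharSequence is a hypothetical name,
// not a class from concurrent-trees.
final class ByteBackedCharSequence implements CharSequence {

    private final byte[] bytes; // one byte per character
    private final int start;    // inclusive offset into bytes
    private final int end;      // exclusive offset into bytes

    private ByteBackedCharSequence(byte[] bytes, int start, int end) {
        this.bytes = bytes;
        this.start = start;
        this.end = end;
    }

    /** Returns true if every character fits in a single byte (code point <= 0xFF). */
    static boolean isCompatible(CharSequence s) {
        for (int i = 0; i < s.length(); i++) {
            if (s.charAt(i) > 0xFF) {
                return false;
            }
        }
        return true;
    }

    /** Encodes a compatible string using one byte per character. */
    static ByteBackedCharSequence of(String s) {
        if (!isCompatible(s)) {
            throw new IllegalArgumentException(
                    "Not representable with one byte per character: " + s);
        }
        // ISO-8859-1 maps U+0000..U+00FF directly to the byte values 0..255.
        byte[] encoded = s.getBytes(java.nio.charset.StandardCharsets.ISO_8859_1);
        return new ByteBackedCharSequence(encoded, 0, encoded.length);
    }

    /** Number of bytes used for character data in this sequence. */
    int byteCount() {
        return end - start;
    }

    @Override
    public int length() {
        return end - start;
    }

    @Override
    public char charAt(int index) {
        if (index < 0 || index >= length()) {
            throw new IndexOutOfBoundsException("index: " + index);
        }
        // Mask to undo sign extension: Java bytes are signed.
        return (char) (bytes[start + index] & 0xFF);
    }

    @Override
    public CharSequence subSequence(int from, int to) {
        if (from < 0 || to > length() || from > to) {
            throw new IndexOutOfBoundsException("from: " + from + ", to: " + to);
        }
        // Shares the backing array rather than copying it.
        return new ByteBackedCharSequence(bytes, start + from, start + to);
    }

    @Override
    public String toString() {
        return new String(bytes, start, end - start,
                java.nio.charset.StandardCharsets.ISO_8859_1);
    }
}
```

For a four-character string such as "café", this sketch stores 4 bytes of character data where a `char[]` would store 8. Strings containing characters above 0xFF are rejected rather than silently corrupted, which is why the compatibility check runs before encoding.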

Original issue reported on code.google.com by ni...@npgall.com on 20 Oct 2013 at 10:20

GoogleCodeExporter commented 9 years ago
Support for single-byte encodings was added in release 2.3.0.

Original comment by ni...@npgall.com on 14 Nov 2013 at 12:00