jermp / sshash

A compressed, associative, exact, and weighted dictionary for k-mers.
MIT License
84 stars 17 forks

Permute and reverse-complement the strings to minimize the number of abundance runs #4

Closed by jermp 2 years ago

jermp commented 2 years ago

This command re-orders (and possibly reverse-complements) the strings in the collection to minimize the number of runs in the abundances and, hence, to optimize the encoding of the abundances. This is done by the program ./permute, which in turn uses the class defined in builder/cover.hpp to build the permutation.
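To illustrate why re-ordering and reverse-complementing can reduce the number of runs, here is a minimal sketch. It is not the algorithm in builder/cover.hpp. It assumes that each string is summarized by the abundances of its first and last k-mer, that reverse-complementing a string swaps those two values, and that a run continues across two consecutive strings exactly when the last abundance of one equals the first abundance of the next. All names (`EndAbundances`, `Placement`, `count_runs`, `count_boundary_breaks`, `greedy_chain`) are hypothetical, and the quadratic greedy chaining is only a stand-in for a real covering strategy.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical summary of one string: abundances of its first and last k-mer.
struct EndAbundances {
    uint64_t front;
    uint64_t back;
};

// Hypothetical output entry: which string to emit and in which orientation.
struct Placement {
    std::size_t id;
    bool reversed;  // true = emit the reverse complement (ends are swapped)
};

// Number of maximal runs of equal consecutive values in a sequence.
uint64_t count_runs(std::vector<uint64_t> const& abundances) {
    if (abundances.empty()) return 0;
    uint64_t runs = 1;
    for (std::size_t i = 1; i < abundances.size(); ++i) {
        if (abundances[i] != abundances[i - 1]) ++runs;
    }
    return runs;
}

// Number of string boundaries where the abundance changes, i.e. where a new
// run must start. Fewer breaks means fewer runs overall.
uint64_t count_boundary_breaks(std::vector<EndAbundances> const& strings,
                               std::vector<Placement> const& order) {
    uint64_t breaks = 0;
    for (std::size_t i = 1; i < order.size(); ++i) {
        Placement const& p = order[i - 1];
        Placement const& c = order[i];
        assert(p.id < strings.size() && c.id < strings.size());
        uint64_t tail = p.reversed ? strings[p.id].front : strings[p.id].back;
        uint64_t head = c.reversed ? strings[c.id].back : strings[c.id].front;
        if (tail != head) ++breaks;
    }
    return breaks;
}

// Greedy chaining (an assumption, not the repository's method): repeatedly
// append an unused string whose front, or back if reversed, matches the
// current tail; otherwise fall back to the first unused string, forward.
std::vector<Placement> greedy_chain(std::vector<EndAbundances> const& strings) {
    std::size_t const n = strings.size();
    std::vector<Placement> order;
    std::vector<bool> used(n, false);
    uint64_t tail = 0;
    for (std::size_t step = 0; step < n; ++step) {
        std::size_t pick = n;
        bool reversed = false;
        for (std::size_t i = 0; i < n; ++i) {
            if (used[i]) continue;
            if (pick == n) pick = i;  // fallback candidate
            if (step > 0 && strings[i].front == tail) {
                pick = i; reversed = false; break;
            }
            if (step > 0 && strings[i].back == tail) {
                pick = i; reversed = true; break;
            }
        }
        used[pick] = true;
        order.push_back({pick, reversed});
        tail = reversed ? strings[pick].front : strings[pick].back;
    }
    return order;
}
```

For example, with three strings whose (front, back) abundances are (1, 2), (3, 2), and (3, 4), the input order has two boundary breaks. Reverse-complementing the second string turns its ends into (2, 3), so the chain 1-2, 2-3, 3-4 has none.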