OndrejSladky / fmsi

FMSI is an exact f-masked-superstrings-based index for membership queries and set operations on k-mer sets.
MIT License
9 stars 2 forks source link

README includes info about component files stored. #37

Closed OndrejSladky closed 6 days ago

karel-brinda commented 6 days ago

"It then stores the index in files with the same prefix and the .fmsi extension." - this is a bit misleading – suggests an .fmsi file, but it's still a prefix.

It's a bit unclear why there three files for the bwt:

.ac_gt, .ac and .gt for storing the BWT

karel-brinda commented 6 days ago

Also, it's unclear, and should be explained, whether it's stored as the resulting wavelet tree, or the actual BWT (like eg in BWA). The same for mask.

OndrejSladky commented 6 days ago

Thanks, all of this should be resolved now.