seqan / seqan3

The modern C++ library for sequence analysis. Contains version 3 of the library and API docs.
https://www.seqan.de
Other
413 stars 82 forks source link

How to update an already built FM-Index #3114

Closed Wang-kaifei closed 1 year ago

Wang-kaifei commented 1 year ago

Dear developers,

I am currently learning to use SeqAn3 and I am curious if there is support for updating an FM-Index that has already been constructed? For example, I first constructed the FM-Index from a collection of strings, and then a new string appeared that needed to be added. Is there a way to update the FM-index on the original one?

Sincerely, Flora

SGSSGene commented 1 year ago

Hi Flora

Currently, there is no support to update an FM-Index in the SeqAn3 library. The only way to achieve an updated FM-Index is to completely reindex it.

In theory (not in our code), there exists ways to merge multiple BWT together, which would allow to skip the most expensive part of the reindexing, which is the construction of the BWT, but these haven't been explored by us, yet.

Wang-kaifei commented 1 year ago

I see, thanks a lot for your reply. : )

Hi Flora

Currently, there is no support to update an FM-Index in the SeqAn3 library. The only way to achieve an updated FM-Index is to completely reindex it.

In theory (not in our code), there exists ways to merge multiple BWT together, which would allow to skip the most expensive part of the reindexing, which is the construction of the BWT, but these haven't been explored by us, yet.