edponce / FACET

Framework for Annotation and Concept Extraction in Text
Other
2 stars 0 forks source link

Simstring searches with range of feature sizes #16

Open edponce opened 4 years ago

edponce commented 4 years ago

Some Simstring implementations support searching for a n-gram in a range of feature sizes (i.e., Elasticsearch and RediSearch). These bulk searches reduce the database accesses at the expense of restructuring the Simstring algorithm for these cases.

edponce commented 4 years ago

Need to add batch support for Simstring matchers, first the database API needs to be extended to support batch reads. Most likely all feature sizes and features can be performed with a single database operation.