ukwa / webarchive-discovery

WARC and ARC indexing and discovery tools.
https://github.com/ukwa/webarchive-discovery/wiki

Use CommonGrams to speed up queries that contain stop words #293

Open anjackson opened 1 year ago

anjackson commented 1 year ago

Following HathiTrust's work, e.g.

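We could explore the same approach by adapting the definitions of the full-text fields to use the CommonGrams filters, which index common words together with their neighbours as bigrams. This should make some of our slowest queries, the ones that contain stop words, much faster.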
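To illustrate the idea, here is a rough Python sketch of what a CommonGrams-style filter does to a token stream at index time. It is not the Lucene/Solr implementation, just an approximation of the behaviour: every original token is kept, and a bigram is added wherever either token of an adjacent pair is a common word. The function name `common_grams`, the `_` separator and the example word list are assumptions made for illustration.

```python
def common_grams(tokens, common_words, sep="_"):
    """Emit each token, plus a bigram for every adjacent pair
    in which at least one token is a common word (illustrative sketch)."""
    out = []
    for i, tok in enumerate(tokens):
        # The original unigram is always kept.
        out.append(tok)
        if i + 1 < len(tokens):
            nxt = tokens[i + 1]
            # Only pairs that involve a common word become bigrams.
            if tok in common_words or nxt in common_words:
                out.append(tok + sep + nxt)
    return out


if __name__ == "__main__":
    # Hypothetical common-word list; a real one would be derived from the index.
    common = {"the", "is", "a"}
    print(common_grams("the quick brown is a fox".split(), common))
```

The point is that a phrase query containing stop words can then match on the much rarer bigram terms (e.g. `the_quick`) instead of walking the very long position lists of terms like `the`.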

EDIT: I'm not planning to pursue this right now, but wanted to capture all these details here for future reference.