metanorma / firelight

IN DEVELOPMENT: HTML frontend for Metanorma documents
1 stars 0 forks source link

Search performance & searching by longer words #14

Open strogonoff opened 2 days ago

strogonoff commented 2 days ago

Since implementing updated Japanese tokenizer (see https://github.com/riboseinc/lunr-repro/ & the accepted PR), and setting max n-gram size to 6, search index JSON size is pretty large (300 MB). N-gram 6 is not enough, because it limits searches to 6 characters max.

Following options are to be investigated:

ronaldtse commented 1 day ago

@strogonoff can you help provide instructions on how to run the application and load the data? Thanks.