yashdavisgupta / Asperitas

clouds
0 stars 0 forks source link

Search Megaissue #7

Open YourFin opened 3 years ago

YourFin commented 3 years ago

Opening this issue as a discussion point for search and brain dumping ground.

Doing search at a half-decent clip probably means keeping an index of all the files capable of being served, and we should re-use that functionality for the directory browser. Keeping it up to date, however, will be annoying. We'll probably need to hook into the filesystem events during runtime, and re-validate the old cache on each startup. If we want to provide a truly excellent user experience we may also want to be able to gracefully degrade and verify that folder listings are correct manually per-request on startup until indexing is finished. But that's a whole other problem.

There are roughly two camps that I see wrt how to improve search beyond the braindead "does it match /.*${search_query}.*/i" approach: the fd/ivy/helm/fzf/ide "let's get clever about how we do partial character matches across the query string" approach that, for example, strongly matches "FeedMeSeymore" when the query is "memor feed", and the more google/elasticsearchy approach that tries to get more intelligent about acting on a higher level than individual characters.

Personal pet peeves to avoid:

Fun ideas:

Things to ponder:

yashdavisgupta commented 3 years ago

Thoughts in no particular order:

YourFin commented 3 years ago

For future reference: https://atg.netapp.com/wp-content/uploads/2012/12/FS_crawler_Bisson.pdf https://www.cs.nmsu.edu/~misra/papers/sc12paper.pdf