edgi-govdata-archiving / web-monitoring-db

An HTTP API for tracking and annotating changes to a set of web pages.
https://api.monitoring.envirodatagov.org/
GNU General Public License v3.0

Add search index (and endpoint) for version content #362

Open Mr0grog opened 6 years ago

Mr0grog commented 6 years ago

Listening to analyst discussions, there are sometimes particular terms or changes (e.g. “state cooperation”) that it would be really helpful to find other instances of. It would be great for our API to support searching a text index of the content of each version (doubly awesome if we could also index just the text that changed in each version).

We can use existing differs/diff routines to extract that content, but then we need something to compile, store, and search an index based on it. And of course we need an API design for how to use it :)
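
To make the "compile, store, and search" part a bit more concrete, here is a rough sketch of a tiny in-memory inverted index with phrase search. Everything in it is an assumption for illustration: the `VersionTextIndex` class, its method names, and the `"v1"`/`"v2"` IDs are hypothetical and not part of web-monitoring-db, and it is written in Python only to keep it self-contained. A real implementation would more likely lean on a database's or search engine's full-text features than on anything hand-rolled like this.

```python
import re
from collections import defaultdict


class VersionTextIndex:
    """Hypothetical in-memory index mapping terms to version IDs."""

    def __init__(self):
        # term -> set of version IDs whose text contains that term
        self._postings = defaultdict(set)
        # version ID -> ordered list of tokens, kept so phrases can be verified
        self._tokens = {}

    @staticmethod
    def tokenize(text):
        # Lowercase and split on anything that is not a letter or digit.
        return re.findall(r"[a-z0-9]+", text.lower())

    def add(self, version_id, text):
        """Compile and store the terms for one version's extracted text."""
        tokens = self.tokenize(text)
        self._tokens[version_id] = tokens
        for token in tokens:
            self._postings[token].add(version_id)

    def search(self, phrase):
        """Return sorted IDs of versions containing the exact phrase."""
        terms = self.tokenize(phrase)
        if not terms:
            return []
        # Narrow to versions that contain every term at least once...
        candidates = set.intersection(
            *(self._postings.get(term, set()) for term in terms)
        )
        # ...then keep only those where the terms appear consecutively.
        n = len(terms)
        hits = []
        for version_id in candidates:
            tokens = self._tokens[version_id]
            if any(tokens[i:i + n] == terms for i in range(len(tokens) - n + 1)):
                hits.append(version_id)
        return sorted(hits)


index = VersionTextIndex()
index.add("v1", "EPA supports state cooperation on water quality.")
index.add("v2", "The state agency ended cooperation.")
print(index.search("state cooperation"))
```

Here only `"v1"` matches, because `"v2"` contains both words but not next to each other. The same structure could be fed the changed text produced by a differ instead of the full version text, which would give the "index of changed text" variant.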

stale[bot] commented 5 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in seven days if no further activity occurs. If it should not be closed, please comment! Thank you for your contributions.