Swap in a new full index

pulibrary / dpul-collections

An inspiring environment for global communities to engage with diverse digital collections

1 stars 0 forks source link

Acceptance criteria

[ ] dpul-c contains tasks we can run to create a new collection and initiate indexing to it while the old collection continues to index from our pipeline and serve queries.

[ ] dpul-c has a documented way to figure out when it's time to swap indexes. It's okay if this involves some sample queries to run on the solr box.

[ ] dpul-c has a task we can run to swap in the new collection. If possible, automate this.

[ ] dpul-c has a task we can run to delete the old collection.

The indexing pipeline needs to know its cache version and the name of the collection it's writing to. We'll use the collection alias for reads. This way we can write to two different collections at once (using their actual names) while allowing reads to switch via a task (by moving the alias via the solr api).

So starting a new indexing pipeline with a new cache version will mean updating configuration and deploying. application code should start a broadway pipeline for each entry in a list of {cache version, index collection} tuples or what have you. First step in starting the pipeline should be to create the collection if it doesn't exist.* If code ever needs to know which collection is active it can ask solr to resolve the alias (likely will need to know this when checking whether to swap). Code can automate swapping the index by periodically seeking to swap to the collection fed by the pipeline with the highest cache_version.

Cleanup would then be driven by a human. Update configs to remove the old pipeline / collection name. maybe run a task that deletes the old collection (and the database entries for the old pipeline, as well?) code can deduce old collections / cache entries by checking configured cache version values and collection names.

* maybe even create the alias if it doesn't exist yet, for bootstrapping a new environment entirely.

pulibrary / dpul-collections

Swap in a new full index #104

Acceptance criteria

First Step