sul-dlss-deprecated / dor_indexing_app

An indexing API for Stanford's Digital Object Repository
https://sul-dlss-deprecated.github.io/dor_indexing_app/
Apache License 2.0
0 stars 2 forks source link

Change rolling indexer looping strategy #1086

Closed justinlittman closed 8 months ago

justinlittman commented 8 months ago

Currently, we retrieve 500 druids from Solr of the least recently updated items, build the items, update Solr, and wait for Solr to commit. The reason for the wait for the commit is so that we the next 500 druids are retrieved, they don't include the just updated items.

In the new approach, we will retrieve a large number of druids of the least recently updated items, split them in batches of 500, build the items in each batch and post to Solr. Only the last batch will wait for Solr to commit.

This eliminates the wait after every batch.