Reduce created segments when there is low traffic on a shard

opensearch-project / OpenSearch

🔎 Open source distributed and RESTful search engine.

https://opensearch.org/docs/latest/opensearch/index/

Apache License 2.0

9.39k stars 1.72k forks source link

Reduce created segments when there is low traffic on a shard #618

Open itiyamas opened 3 years ago

itiyamas commented 3 years ago

Lucene creates 1 segment per active concurrent thread per shard. Number of active concurrent threads per shard is determined by bulk thread pool in ES, which is a worker on the queue holding sub bulk requests. Each sub-bulk request ends up being picked by a different thread, hence resulting in multiple segments during refresh, resulting in better performance during reads. When there is a low traffic on a particular shard on a node, we can potentially reduce the number of created segments by co-alescing bulk requests.

When there are lot of shards on a single node for different indices, this problem may aggravate further.

The proposal is to optimize this entire process is via the following tasks:

[ ] Pipeline the CPU work into different tasks further (parsing, translog, lucene add) per shard
[ ] Coalesce the lucene documents across different bulk requests by adding a separate queue for lucene adds.

andrross commented 2 years ago

Can you more clearly define the problem that is solved by reducing the number of segments that are created on low traffic shards? What is the use case that optimization is targeting?

anasalkouz commented 2 years ago

Closing it since we didn't receive a response for a while. @itiyamas feel free to reopen once you have more details about it.

itiyama commented 1 year ago

@anasalkouz I do not have permissions to re-open the thread. Can you please open this?

Can you more clearly define the problem that is solved by reducing the number of segments that are created on low traffic shards? What is the use case that optimization is targeting?

In order to understand the use case, you need to understand how lucene parallelizes data writes across multiple segments within the same shard. Whenever data is to be written to Lucene buffer, it tries to get a buffer object from a list of pending buffer objects and ties it to the thread. If it does not find one, a new buffer object is created and documents are added there. These buffer objects are then indepently flushed to a segment during refresh. If there are multiple concurrent threads writing data to Lucene, the system ends up creating more segments. These segments are then merged later which means that we end up spending more compute. Searches also do not work well on lot of small segments.

itiyama commented 1 year ago

@dreamer-89 Can you help re-open this?

dreamer-89 commented 1 year ago

@dreamer-89 Can you help re-open this?

Issue re-opened. @itiyama : Can you please share more details around the issue as requested previously.

msfroh commented 3 months ago

Fixed by https://github.com/apache/lucene/pull/921

itiyama commented 2 months ago

@msfroh The issue that you linked solves the problem of searching through more segments but utilizes more resources to do merging. The proposal here is to create fewer segments in the first place so that you do not spend resources on merging later.

msfroh commented 2 months ago

Ahh... okay. I guess we can reopen it.

It feels like a pretty low priority, though, since writing and then merging small segments will (or at least should?) have negligible impact on overall performance.

If/when we move to predominantly pull-based indexing, we can allocate indexing threads per node based on the number of pending documents. (If each shard writes with one thread or less, then each shard will only ever write one segment per flush.)