zerodha / dungbeetle

A distributed job server built specifically for queuing and executing heavy SQL read jobs asynchronously. Separate out reporting layer from apps. MySQL, Postgres, ClickHouse.
MIT License

[Design Feature Request for performance] on-demand scaling of workers with redundancy #47

Closed RohitKumarGit closed 1 month ago

RohitKumarGit commented 1 month ago

Hi @knadh. While studying the design of dungbeetle, I came across a scenario where, as I understand it, the design might hit a bottleneck.

As I understand it, a worker fetches one job from the queue and writes the result to the primary_db and the result_db (the latter for frequent reads, with results removed after a time-to-live). The result_db is a single database, chosen randomly from the list declared in the config file when the process starts.

Now consider the worst-case like this

  1. N requests come in, much higher than anticipated.
  2. This produces a large number of operations on the result_db.
  3. Because of this, even the result_db starts becoming a bottleneck.
  4. Writes to the DB may then slow down (I assume the result DB is not of very large capacity), degrading overall performance.

In short, the result_db is effectively a single point of failure, irrespective of the number of workers we have.

Proposed Solution

  1. The broker selects one result_db at startup.
  2. When a result_db encounters some kind of congestion, it feeds this back to the broker.
  3. Once the feedback crosses a threshold, the broker runs a health check on its other result_db options and promotes one as the new primary result database.
  4. The broker then writes fresh job IDs to this new primary result database.

If we implement this feedback on the broker, then autoscalers like the one in Kubernetes could use it to scale our workers on a particular queue. For this, we would need a way to accept new workers and create their task tables, similar to how Red Hat OpenShift does this for its worker nodes.

knadh commented 1 month ago

DB load balancing is a very complex task and it is not viable (or necessary) to add it to the core.

This scenario can be solved easily with many approaches.

a) Launch multiple DungBeetle instances that speak to different result DBs and load/queue balance across them.

b) Put a robust DB load balancing solution between DungBeetle and the results DB, eg: pgBouncer.
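Approach (a) can be done entirely client-side. A minimal sketch of round-robin queue balancing across instances, assuming hypothetical instance URLs (each instance configured with its own result DB in its config file):

```go
package main

import "fmt"

// balancer distributes job submissions round-robin across several
// DungBeetle instances. The URLs below are illustrative placeholders.
type balancer struct {
	instances []string
	next      int
}

// pick returns the next instance in rotation; a real client would
// POST the job request to this instance's HTTP API.
func (b *balancer) pick() string {
	url := b.instances[b.next]
	b.next = (b.next + 1) % len(b.instances)
	return url
}

func main() {
	b := &balancer{instances: []string{
		"http://dungbeetle-1:6060", // instance configured with result DB A
		"http://dungbeetle-2:6060", // instance configured with result DB B
	}}
	for i := 0; i < 4; i++ {
		fmt.Println("submit job to:", b.pick())
	}
}
```

Because each instance writes to a different result DB, write load on any single results database is bounded by the share of jobs routed to it, without adding failover logic to the core.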