In the following output, reported rate is close to 4k rows/s but the actual rate is half that.
scylla-bench executes requests concurrently according to the configured concurrency and emits a partial result from each concurrent fiber if more than a second has elapsed. Those partial results are then summed up together and assumed to have been produced over a period of a second. When latency is high, the period is actually larger than a second, and actual rate lower.
In the following output, reported rate is close to 4k rows/s but the actual rate is half that.
scylla-bench executes requests concurrently according to the configured concurrency and emits a partial result from each concurrent fiber if more than a second has elapsed. Those partial results are then summed up together and assumed to have been produced over a period of a second. When latency is high, the period is actually larger than a second, and actual rate lower.