yugabyte / yugabyte-db

YugabyteDB - the cloud native distributed SQL database for mission-critical applications.
https://www.yugabyte.com
Other
9.06k stars 1.09k forks source link

[YSQL] ANALYZE may took too much time or even fail with RPC Timeout #21041

Open qvad opened 9 months ago

qvad commented 9 months ago

Jira Link: DB-10010

Description

It's occurred randomly in our QO tests on distributed clusters.

During test execution few tables are created and after that we do call ANALYZE on all of them. E.g. 20 ANALYZEs are successfully completed, on 21th table we've got RPC Timeout error:

91%|█████████ | 20/22 [00:32<00:02,  1.47s/it]
12:20:04  2024-02-13 08:19:56,757:ERROR: Perform RPC (request call id 17037) to 10.9.118.250:9100 timed out after 602.000s

During successfull runs, all 22 analyzes took ~40 seconds to complete.

Another case is when instead of 40 seconds, analyze step took 4-8 minutes.

Issue Type

kind/bug

Warning: Please confirm that this issue does not contain any sensitive information

rthallamko3 commented 5 months ago

cc @ttyusupov as he is looking at improving the speed of analyze.

ttyusupov commented 2 months ago

Should be improved by #23069 and #24113