matrixorigin / matrixone

Hyperconverged cloud-edge native database
https://docs.matrixorigin.cn/en
Apache License 2.0
1.79k stars 277 forks source link

[Bug]: CN scale out takes effect in about 10s #9782

Open dengn opened 1 year ago

dengn commented 1 year ago

Is there an existing issue for the same bug?

Environment

- Version or commit-id (e.g. v0.1.0 or 8b23a93): nightly-e033a55
- Hardware parameters:
- OS type:
- Others:

Actual Behavior

I have a cluster with 1 CN replica at first, then I set this CN replica to 2. I connect MO by proxy address, and run show backend servers to watch running CN numbers.

It appears that the CN scale out becomes effective after 10s.

企业微信截图_e04599d1-a141-4b44-88c9-4ca41ca0e2d4 企业微信截图_7374b162-4ac9-4b1d-82eb-ddcb28d0691b

This behavior will affect the speed of the cluster scaling. Every time a CN scaling needs 10 seconds+, user will experience quite a delay of 10 seconds+. We need to find a way to reduce this delay.

Expected Behavior

No response

Steps to Reproduce

No response

Additional information

No response

volgariver6 commented 11 months ago

no plan for this issue, move to next phase plz

dengn commented 7 months ago

@aylei please check the current status for this issue.

aylei commented 7 months ago

Proxy should support watch-based CN status refresh. Currently, CN status refresh is poll-based, if a CN is labeled to a CN group, proxy can only discover the change til next poll.

aressu1985 commented 5 months ago

s1级别的问题,delay到1.3.0在做跟踪

volgariver6 commented 3 weeks ago

done