milvus-io / milvus

A cloud-native vector database, storage for next generation AI applications
https://milvus.io
Apache License 2.0
27.29k stars 2.63k forks source link

[Bug]: collection not loaded, querynode restart, search/query delegator rpc error #32694

Open JamesBonddu opened 3 weeks ago

JamesBonddu commented 3 weeks ago

Is there an existing issue for this?

Environment

- Milvus version: 2.3.11
- Deployment mode(standalone or cluster): cluster
- MQ type(rocksmq, pulsar or kafka): pulsar   
- SDK version(e.g. pymilvus v2.0.0rc2): 2.3.6
- OS(Ubuntu or CentOS): Ubuntu20.0
- CPU/Memory: 521 core/1T
- GPU: 24G
- Others:

Current Behavior

image

image

image

milvus.md

Expected Behavior

No response

Steps To Reproduce

No response

Milvus Log

No response

Anything else?

No response

yanliang567 commented 3 weeks ago

@JamesBonddu please offer the full milvus pods logs for investigation. I looks like a known fix that was fixed in milvus 2.3.13, please retry on it. /assign @JamesBonddu /unassign

JamesBonddu commented 2 weeks ago

@JamesBonddu please offer the full milvus pods logs for investigation. I looks like a known fix that was fixed in milvus 2.3.13, please retry on it. /assign @JamesBonddu /unassign

i had updated milvus 2.4.1-gpu and still has error pulsar Too many requests to the same Bookie while reading.

docker-compose down 
docker-compose up -d

docker-compose.yaml.md

15ac012ed4afb9c1773ecb679088ef1

yanliang567 commented 2 weeks ago

/assign @LoveEachDay /unassign

yhmo commented 1 day ago

Link to https://github.com/milvus-io/milvus/issues/33171

xiaofan-luan commented 1 day ago

@JamesBonddu how many collections do you have and how many nodes? I don't this is likely related to a pulsar issue

xiaofan-luan commented 1 day ago
  1. when collection number is large, loading all collection from pulsar might be expensive. Solution:
  2. reduce collection number might help
  3. upgrade to 2.3.15 might help
  4. try to increase pulsar cpu and memory might help