milvus-io / milvus

A cloud-native vector database, storage for next generation AI applications
https://milvus.io
Apache License 2.0

[Bug]: uneven datanode memory usage #36068

Open pycui opened 2 months ago

pycui commented 2 months ago

Is there an existing issue for this?

Environment

- Milvus version: 2.4.10
- Deployment mode(standalone or cluster): cluster
- MQ type(rocksmq, pulsar or kafka):    pulsar
- SDK version(e.g. pymilvus v2.0.0rc2): pymilvus
- OS(Ubuntu or CentOS): Ubuntu (k8s)
- CPU/Memory: 32core/512Gi for datanode
- GPU: n/a
- Others:

Current Behavior

One datanode's memory consumption increases very rapidly and always results in an OOM error. See screenshot: 7 of 8 datanodes are using very little memory, but one is using 330G+ after only 8 minutes since restart.

[Screenshot: 2024-09-06 at 3:00:10 AM]

I suspect this is due to a backlog caused by a period of heavy writes, but the memory usage is still unreasonable. The node is not recovering on its own (it keeps crash-looping); I had to increase the memory limit to 512G to resolve this. However, this is very fragile, so I would like better handling going forward.

Expected Behavior

No response

Steps To Reproduce

No response

Milvus Log

k logs my-release-2-milvus-datanode-dd47d5599-66h9s -p > ~/oom.txt
Error from server (BadRequest): previous terminated container "datanode" in pod "my-release-2-milvus-datanode-dd47d5599-66h9s" not found

However, there is no error in the log itself (the pod was killed externally by the OOM killer).

Anything else?

No response

yanliang567 commented 2 months ago

@pycui please share more info about what you did before the issue popped up: how much data, how many collections and entities you are running, and what the schema is. Also please attach all the Milvus pods' log files for investigation. /assign @pycui /unassign

pycui commented 2 months ago

1 collection, 750M rows at the time it happened. We were probably writing 5K rows/sec for a while before the issue. Unfortunately I cannot share the schema, but it's very simple (1024-dim vector, DiskANN index, a few scalar fields, no index on scalar fields, collection not loaded).

The datanode's log is already gone in k8s. Let me know which other specific pods' logs you want.

xiaofan-luan commented 2 months ago

If you have only one shard, then only one datanode can be used.

For 750 million rows, we recommend using 4-8 shards to start.
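For reference, here is a minimal pymilvus sketch of creating a collection with multiple shards; the collection name, field names, dimension, and shard count are illustrative, not taken from this issue:

```python
# Minimal sketch: create a collection with several shards so incoming writes
# are distributed across datanodes. All names and values are illustrative.
from pymilvus import connections, CollectionSchema, FieldSchema, DataType, Collection

connections.connect(host="localhost", port="19530")

fields = [
    FieldSchema(name="id", dtype=DataType.INT64, is_primary=True),
    FieldSchema(name="vector", dtype=DataType.FLOAT_VECTOR, dim=1024),
]
schema = CollectionSchema(fields, description="example collection")

# The shard count is fixed at creation time and cannot be changed afterwards.
collection = Collection(name="example_collection", schema=schema, shards_num=8)
```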

We recommend using bulk insert instead of streaming insertion when you want to import large collections.
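A minimal sketch of a bulk insert via pymilvus's utility API, assuming the data files have already been uploaded to the object storage bucket Milvus uses; the collection name and file path below are placeholders:

```python
# Minimal sketch: import pre-prepared files with bulk insert instead of
# streaming row-by-row inserts. The file path is a hypothetical object-storage
# path, not one from this issue.
from pymilvus import connections, utility

connections.connect(host="localhost", port="19530")

task_id = utility.do_bulk_insert(
    collection_name="example_collection",
    files=["bulk_data/data.json"],
)

# Poll the import task until it finishes.
state = utility.get_bulk_insert_state(task_id=task_id)
print(state.state_name, state.progress)
```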

There is no reason a datanode should use 100GB of memory or more. The largest cluster we manage runs datanodes with at most 16GB each and it works perfectly. You need to figure out where that memory is being used by running pprof.
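As a rough sketch of grabbing a heap profile for that, assuming the datanode exposes Go's standard pprof handlers on its metrics port (9091 in many deployments; the host and port below are placeholders for your environment):

```python
# Minimal sketch: download a heap profile from the datanode's pprof endpoint
# for offline analysis. Host/port are placeholders; the path follows Go's
# standard net/http/pprof layout.
import urllib.request

PPROF_URL = "http://datanode-host:9091/debug/pprof/heap"

with urllib.request.urlopen(PPROF_URL, timeout=30) as resp, open("heap.pb.gz", "wb") as out:
    out.write(resp.read())

# Inspect the profile with: go tool pprof -top heap.pb.gz
```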

xiaofan-luan commented 2 months ago

@pycui do you want to set up a meeting with me so we can learn more details about your use case and help?

Your use case seems to be an interesting one, and I'm sure we can help. My email is james.luan@zilliz.com

stale[bot] commented 3 weeks ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. Rotten issues close after 30d of inactivity. Reopen the issue with /reopen.