milvus-io / milvus

A cloud-native vector database, storage for next generation AI applications
https://milvus.io
Apache License 2.0
28.03k stars 2.7k forks source link

[Bug]: milvus: error while loading shared libraries: libmilvus_common.so #34180

Open trustnote-wang opened 3 days ago

trustnote-wang commented 3 days ago

Is there an existing issue for this?

Environment

- Milvus version: 2.4.4-gpu
- Deployment mode(standalone or cluster): cluster
- MQ type(rocksmq, pulsar or kafka): kafka
- SDK version(e.g. pymilvus v2.0.0rc2): 
- OS(Ubuntu or CentOS): centos
- CPU/Memory: 
        resources:
          limits:
            cpu: "2"
            memory: 5Gi
            tencent.com/vcuda-core: "200"
            tencent.com/vcuda-memory: "640"
          requests:
            cpu: "2"
            memory: 2Gi
- GPU: 
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.129.03             Driver Version: 535.129.03   CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|=========================================+======================+======================|
|   0  NVIDIA A800-SXM4-80GB          Off | 00000000:10:00.0 Off |                    0 |
| N/A   37C    P0              66W / 400W |      2MiB / 81920MiB |      0%      Default |
|                                         |                      |             Disabled |
+-----------------------------------------+----------------------+----------------------+
|   1  NVIDIA A800-SXM4-80GB          Off | 00000000:16:00.0 Off |                    0 |
| N/A   34C    P0              62W / 400W |      2MiB / 81920MiB |      0%      Default |
|                                         |                      |             Disabled |
+-----------------------------------------+----------------------+----------------------+
|   2  NVIDIA A800-SXM4-80GB          Off | 00000000:49:00.0 Off |                    0 |
| N/A   34C    P0              63W / 400W |      2MiB / 81920MiB |      0%      Default |
|                                         |                      |             Disabled |
+-----------------------------------------+----------------------+----------------------+
|   3  NVIDIA A800-SXM4-80GB          Off | 00000000:4D:00.0 Off |                    0 |
| N/A   35C    P0              61W / 400W |      2MiB / 81920MiB |      0%      Default |
|                                         |                      |             Disabled |
+-----------------------------------------+----------------------+----------------------+
|   4  NVIDIA A800-SXM4-80GB          Off | 00000000:89:00.0 Off |                    0 |
| N/A   36C    P0              62W / 400W |      2MiB / 81920MiB |      0%      Default |
|                                         |                      |             Disabled |
+-----------------------------------------+----------------------+----------------------+
|   5  NVIDIA A800-SXM4-80GB          Off | 00000000:8E:00.0 Off |                    0 |
| N/A   38C    P0              65W / 400W |      2MiB / 81920MiB |      0%      Default |
|                                         |                      |             Disabled |
+-----------------------------------------+----------------------+----------------------+
|   6  NVIDIA A800-SXM4-80GB          Off | 00000000:C5:00.0 Off |                    0 |
| N/A   35C    P0              64W / 400W |      2MiB / 81920MiB |      0%      Default |
|                                         |                      |             Disabled |
+-----------------------------------------+----------------------+----------------------+
|   7  NVIDIA A800-SXM4-80GB          Off | 00000000:C9:00.0 Off |                    0 |
| N/A   37C    P0              59W / 400W |      2MiB / 81920MiB |      0%      Default |
|                                         |                      |             Disabled |
+-----------------------------------------+----------------------+----------------------+

+---------------------------------------------------------------------------------------+
| Processes:                                                                            |
|  GPU   GI   CI        PID   Type   Process name                            GPU Memory |
|        ID   ID                                                             Usage      |
|=======================================================================================|
|  No running processes found                                                           |
+---------------------------------------------------------------------------------------+
- Others:

Current Behavior

[root@qdetest01v ~]# kubectl logs -f milvus-prod-indexnode-589b665c87-sf75p -n milvus-prod milvus: error while loading shared libraries: libmilvus_common.so: cannot open shared object file: No such file or directory [root@qdetest01v ~]#

indexnode unable to start,help me please!

Expected Behavior

No response

Steps To Reproduce

No response

Milvus Log

No response

Anything else?

No response

yanliang567 commented 2 days ago

/assign @foxspy /unassign