milvus-io / milvus

A cloud-native vector database, storage for next generation AI applications
https://milvus.io
Apache License 2.0
31.14k stars 2.95k forks source link

[Bug]: milvus didn't return the same number of records as parameter "limit" #36268

Open lukeisanniesdaddy opened 2 months ago

lukeisanniesdaddy commented 2 months ago

Is there an existing issue for this?

Environment

- Milvus version:
- Deployment mode(standalone or cluster):
- MQ type(rocksmq, pulsar or kafka):    
- SDK version(e.g. pymilvus v2.0.0rc2):
- OS(Ubuntu or CentOS): 
- CPU/Memory: 
- GPU: 
- Others:

Current Behavior

When I searched use limit=10, but only 6 records returned from milvus, there are thousands of records in milvus

Expected Behavior

No response

Steps To Reproduce

No response

Milvus Log

No response

Anything else?

No response

yanliang567 commented 2 months ago

@lukeisanniesdaddy please share more info about the issue you met

  1. milvus version, sdk version
  2. collection schema, index type and params, and reproduce code if convenient
  3. Milvus log files

/assign @lukeisanniesdaddy /unassign

xiaofan-luan commented 2 months ago

Is there an existing issue for this?

  • [x] I have searched the existing issues

Environment

- Milvus version:
- Deployment mode(standalone or cluster):
- MQ type(rocksmq, pulsar or kafka):    
- SDK version(e.g. pymilvus v2.0.0rc2):
- OS(Ubuntu or CentOS): 
- CPU/Memory: 
- GPU: 
- Others:

Current Behavior

When I searched use limit=10, but only 6 records returned from milvus, there are thousands of records in milvus

Expected Behavior

No response

Steps To Reproduce

No response

Milvus Log

No response

Anything else?

No response

usually means your pk is duplicated

lukeisanniesdaddy commented 1 month ago

Is there an existing issue for this?

  • [x] I have searched the existing issues

Environment

- Milvus version:
- Deployment mode(standalone or cluster):
- MQ type(rocksmq, pulsar or kafka):    
- SDK version(e.g. pymilvus v2.0.0rc2):
- OS(Ubuntu or CentOS): 
- CPU/Memory: 
- GPU: 
- Others:

Current Behavior

When I searched use limit=10, but only 6 records returned from milvus, there are thousands of records in milvus

Expected Behavior

No response

Steps To Reproduce

No response

Milvus Log

No response

Anything else?

No response

usually means your pk is duplicated

Thanks, the root cause was primary key duplicated. It's so strange that milvus allows primary key duplicate.

xiaofan-luan commented 1 month ago

Is there an existing issue for this?

  • [x] I have searched the existing issues

Environment

- Milvus version:
- Deployment mode(standalone or cluster):
- MQ type(rocksmq, pulsar or kafka):    
- SDK version(e.g. pymilvus v2.0.0rc2):
- OS(Ubuntu or CentOS): 
- CPU/Memory: 
- GPU: 
- Others:

Current Behavior

When I searched use limit=10, but only 6 records returned from milvus, there are thousands of records in milvus

Expected Behavior

No response

Steps To Reproduce

No response

Milvus Log

No response

Anything else?

No response

usually means your pk is duplicated

Thanks, the root cause was primary key duplicated. It's so strange that milvus allows primary key duplicate.

this is something we will handle in the future. A legacy issue but thanks for pointing this out

stale[bot] commented 15 hours ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. Rotten issues close after 30d of inactivity. Reopen the issue with /reopen.