Closed bigsheeper closed 2 months ago
we need to make sure there is no duplicate pK in every segment.
This is done by:
we need to make sure there is no duplicate pK in every segment.
This is done by:
- Temporarily -> mask entry duplicated PK as delete
- Final fix -> when segment is flushed, do an analyze task tor reorder
@zhagnlu will help implement feature 1.
/assign @zhagnlu
Is there an existing issue for this?
Environment
Current Behavior
In segcore, once the number of query results reaches the required number, it returns:
So in the scenario with duplicate pks, the results obtained may include duplicates. We need to deduplicate pks in segcore reduce, similar to internal reduce and global reduce.
This issue exists for both growing segments and sealed segments.
Expected Behavior
No response
Steps To Reproduce
No response
Milvus Log
No response
Anything else?
No response