zilliztech / VectorDBBench

A Benchmark Tool for VectorDB
MIT License
522 stars 133 forks source link

Correcting SIFT datasize for capacity case #252

Closed devanshsa5 closed 9 months ago

devanshsa5 commented 9 months ago

The Capacity case mentioned in cases.py is fetching 500K docs with 128dims . There is no 100K dataset available for SIFT neither the case. class CapacityDim128(CapacityCase): case_id: CaseType = CaseType.CapacityDim128 dataset: DatasetManager = Dataset.SIFT.manager(500_000) name: str = "Capacity Test (128 Dim Repeated)" description: str = """This case tests the vector database's loading capacity by repeatedly inserting small-dimension vectors (SIFT 100K vectors, 128 dimensions) until it is fully loaded. Number of inserted vectors will be reported."""

devanshsa5 commented 9 months ago

/assign @XuanYang-cn

XuanYang-cn commented 9 months ago

@devanshsa5 Good catch!

sre-ci-robot commented 9 months ago

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: devanshsa5, sarthak-dua27, XuanYang-cn

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files: - ~~[OWNERS](https://github.com/zilliztech/VectorDBBench/blob/main/OWNERS)~~ [XuanYang-cn] Approvers can indicate their approval by writing `/approve` in a comment Approvers can cancel approval by writing `/approve cancel` in a comment