Weaviate Benchmarking

This repo contains a tool for benchmarking Weaviate performance.

Documentation for benchmarker

📊 Results and context can be found in the Weaviate documentation.
💬 Discuss the results on our Slack channel or Twitter.

ANN benchmark

Running the benchmarks requires two components:

weaviate: the standard Weaviate image
benchmarker: a Go-based benchmarking tool

You can run both as containers on the same machine via Docker Compose.

To replicate our benchmarks, we recommend the following machine:

| Machine name | CPU type | CPUs | Memory | Disk size | Disk type | Misc. |
| --- | --- | --- | --- | --- | --- | --- |
| `n4-highmem-16` | N4 | 16 | 128 GB | 512 GB | Hyperdisk Balanced | Debian 12 (bookworm) with Docker and Compose V2 |

Run tests

Clone this repo and change into its directory:

git clone https://github.com/weaviate/weaviate-benchmarking && cd weaviate-benchmarking

Download the dataset files into a datasets folder:

mkdir datasets && \
    curl -o ./datasets/dbpedia-openai-1000k-angular.hdf5 https://storage.googleapis.com/ann-datasets/ann-benchmarks/dbpedia-openai-1000k-angular.hdf5 && \
    curl -o ./datasets/snowflake-msmarco-arctic-embed-m-v1.5-angular.hdf5 https://storage.googleapis.com/ann-datasets/custom/snowflake-msmarco-arctic-embed-m-v1.5-angular.hdf5 && \
    curl -o ./datasets/sift-128-euclidean.hdf5 http://ann-benchmarks.com/sift-128-euclidean.hdf5 && \
    curl -o ./datasets/sphere-10M-meta-dpr.hdf5 https://storage.googleapis.com/ann-datasets/custom/sphere-10M-meta-dpr.hdf5
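Because the downloads are chained with &&, a failed transfer stops the chain and can leave files missing or empty. As a minimal sketch (the function name missing_datasets is hypothetical and not part of this repo), the following checks that each of the four files named above exists and is non-empty:

```shell
#!/bin/sh
# Sketch: print the name of every expected dataset file that is missing or empty.
# The file names match the curl commands above; missing_datasets is a
# hypothetical helper, not something shipped with the benchmarker.
missing_datasets() {
    dir="${1:-./datasets}"
    for name in \
        dbpedia-openai-1000k-angular.hdf5 \
        snowflake-msmarco-arctic-embed-m-v1.5-angular.hdf5 \
        sift-128-euclidean.hdf5 \
        sphere-10M-meta-dpr.hdf5
    do
        # -s is true only when the file exists and its size is greater than zero
        [ -s "$dir/$name" ] || echo "$name"
    done
}
```

Calling missing_datasets ./datasets prints nothing when all four files are present. It only checks existence and size, so a truncated download would still pass.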

Run a single performance test on an ann-benchmarks HDF5 dataset:

DATASET=./datasets/dbpedia-openai-1000k-angular.hdf5 DISTANCE=cosine docker compose up --abort-on-container-exit
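The DISTANCE value has to match the metric the dataset was built for. The sketch below guesses it from the ann-benchmarks file-name suffix and prints the command without running it. Both function names are hypothetical. The mapping of -euclidean to l2-squared uses Weaviate's name for squared Euclidean distance, but whether this compose setup accepts that value is an assumption to verify against the help output. Files without a recognised suffix, such as sphere-10M-meta-dpr.hdf5, are left for you to set by hand.

```shell
#!/bin/sh
# Sketch: infer DISTANCE from an ann-benchmarks style file name.
# distance_for and print_run_command are hypothetical helpers.
distance_for() {
    case "$1" in
        *-angular.hdf5)   echo "cosine" ;;      # matches the example command above
        *-euclidean.hdf5) echo "l2-squared" ;;  # assumption: Weaviate's metric name is accepted here
        *)                return 1 ;;           # no suffix to go on; do not guess
    esac
}

# Print (do not execute) the compose command for one dataset file.
print_run_command() {
    dataset="$1"
    if distance=$(distance_for "$dataset"); then
        echo "DATASET=$dataset DISTANCE=$distance docker compose up --abort-on-container-exit"
    else
        echo "no distance inferred for $dataset; set DISTANCE by hand" >&2
        return 1
    fi
}
```

Printing the command rather than running it lets you review the inferred metric before starting a long benchmark.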

For additional configuration options, see the benchmarker's help output:

docker compose run benchmarker /app/benchmarker ann-benchmark -h
