This repo contains a tool for benchmarking Weaviate performance.
- 📊 results and context can be found in the Weaviate documentation
- 💬 discuss the results on our Slack channel or Twitter
There are two components you will need to run for the benchmarks:
- weaviate: the standard Weaviate image
- benchmarker: a Go-based benchmarking tool
You can run both as containers on the same machine via Docker Compose.
To replicate our benchmarks, we recommend using the following machine configuration:
Machine name | CPU type | CPUs | Memory | Disk size | Disk type | Misc. |
---|---|---|---|---|---|---|
n4-highmem-16 | N4 | 16 | 128GB | 512GB | Hyperdisk Balanced | Debian 12 (bookworm) with Docker and Compose V2 |
Clone this repo and cd into it:
git clone https://github.com/weaviate/weaviate-benchmarking && cd weaviate-benchmarking
Download the files into a datasets folder as outlined below.
mkdir datasets && \
curl -o ./datasets/dbpedia-openai-1000k-angular.hdf5 https://storage.googleapis.com/ann-datasets/ann-benchmarks/dbpedia-openai-1000k-angular.hdf5 && \
curl -o ./datasets/snowflake-msmarco-arctic-embed-m-v1.5-angular.hdf5 https://storage.googleapis.com/ann-datasets/custom/snowflake-msmarco-arctic-embed-m-v1.5-angular.hdf5 && \
curl -o ./datasets/sift-128-euclidean.hdf5 http://ann-benchmarks.com/sift-128-euclidean.hdf5 && \
curl -o ./datasets/sphere-10M-meta-dpr.hdf5 https://storage.googleapis.com/ann-datasets/custom/sphere-10M-meta-dpr.hdf5
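Because curl saves whatever the server returns, a failed download can leave an error page under a .hdf5 name. The sketch below is one way to sanity-check the files before benchmarking. `is_hdf5` is a hypothetical helper, not part of this repo, and it only checks that each file begins with the 8-byte HDF5 signature, not that the contents are complete.

```shell
# Hypothetical helper (not part of this repo): succeeds only if the file
# exists and starts with the 8-byte HDF5 signature (\x89 H D F \r \n \x1a \n).
is_hdf5() {
  [ -f "$1" ] || return 1
  sig=$(head -c 8 "$1" | od -An -tx1 | tr -d ' \n')
  [ "$sig" = "894844460d0a1a0a" ]
}

# Report the status of every file in the datasets folder.
for f in ./datasets/*.hdf5; do
  if is_hdf5 "$f"; then
    echo "ok: $f"
  else
    echo "not a valid HDF5 file: $f"
  fi
done
```

Any file reported as invalid is worth downloading again before running a benchmark against it.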
Run a single performance test on an ann-benchmarks HDF5 dataset:
DATASET=./datasets/dbpedia-openai-1000k-angular.hdf5 DISTANCE=cosine docker compose up --abort-on-container-exit
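To run the same test over several downloaded datasets, the command above can be wrapped in a loop. This is a minimal sketch with several assumptions: `distance_for` and the `RUN` switch are hypothetical names introduced here, and the only mapping it knows is that files ending in -angular use DISTANCE=cosine, as in the command above. Datasets with any other suffix are skipped because this README does not state their distance metric.

```shell
# Hypothetical helper: infer the DISTANCE value from the dataset file name.
# Assumption: "-angular" datasets use cosine, matching the command above.
distance_for() {
  case "$1" in
    *-angular.hdf5) echo "cosine" ;;
    *) return 1 ;;
  esac
}

# RUN is a hypothetical switch for this sketch: by default the loop only
# prints each command; set RUN=1 to actually start the containers.
for f in ./datasets/*.hdf5; do
  if ! d=$(distance_for "$f"); then
    echo "skipping $f: distance metric not known"
    continue
  fi
  echo "DATASET=$f DISTANCE=$d docker compose up --abort-on-container-exit"
  if [ "${RUN:-0}" = "1" ]; then
    DATASET="$f" DISTANCE="$d" docker compose up --abort-on-container-exit
  fi
done
```

Printing the commands first makes it easy to confirm the dataset and distance pairs before committing to a long run.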
For details on additional configuration options, see the benchmarker's help output:
docker compose run benchmarker /app/benchmarker ann-benchmark -h