illuin-tech / vidore-benchmark

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
https://huggingface.co/vidore
MIT License
135 stars 15 forks source link

Reduce RAM high usage during evaluation #56

Closed ManuelFay closed 3 weeks ago

ManuelFay commented 3 weeks ago

Description

The current implementation loads the whole dataset in the RAM during evaluation, which causes unneeded high RAM spikes. This PR reuses the batching strategy already used during inference in a pre-batching step to drastically reduce the RAM usage.

Features

Changed

Fixed

Test

E2E tested with:

vidore-benchmark evaluate-retriever \
    --model-class siglip \
    --model-name google/siglip-so400m-patch14-384 \
    --dataset-name vidore/shiftproject_test \
    --split test