Closed Matthieu-Tinycoaching closed 1 year ago
We're currently working on these types of measurements. We'll publish some benchmarks in a blog, hopefully soon. Thanks for the question!
I publish a blog, benchmarking DPR and ColBERT+PLAID on CPU; have a look, you may find it interesting.
Hi,
Have you load tested fastRAG pipeline with Colbert, PLAID and FiD on CPU instance?
Could you provide examples of latency, RPS relatively to CPU instance characteristics?