LAION-AI / Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
https://open-assistant.io
Apache License 2.0
37.03k stars 3.23k forks source link

Add distributed testing for inference server #1621

Open jackapbutler opened 1 year ago

jackapbutler commented 1 year ago

Overview

We want to test the dockerised inference-server under different stress conditions such as;

  1. Load testing - handling many concurrent of users
  2. Latency testing - speed of response to users

This should inform changes to the inference server as it can help diagnose bottlenecks in the backend. It also gives us a better idea on the compute requirements for hosting a worker or inference server node in different conditions.

Tasks

Context

@yk suggested I work on this and I'm a research engineer at Faculty (https://faculty.ai/) committing almost full time to OS contributions for the foreseeable future

jackapbutler commented 1 year ago

self-assign

olliestanley commented 1 year ago

Hi @jackapbutler do you still have interest in working on this? Or shall I unassign you from all load testing issues?

jackapbutler commented 1 year ago

Hey @olliestanley, no sorry I won't be able to commit to this now.

olliestanley commented 1 year ago

No problem :)