bentoml / llm-bench

28 stars 3 forks source link

BentoCloud Benchmark Client

Usage

1. Set up environment variables

Make sure you have logged into Huggingface

huggingface-cli login

Set environment variables for benchmarking

export BASE_URL=<BentoCloud Service URL>
export SYSTEM_PROMPT=1      // 1 or 0

2. Run benchmark

python benchmark.py --max_users 10 --session_time 300 --ping_correction