yale-sys / prompt-cache

Modular and structured prompt caching for low-latency LLM inference
MIT License
14 stars 1 forks source link

Benchmark script #12

Closed shsym closed 8 months ago