c0sogi / llama-api

An OpenAI-like LLaMA inference API
MIT License
111 stars, 9 forks

Dev update (23.8.17.) #4

Closed by c0sogi 1 year ago

c0sogi commented 1 year ago

🚀 This PR introduces a series of improvements aimed at enhancing user experience and refining the codebase. Here's a breakdown of the changes:


🌟 1. Exllama Module - LoRA Integration


🔗 2. OpenAI Logit Bias Support


⚖ 3. Optimized Worker Load Balancing


📜 4. Enhanced Logging Mechanism


🔥 5. Docker Image Upgrades