c0sogi / llama-api

An OpenAI-like LLaMA inference API
MIT License
111 stars 9 forks source link

Dev update (23.8.22.) #5

Closed c0sogi closed 1 year ago

c0sogi commented 1 year ago

🚀 Overview This PR introduces a series of improvements aimed at enhancing user experience and refining the codebase. Here's a breakdown of the changes:


1. Optimized performance: llama.cpp & exllama


🌐 2. Tunnel through Cloudflare


⚙️ 3. CLI args Refinement


🐞 4. Bugfix: niceness of process


🔜 5. Enhancement: required option in function call schema