c0sogi / llama-api

An OpenAI-like LLaMA inference API
MIT License
111 stars 9 forks source link

Support min_p sampler #25

Open atisharma opened 10 months ago

atisharma commented 10 months ago

Support min_p sampler, which is implemented in ExLlamav2.-