tjake / Jlama

Jlama is a modern LLM inference engine for Java
Apache License 2.0
669 stars 62 forks source link

Set the max tokens based on the model and fix temp for now #77

Closed tjake closed 1 month ago