tjake / Jlama

Jlama is a modern LLM inference engine for Java
Apache License 2.0
499 stars 48 forks source link

Quantizations #5

Closed tjake closed 1 year ago