tjake / Jlama

Jlama is a modern LLM inference engine for Java
Apache License 2.0
656 stars, 60 forks

Gemm support for batch processing #30

Closed by tjake 5 months ago