containers / ramalama

The goal of RamaLama is to make working with AI boring.
MIT License
284 stars 49 forks source link

Mount model. car volumes into container #396

Closed rhatdan closed 4 weeks ago

rhatdan commented 4 weeks ago

This is a redesign of run and serve to not run RamaLama within the container but only the AI Model Runtime llama.cpp or vllm.

This should simplify the operation, although potentially causes us risk with versions of Python on the host.