Closed michaelfeil closed 2 months ago
Example:
port=7997 model1=michaelfeil/bge-small-en-v1.5 model2=mixedbread-ai/mxbai-rerank-xsmall-v1 volume=$PWD/data docker run -it --gpus all \ -v $volume:/app/.cache \ -p $port:$port \ michaelf34/infinity:latest \ v2 \ --model-id $model1 \ --model-id $model2 \ --port $port
Have fun!
Looks great! Thanks for the heads up 😀
Closed for now, we currently assign one GPU per model by launching separate Infinity instances.
Example:
Have fun!