-
I am trying to use both of my GPUs who are passed through to my docker container.
`services: faster-whisper-server-cuda: image: fedirz/faster-whisper-server:latest-cuda build: dockerfile: Dockerfil…
-
## Questions / Clarifications / Requests
I have been testing the Reservation plugin for a bit now but have a few questions on its functionality. I have looked through the available files on…
-
This issue appears to be the same as #871 and #872, but I wanted to provide my own detailed write up as a new issue.
I was running 2 Windows server VMs with DHCP and DNS services only. I'm a netwo…
-
I'm running docker and LLM-X in a container with the following compose.yml file:
```
version: '3.8'
services:
ollama:
image: ollama/ollama
runtime: nvidia
environment:
…
-
### Your current environment
Below is my current Docker Compose configuration:
```yaml
services:
vllm:
image: vllm/vllm-openai:v0.6.4
deploy:
resources:
reservation…
-
_Originally from #86_
> It should also be possible to query metal_devices by hardware_reservation_id.
-
Testflinger allows for a reservation to be created for a device. However, it is the necessary to poll the job and watch the output to know when the device is ready and how to connect to it. This reque…
-
I'm using this great program as a Docker container and I'm wondering if it would be possible to remove the Whisper model from the VRAM after a certain time so that it doesn't permanently block the VRA…
-
### NetBox version
v3.7.6
### Feature type
New functionality
### Proposed functionality
When reserving spaces within a rack for devices, the current system does not differentiate between the fron…
-
Hi All
First off, thank you for this awesome project. This seems like exactly what I am looking for to start learning the vast world of AI.
I am trying to run a docker AIO but with CPU support o…