-
Hello,
Thanks for the great package!
I'd like to run multi-GPU parallel sweeps. I have 4 GPUs and I'd like to run a sweep over, say, 16 configs. I have this code:
```python
wandb.require("core"…
-
PyTorch 1.2.0, two GPU cards. Error message:
`terminate called after throwing an instance of 'c10::Error'
what(): CUDA error: driver shutting down `
render_thread and gpu model might have something…
-
Is parallel GPU training support possible? We would like to try this with a fairly large (multi-GB) dataset, but to make training time reasonable it would need to be done in parallel. Single node pa…
-
Is GPU supported when creating a new VM instance?
-
We are unable to spin up Paperspace instances such as a `P4000`, a `GPU+`, or an `A4000`.
```sh
$ sky launch -c mycluster hello_sky.yaml
Task from YAML spec: hello_sky.yaml
I 09-25 16:26:09 optimi…
-
### What happened + What you expected to happen
1. This template works on NVIDIA A10 GPUs on AWS (g5.xlarge instances): https://github.com/ray-project/ray/blob/master/python/ray/autoscaler/aws/exampl…
-
### What is the issue?
On Linux, I use the following command to start the Ollama server:
CUDA_VISIBLE_DEVICES=1,2,3,4,5 OLLAMA_MAX_LOADED_MODELS=5 ./ollama-linux-amd64 serve&
Then I want to run several py…
-
1>D:\Dev\caffe-master-gpu\include\caffe/util/gpu_util.cuh(28): error : identifier "__longlong_as_double" is undefined
1>D:\Dev\caffe-master-gpu\include\caffe/util/gpu_util.cuh(28): error : identifi…
-
I'm not sure whether this is the correct place to report this; please direct me elsewhere if it is not.
Goal:
I want to have EKS cluster with working observability, Bottlerocket AMI and GPU-node…
-
### 🐛 Describe the bug
## Description
When using PyTorch 2.x with multiprocessing (`torch.multiprocessing.Pool`), there is a significant performance degradation (100x) compared to PyTorch 1.13.1 whe…