multi-node Search Results

backube/volsync #1329

Multi-AZ Volume Node Affinity Conflict

**Describe the bug** There is a misalignment of volumes being provisioned in multi-AZ clusters. This causes volsync job-pods to be unscheduleable. On my non multi-AZ cluster, volsync pods are …

DreamingRaven updated 23 minutes ago

rootless-containers/usernetes #335

Usernetes with bypass4netns on multi-node

Hi @AkihiroSuda ! :wave: I want to introduce you to @lisejolicoeur, who has joined our team this summer (with @milroy) to work specifically on Usernetes networking! We are opening this issue to sh…

vsoch updated 3 days ago

comfyanonymous/ComfyUI #3851

Add multi-IO reroute nodes

## Use Case Make managing muti-node buses easier, for example: 1. image latent and VAE -- a 2-channel reroute bus; 2. positive and negative prompt -- a 2-channel (or 4-channel with L and G variants…

rhdunn updated 1 week ago

moment-timeseries-foundation-model/moment #24

GPU, single-node multi-GPU, multi-node multi-GPU support for…

It is not clear from the documentation and the sample code, if the forecast generation can be performed on a GPU, multiple GPUs, or multiple GPUs in multiple nodes. If this is the case, please add som…

ryuta-yoshimatsu updated 1 month ago

ymcui/Chinese-LLaMA-Alpaca-3 #79

multi-node inference for llama3 70b

### Check before submitting issues - [X] Make sure to pull the latest code, as some issues and bugs have been fixed. - [X] I have read the [Wiki](https://github.com/ymcui/Chinese-LLaMA-Alpaca-3/wiki)…

Abolfazl-kr updated 1 week ago

ChenghaoMou/text-dedup #92

Run MinHash dedup on Multi-Nodes

Hello there, First i would like to extend my many thanks to you for setting up this amazing repo ! I'm currently working on a project with the aim to release the largest clean arabic text datase…

alielfilali01 updated 2 weeks ago

Lightning-AI/litgpt #1474

Gradient Accumulation Step under Multi-node Pretaining

@awaelchli I found that in the `pretrain.py`, the accumulation steps are calculated based on global batch size, device number and micro batch size. This works fine under single-node setting, e.g. glo…

SHUMKASHUN updated 5 days ago

NVIDIA/nccl-tests #215

NCCL_ALGO on multi-node and multi-GPU

Hi. I have been running NCCL_TESTS on a multi-node, multi-GPU environment with NCCL 2.19.3-1 and OpenMPI 4.1.6. Each node has 4 NVIDIA V100 GPUs interconnected with NVLink and PCIe. 1. How is th…

MajidSalimi updated 1 month ago

pyg-team/pytorch_geometric #9464

Add multi node training guide for XPU device

### 📚 Describe the documentation issue Currently, [training_benchmark_xpu.py](https://github.com/pyg-team/pytorch_geometric/blob/master/benchmark/multi_gpu/training/training_benchmark_xpu.py) only su…

zhouyu5 updated 3 days ago

NVIDIA/TensorRT-LLM #1667

Multi-node inference: invalid device ordinal

### System Info NCCL version 2.19.3+cuda12.0 TensorRT-LLM version: 0.11.0.dev2024052100 Ubuntu 22.04 ### Who can help? @byshiue ### Information - [X] The official example scripts - [ ] My o…

thies1006 updated 1 week ago

1000+ results for multi-node

1000+ results
for multi-node