-
## Bug Description
I am running a distributed Linear model (20 parameters) across 2 GPU nodes, each node having 2 NVIDIA H100 NVL GPUs. The model uses the DDP parallelization strategy. I am generating…
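The report above is truncated; for orientation, a minimal sketch of the described setup follows, assuming a `torchrun` launch, synthetic data, and a 20-parameter `nn.Linear` model (19 weights + 1 bias). The script name and launch flags are illustrative, not taken from the report.

```python
# Minimal sketch (assumed details: torchrun launch, synthetic data, SGD).
# Run on each of the 2 nodes, e.g.:
#   torchrun --nnodes=2 --nproc_per_node=2 --node_rank=<0|1> \
#            --master_addr=<node0-ip> --master_port=29500 train.py
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    # 19 weights + 1 bias = 20 parameters.
    model = torch.nn.Linear(19, 1).cuda(local_rank)
    ddp_model = DDP(model, device_ids=[local_rank])

    opt = torch.optim.SGD(ddp_model.parameters(), lr=0.01)
    loss_fn = torch.nn.MSELoss()

    # Each rank generates its own synthetic shard.
    x = torch.randn(32, 19, device=local_rank)
    y = torch.randn(32, 1, device=local_rank)

    for _ in range(10):
        opt.zero_grad()
        loss = loss_fn(ddp_model(x), y)
        loss.backward()  # gradients are all-reduced across the 4 ranks
        opt.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```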
-
### Describe your problem
![image](https://github.com/user-attachments/assets/b3e4d2ed-b140-44bb-92c3-479c8d78008a)
![image](https://github.com/user-attachments/assets/456426c7-5d64-48a1-969d-d9453d…
-
I have a machine with 4 NVIDIA L40 GPUs. I am trying to use the full_finetune_distributed llama3_1/8B_full recipe. My dataset configuration in the config file is given below:
dataset:
_c…
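The dataset block above is cut off; for orientation only, a typical torchtune dataset entry follows the `_component_` instantiation pattern sketched below. The builder and flag shown here are illustrative placeholders, not the reporter's actual values.

```yaml
# Illustrative sketch only -- not the reporter's actual config.
dataset:
  _component_: torchtune.datasets.alpaca_dataset
  train_on_input: True
```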
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Is your feature request related to a problem? Please describe.
I would like to request two features that…
-
JIRA Issue: [KIEKER-564] Monitoring and analysis for large-scale distributed/cloud-based systems
Original Reporter: Andre van Hoorn
***
To be polished
Brief explanation:
(Max. 5-7 sentences)
As…
-
### Bug description
I am training a sample model that works on multiple GPUs as long as they are spread across nodes. But as soon as I allocate more than one GPU on a single node, it returns `[rank7]: torch.dist…
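The error message above is truncated; a minimal single-node repro for this kind of `torch.distributed`/NCCL failure might look like the sketch below. The launch command and rank count are assumptions based on the `[rank7]` prefix.

```python
# Hypothetical repro sketch -- assumes a torchrun launch such as:
#   torchrun --nnodes=1 --nproc_per_node=8 repro.py
import os
import torch
import torch.distributed as dist

dist.init_process_group(backend="nccl")
local_rank = int(os.environ["LOCAL_RANK"])
torch.cuda.set_device(local_rank)  # bind each process to its own GPU on the node

t = torch.ones(1, device=local_rank)
dist.all_reduce(t)  # collectives like this are where multi-GPU-per-node errors typically surface
print(f"rank {dist.get_rank()}: {t.item()}")

dist.destroy_process_group()
```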
-
See https://github.com/eclipse-hono/hono/issues/3425
-
# Summary
Some configuration options shouldn't be centrally managed, as different user groups (linked to projects) may want to define them on their own.
# Motivation
We should make the distinc…
-
#### Describe the bug
I'm evaluating mimir-distributed in high-availability mode to determine its reliability when one of the nodes is offline. Following a series of bring-up and bring-down operations, …
-
# Module Request
Note: Please try setting up a configuration yourself before raising an issue to request a configuration: ~~https://config.getamp.sh/~~
***There is a newer beta version available …