multi-node-communication Search Results

1000+ results
for multi-node-communication

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Har-Kuun/OneClickCDN #19

Features and Collaboration Proposals

If possible we can make a web version together. Just like Kangle, with commercial version features. Also includes a web console, multi-node functionality, and more. After reading your blog, u may be a…

imPrk0 updated 2 years ago
1
lucidrains/DALLE2-pytorch #156

Inefficiency Decoder Training with Multi-GPU on Several Node…

Hello everyone, training with multi-GPU on one node (machine) is normal, but multi-GPU on several nodes (machines) training is Inefficiency. By sorting out the problem, it is found that it has nothing…

1073521013 updated 2 years ago
2
thu-ml/tianshou #1172

Suggestion - Redesign RayEnvWorker for Improved Performance

- [x] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.") + [x] ne…

destin-v updated 1 month ago
14
kubeflow/training-operator #2047

Support MLX on Kubernetes with Kubeflow

MLX is a new ML framework specifically designed to run on Apple silicon: https://github.com/ml-explore/mlx It has some differences compare to PyTorch with `mps` backend: https://github.com/ml-explo…

andreyvelich updated 1 month ago
6
ct-js/ct-js #538

Migration to Neutralino.js

**Describe the current state of the problem** Nw.js is based on a seemingly good idea of merging node.js and webkit contexts, but, in hindsight, it is a terrible idea as new APIs come and nw.js start…

CosmoMyzrailGorynych updated 1 week ago
2
StanfordLegion/legion #967

Realm: all-to-all communication is slow

For the past few months I've been working on a program that needs all-to-all exchanges and Realm doesn't seem to perform distributed all-to-all communication efficiently. To understand what an efficie…

magnatelee updated 4 years ago
1
bsc-quantic/Extrae.jl #8

Automatic Extrae deployment on `Distributed.Worker`s

Extrae automatically deploys on multi-node jobs if MPI communication is detected, which is not always the case in Julia. Automatically deploying to remote workers on execution and posteriously merging…

mofeing updated 1 year ago
1
huggingface/candle #2007

How to run inference of a (very) large model across mulitple…

It is mentioned on README that candle supports multi GPU inference, using NCCL under the hood. How can this be implemented ? I wonder if there is any available example to look at.. Also, I know PyT…

jorgeantonio21 updated 3 months ago
4
cockroachdb/cockroach #103320

allocator2: multi-metric allocation

This issues tracks work on allocator2 package development, integration and testing. **Necessary for simulator integration** - [ ] Refactor `asim` to support hooking in different allocators - [ ]…

kvoli updated 2 months ago
1
Bluefog-Lib/bluefog #20

Proposal for local GPU communication merging

Current win_ops logics is win_create -> gradient/iterate update -> win_put -> win_sync The processing between all nodes/agents are almost decoupled and independent. We want to further optimi…

BichengYing updated 4 years ago
1

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for multi-node-communication

1000+ results
for multi-node-communication