-
If possible we can make a web version together. Just like Kangle, with commercial version features. Also includes a web console, multi-node functionality, and more. After reading your blog, u may be a…
-
Hello everyone, training with multi-GPU on one node (machine) is normal, but multi-GPU on several nodes (machines) training is Inefficiency. By sorting out the problem, it is found that it has nothing…
-
- [x] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [ ] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [x] ne…
-
MLX is a new ML framework specifically designed to run on Apple silicon: https://github.com/ml-explore/mlx
It has some differences compare to PyTorch with `mps` backend: https://github.com/ml-explo…
-
**Describe the current state of the problem**
Nw.js is based on a seemingly good idea of merging node.js and webkit contexts, but, in hindsight, it is a terrible idea as new APIs come and nw.js start…
-
For the past few months I've been working on a program that needs all-to-all exchanges and Realm doesn't seem to perform distributed all-to-all communication efficiently. To understand what an efficie…
-
Extrae automatically deploys on multi-node jobs if MPI communication is detected, which is not always the case in Julia. Automatically deploying to remote workers on execution and posteriously merging…
-
It is mentioned on README that candle supports multi GPU inference, using NCCL under the hood. How can this be implemented ? I wonder if there is any available example to look at..
Also, I know PyT…
-
This issues tracks work on allocator2 package development, integration and testing.
**Necessary for simulator integration**
- [ ] Refactor `asim` to support hooking in different allocators
- [ ]…
kvoli updated
2 months ago
-
Current win_ops logics is
win_create -> gradient/iterate update -> win_put -> win_sync
The processing between all nodes/agents are almost decoupled and independent.
We want to further optimi…