-
Hi, first of all, thanks for sharing this library. It's really awesome.
I was trying to implement [Hierachical Multiscale Recurrent Neural Network](https://arxiv.org/pdf/1609.01704.pdf), where the …
-
I am trying to get multi-node inference working with 4 nodes, each with 4xRTX8000 GPUs (48GB per GPU).
`deepspeed --hostfile=$hostfile Megatron-DeepSpeed/scripts/inference/bloom-ds-inference.py --nam…
-
### 🐛 Describe the bug
I installed the final RC version of pytorch 2.3, and ran the following code, errors occurs.
```
#minified.py
import torch
from torch import nn
import os
from torch …
-
Recently we made liblinear report convergence failures.
Now this is reported in lots of places. I expect our users will start to see it everywhere. Should we change something? It's weird if most uses…
-
I posted the model code over on the Stan forums (see [here](https://discourse.mc-stan.org/t/improving-efficiency-for-dynamic-coevolutionary-stan-model/36103)). Bob Carpenter and Aki Vehtari had some g…
-
### What happened?
When training with Decompose Weights on, it looks like this:
![2024-08-10_17-09-54-training-sample-300-37-4](https://github.com/user-attachments/assets/4fd283d0-e9fa-42e8-b4e8-6ac…
-
I am getting this:
```
Error: unhandled exception: p ASSERT FAILED at /data/Storage/Development/nimtorch/aten/include/c10/impl/DeviceGuardImplInterface.h:130, please report a bug to PyTorch. Devi…
-
Version 0.10 is released now. If no major bugs surface in the next few days the server will start enforcing this version.
There is this 1500+ post issue where most plans for the future were posted …
-
Hi all,
The reason I've been slow on convnet-benchmarks these days is because i've been working on the side on DeepMark.
I initially wrote _convnet-benchmarks_ to increase competition among framewor…
-
@shelby3 [wrote](https://github.com/keean/traitscript/issues/2#issuecomment-248102324):
> Also, the compiler can use a much simpler form of path analysis (independent of run-time state) to do that pe…