-
This is to unblock @cowanmeg and @samnordmann 's distributed matmul experiments.
I'll start with the tensor parallelism proposed by [the original Megatron-LM paper](https://arxiv.org/pdf/1909.0805…
-
Thanks for tagging me. This indeed looks like the same symptom I encountered.
For https://github.com/NVIDIA/Fuser/issues/2199, I used to be able to generate a one-nvFusion transforme…
-
The following test fails currently:
```c++
TEST_F(MatmulSchedulerTest, SelfMappingErrorSmemEpilogue1dBias) {
NVFUSER_TEST_CUDA_ARCH_RANGE_GUARD(7, 5, 9, 0);
Fusion fusion_obj;
Fusion* fusion = …
-
The following script gave numerically incorrect values on a small part of the array.
```py
import torch
import thunder
def f(a, b, c):
c += 1
return a.add(b, alpha=c)
dim = 2**2…
-
### 🐛 Describe the bug
```
Traceback (most recent call last):
File "main.py", line 202, in
all_metrics = trainer.train(args.steps, args.val_steps, args.save_every, args.eval_every)
Fil…
-
Can someone help please ? I fellowed all steps but nothing is working
Here the output:
```
Running Stage 1: Overall restoration
Traceback (most recent call last):
File "C:\BOPTL\Bringing-Old…
Arifi updated
8 months ago
-
### 🐛 Bug
Applying `thunder.jit` to `conv` operation in UNet model of NeMo Stable Diffusion gives an error:
```
Unsupported iterable object type for define_vector! Index:0
Exception raised from…
-
Has anyone encountered similar issues when training the Multi3DRefer? How to fix it?
```
sk_bce_loss: 0.0045, layer_3_mask_dice_loss: 0.0401, layer_3_sem_loss: 0.0509, layer_3_indi_loss: 0.5398,…
-
## 🚀 Feature
The feature request is to add the decision making capabilities inside nvFuser executor that allows the nvFuser executor to reject/pass on certain op executions where other backends/exe…
-
## 🚀 Feature
An environment variable that dumps out the various Thunder provided debug traces to a log file. This can have variable levels like
`export THUNDER_DEBUG=`
```
0/'' : Disable
1/'t…