-
### 🐛 Describe the bug
Sometimes,There is an error 'AssertionError: "Simulate error" does not match "grad can be implicitly created only for scalar outputs"' in case: test_autograd_xpu.py::TestAutogr…
-
xft version:1.8.2
lscpu:
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 52 bits physical, 48 bits virtual
Byte Order: Little End…
-
### 🐛 Describe the bug
We observed a NaN regression with 2.5.0, and traced it to CUDNN attention.
2.5.0:
![Screenshot_20241030_095725](https://github.com/user-attachments/assets/5e83ecc4-8f0c-46b…
-
### 🐛 Describe the bug
```
File "/mnt/clusterstorage/workspace/kevin/ml-monorepo/chadfusion/train_fsdp.py", line 363, in fsdp_train
scaler.step(opt)
File "/usr/local/lib/python3.10/dist-…
-
-
### 🐛 Describe the bug
I am trying to call `torch.vmap` on `torch.jacfwd`. This works fine normally but raises the following error when called under `torch.inference_mode()`.
```
File [...]/torch…
-
### System Info
```
~/work/llama-stack/distributions/meta-reference-gpu (main)]$ python -m "torch.utils.collect_env"
/home/kaiwu/.conda/envs/llamastack-meta-reference-gpu/lib/python3.10/runpy.py:12…
-
### Describe the bug
To finetune model on Xeon CPU, we are following the [ai-reference-models/models_v2/pytorch/llama/training/cpu at main · intel/ai-reference-models (github.com)](https://github.com…
-
It might be quite interesting to explore SIMD vectorization for elliptic curves and MSMs. This might significantly speed-up:
- Verkle Trees
- KZG
- MSM
without needing a GPU. Ideally the same op…
-
### 🐛 Describe the bug
Based on pytorch main branch commit https://github.com/pytorch/pytorch/commit/acfe237a71af609e837a34bb38048aa8acb8eb4d
GCC 13.2.0: build pass
GCC 14.1.0: build fail
…