-
From https://arxiv.org/abs/2112.05682v2. I have no immediate use for this, but it looks cool and I didn't want it to go unmentioned in case some aspiring contributor to Transformers.jl is looking for …
-
I see the memory consumption chart in the readme, but after looking at the code, I have doubts that this implementation is fully memory efficient. I see the call to cp.checkpoint in _DenseLayer.forwar…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
when I run .\webui.bat --xformers or .\webui.bat --xforme…
-
Hi, thank you for implementing and deploying the models. It's awesome.
I run into this issue when using a downstream library that depends on this library. I opened up an issue [there](https://githu…
-
# Concepts for Efficient Coding
## Course Description
This course aims to provide an intuitive and deeper understanding of what happens when you run code. By focusing on the underlying computer ar…
-
https://arxiv.org/pdf/2309.06180.pdf
-
## 🚀 Feature
Continuing the [requests](https://github.com/pytorch/pytorch/pull/50693) to support various needs of the models in the new Pipe pytorch feature, this one brings up
### Memory-Eff…
-
# 🐛 Bug
Xformers gives a CUDA error like this when the batch size is larger or equal to 65536.
```
RuntimeError: CUDA error: invalid configuration argument
CUDA kernel errors might be asynchrono…
-
Hi I am using Google colab and when i run this command "mistral-demo $M7B_DIR" I use T4 GPU i got this error any solution for that plz
Traceback (most recent call last):
File "/usr/local/bin/mi…
-
# 🐛 Bug
The documentation says one thing that doesn't match the equivalent code:
[equivalent code](https://github.com/facebookresearch/xformers/blob/97cc81f5e4aef5af7202791bd75a7e3fb5a1762e/xfor…