-
## Description
I'm benchmarking naive FlashAttention in `Jax` vs. the Pallas version of [`FA3`](https://github.com/jax-ml/jax/blob/7b9914d711593dca8725d46aa1dadb2194284519/jax/experimental/pallas…
-
### Problem Description
Hello, my GPU output is:
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 33 [Radeon RX 7700S/7600/7600S/7600M XT/PRO W7600] (rev c0)
I insta…
-
Could you tell me which torch version this uses? Thanks.
-
First, thank you for sharing your work. It will definitely help me better understand how to use the Burn framework and how a PyTorch model can be translated.
I would like to know if you have validated y…
-
### ✨ Short description of the feature [tl;dr]
Compatibility with PyTorch 2.5
### 💬 Detailed motivation and codes
Compatibility with PyTorch 2.5
-
### System Info
```
(zt) root@autodl-container-7071118252-7032359d:~/test/PiPPy/examples/llama# transformers-cli env
Copy-and-paste the text below in your GitHub issue and FILL OUT the two last p…
-
It is useful to shard optimizer state across devices (to save significant memory). This reflects current practice. We want to support it.
* We want to switch from no sharding to naive model parameter…
-
Dear Plumed Team,
I am using PLUMED 2.10b with CUDA 11.2 and am attempting to enable libtorch in order to use the PyTorch module. I have tried multiple versions of libtorch, but it seems like one of t…
-
### 🐛 Describe the bug
I built PyTorch from source (from GitHub), but when I try to use it I get this error:
```
Traceback (most recent call last):
File "C:\Users\Admin\Desktop\Python\0.LLMs\AutoAWQ\AutoAWQ\set…
-
- Add headers to the scripts
- Add docstrings to functions and classes
- Change the name of the repository
- Add information to the README about the tests and verification
- Play the original soun…