-
Hi!
A big thanks for your impressive work! Since full attention and causal attention are both included, I am curious how you implemented such attention masks if flash attention is used.
Best reg…
-
Hi author, I have compiled and installed fused_dense_lib successfully. But when I tried to run the finetuning code, I encountered this error: "RuntimeError: linear_act_forward failed." which is due to…
-
If torch is below flash-attn in requirements.txt, it doesn't work properly since flash-attn depends on torch.
File "C:\Users\user\AppData\Local\Temp\pip-install-7czi3fmk\flash-attn_6da82b15991a4532…
-
Hi again :) - I am getting the following JAX notifications while running a training job and I was wondering if you can provide some clarity.
1. `INFO:levanter.distributed:Not initializing jax.distr…
-
after flashing the lcd screen and the MB firmware, i get a notification sayiing buiild is 622 but expected 61, and i dont know hhow to fix it
-
Hi I am getting the following error when trying to install. Can you please advise how to troubleshoot this?
` *********************************************************************…
-
### Check existing issues
- [X] I have checked existing issues and believe that my issue is not a duplicate
### Description
Upon switching workspaces, the menu bar flashes as it appears to refresh.…
-
Recently purchased some Gosund WP3s from Amazon to flash over to Tasmota.
![PXL_20240905_014907760 MP](https://github.com/user-attachments/assets/9b51f129-bda9-429d-b2ce-7529999dc08f)
I took one o…
-
https://research.colfax-intl.com/flashattention-3-fast-and-accurate-attention-with-asynchrony-and-low-precision/
cc @yzh119
-
See repro here:
https://stackblitz.com/edit/vitejs-vite-eafhmw?file=src%2Fmain.tsx
And the recording:
https://github.com/pmndrs/uikit/assets/9379701/81269900-f3d8-433c-92d5-a8828a915bf6
To rep…