Closed jmercat closed 6 months ago
Hi, thank you for providing the information.
The first issue is most likely caused by a version mismatch. The versions that work in our local tests are as follows.
╰─± pip list | grep triton
triton 2.0.0
triton-nightly 2.1.0.dev20230728172942
You can use the following commands to install both packages:
pip install triton==2.0.0
pip install triton-nightly==2.1.0.dev20230728172942 --index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/Triton-Nightly/pypi/simple/
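To confirm that an environment matches the versions above before running the tests, a small check along these lines may help. This is only a sketch: `EXPECTED` and `check_versions` are hypothetical names introduced here and are not part of the repo. It relies solely on the standard-library `importlib.metadata` module.

```python
from importlib import metadata

# Versions reported as working in this thread.
EXPECTED = {
    "triton": "2.0.0",
    "triton-nightly": "2.1.0.dev20230728172942",
}


def check_versions(expected=EXPECTED, get_version=metadata.version):
    """Return {package: (installed_or_None, expected)} for every mismatch.

    `get_version` is injectable so the check can be exercised without
    the packages actually being installed.
    """
    mismatches = {}
    for name, want in expected.items():
        try:
            have = get_version(name)
        except metadata.PackageNotFoundError:
            have = None  # package is not installed at all
        if have != want:
            mismatches[name] = (have, want)
    return mismatches


if __name__ == "__main__":
    for name, (have, want) in check_versions().items():
        print(f"{name}: installed {have}, expected {want}")
```

An empty result means both packages match the versions listed above. Any entry in the result names a package to reinstall with the pip commands above.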
As for the second question, you can ignore it for now. Because of inherent issues with Triton, some numerical error cannot be avoided. However, we have trained models with this kernel and compared them against the baseline (the torch version), and there is almost no difference in loss, so you can use it with confidence.
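For anyone who wants to quantify such differences, one common approach is to compare a kernel's output to a reference with a relative tolerance rather than exact equality. The sketch below is illustrative only. It uses half-precision rounding from the standard library as a stand-in for a kernel's rounding error, which is an assumption made for the example and not a statement about where the Triton discrepancy comes from. `to_half` and `max_rel_error` are hypothetical helper names.

```python
import struct


def to_half(x):
    """Round a Python float to IEEE half precision and back (stand-in for kernel rounding)."""
    return struct.unpack("e", struct.pack("e", x))[0]


def max_rel_error(test, ref, eps=1e-6):
    """Largest element-wise relative error; eps guards against division by zero."""
    return max(abs(t - r) / (abs(r) + eps) for t, r in zip(test, ref))


# Reference values in double precision, and a lower-precision copy of them.
ref = [0.1 * i for i in range(1, 9)]
low = [to_half(v) for v in ref]

err = max_rel_error(low, ref)
print(f"max relative error: {err:.2e}")
```

A nonzero but small relative error of this kind is expected whenever two implementations round differently. Whether a given error is acceptable depends on the tolerance chosen for the comparison.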
If you encounter any other issues, feel free to ask at any time.
Thanks for this repo, I'm pretty excited to test this out.
I drop-in replaced attention from lightning-attention in one of my projects and got the following:
So I tried to simply run
pytest tests/ops/test_lightning2.py
And got only failures (it is weird that there is an assert False statement in there...). The more worrisome result is that the errors are quite large...