Closed catid closed 7 months ago
Due to some numerical issues with triton itself, it is reasonable for these phenomena to occur. This kernel has been used to train transnormerllm, so feel free to use it. We will optimize the test section in the future to reduce ambiguity.
Sorry disregard this was a bug in my testing script.
All the tests currently fail:
To reproduce, I checked out the repo, set up a conda env,
pip install -e .
and then:I also wrote my own unit test and it fails: