Closed le1nux closed 3 months ago
The current state in main
(4d9218f51e867331d92de0b068db2b1b7a3da726) contains dgx-specific paths. Probably dgx2. I want to emphasize again that this is a bad idea. Especially modifying test resources like the lorem_ipsum config (e.g. here).
Git blame indicates @mali-git . Please don't do this. This makes the tool unusable on different devices and undermines the stability of the tests and invites people to ignore them even more.
Seems like this problem was introduced with the tokenizer merge. (At least using the previous state of 60feafe29ec882939202be4e88892bbcde2e53f5 Are you sure about the code's stability for the usage on Taurus? @le1nux, @mali-git
I was able to fix the tests. They pass in my local setup. Since the Ci is still broken due to our non-GPU setup and flash-attn being integrated, I verified it locally by creating a dockerized environment. I did not make the utilities for the Docker build part of this PR. But you can try reproducing it by checking out this tagged version: https://github.com/Modalities/modalities/tree/dockerized-pytester
It looks like there are issues with the loading of the dataset.