Closed michal1000w closed 4 months ago
Hi. Based on my previous PR with mamba LM pytorch training script I've created a similar mlx version. From what I've tested it works correctly and the code is very similar for better consistency.
Hello, sorry for the late response, thank for the PR!
Hi. Based on my previous PR with mamba LM pytorch training script I've created a similar mlx version. From what I've tested it works correctly and the code is very similar for better consistency.