Open RajaRuling opened 2 months ago
The documentation here states that the argument should be "--adapter-path" instead of "--adapter-file" as well as the usage example you printed above.
Now the problem is further with the dequantization
Loading pretrained model
Trainable parameters: 0.108% (1.126M/1044.752M)
Loading datasets
Training
Starting training..., iters: 500
Traceback (most recent call last):
File "
Model training complete.
If you're getting a strange error and the training isn't happening (will be obvious as it'll end instantly).
CPU times: user 17 ms, sys: 10.1 ms, total: 27.1 ms Wall time: 2.1 s
Not sure if related to this issue: https://github.com/ml-explore/mlx/issues/814
Description
When running the mlx-usft.ipynb notebook on M1 Mac with the
--adapter-file
argument, it results in an "unrecognized arguments" error. It seems like the argument is either not implemented or incorrectly handled.Steps to Reproduce
Expected Behavior
The script should recognize the
--adapter-file
argument and use the specified adapter file for training or testing as intended.Actual Behavior
The script throws an error:
lora.py: error: unrecognized arguments: --adapter-file trial1.npz
.Possible Solution
--adapter-file
is correct.Additional Information
Please let me know if there's a different way to specify the adapter file or if there's an update needed to handle this argument correctly. Thanks!