Open unclemusclez opened 1 month ago
is it an issue with the latest version? 🤔
this occurs when a job fails and you try to run the job again when running the same instance. at the moment, the only solution i have found is turn off the application and turn it back on.
this was from the recent origin main
but i noticed this same issue a week or so ago as well with a non-updated version.
It also seems to be particular to SFT training.
This issue is stale because it has been open for 30 days with no activity.
Prerequisites
Backend
Local
Interface Used
CLI
CLI Command
autotrain app --host 0.0.0.0 --port 7000
UI Screenshots & Parameters
No response
Error Logs
__main__.py: error: unrecognized arguments: --mixed_precision bf16 -m autotrain.trainers.clm --mixed_precision bf16 -m autotrain.trainers.clm --mixed_precision fp16 -m autotrain.trainers.clm --mixed_precision fp16 -m autotrain.trainers.clm
Additional Information
Running Local and it seems to double up the flags, and then keep doing so every time the training is run.