Closed NimbusLongfei closed 2 months ago
Hi @NimbusLongfei, thanks for reporting ! Could you share a minimal reproducer ?
Hi @NimbusLongfei , is it possible that you are using BnB 4 bit quantization with FP16=True? If is thats the case, that gives this error.
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Please note that issues that do not follow the contributing guidelines are likely to be ignored.
System Info
Information
Tasks
no_trainer
script in theexamples
folder of thetransformers
repo (such asrun_no_trainer_glue.py
)Reproduction
When I was debugging in VSCode, the following error appeared, but strangely, this error did not occur when using
accelerate launch
.ValueError: Attempting to unscale FP16 gradients.
Note that the Settings and parameters for debugging and running are exactly the sameExpected behavior
I would like to know why this error occurs.