Closed fahadh4ilyas closed 1 month ago
@fahadh4ilyas the problematic PR that causes this issue was reverted in https://github.com/microsoft/DeepSpeed/commit/bc48371c5e1fb8fd70fc79285e66201dbb65679b
Thanks, @nelyahu.
Closing as this appeared to be fixed.
Describe the bug When training model using deepspeed 0.14.2. I got this error:
Here is sample script to reproduce
Here is my deepspeed config json
To Reproduce Steps to reproduce the behavior:
deepspeed the_script.py --model_name_or_path your_model --deepspeed --deepspeed_config your-deepspeed-config.json
Expected behavior There is no error and the script run successfully
ds_report output
Screenshots If applicable, add screenshots to help explain your problem.
System info (please complete the following information):