Closed Kahsolt closed 2 months ago
Thank you for your attention, I will reply to you during the day tomorrow.
Could you tell me which version of the transformers library you are using?
Just returned from travel, I'm so sorry to reply a bit late. 🤧
I use Python 3.9.0
, numpy 1.24.1
, torch 2.1.0
and transformers 4.43.3
, hope these can help!
Note: even if I switched to transformers 4.40.1
as the latest commit requires, NaN still occurs.
I run the model in float16
, might this be the cause?
Well, I changed float16
to bfloat16
, now it work perfect!
Thanks for your help, now it can be closed :)
What I do:
Qwen1.5-7B-Chat
print
logspython unke.py
following the tutorialWhat I get:
What's wrong with this? Please help...