Open xinghua-qu opened 1 year ago
I tried to run the code for the prompt tuning example, but got a NaN error after some iterations.
Does anybody know the reason for this error? Thanks :) The dots after the training curve mark NaN values.
Hi @xinghua-qu! We made a fix for this in https://github.com/bigscience-workshop/petals/pull/343 which should likely resolve your issue. Can you try running the SST-2 prompt tuning notebook from main?
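If NaNs still show up after updating, one generic way to narrow things down is to find the first iteration where the loss stops being finite. Below is a minimal sketch of that check. It is not taken from the notebook or from the PR: `first_non_finite` is a hypothetical helper, and the loss values are made up for illustration.

```python
import math


def first_non_finite(losses):
    """Return the index of the first NaN/inf loss, or None if all are finite."""
    for step, loss in enumerate(losses):
        if not math.isfinite(loss):
            return step
    return None


# Illustrative values only, not real training output.
example_losses = [0.71, 0.64, 0.58, float("nan"), float("nan")]

bad_step = first_non_finite(example_losses)
if bad_step is not None:
    print(f"loss became non-finite at step {bad_step}")
```

In a real training loop the same `math.isfinite` check could run on each step's scalar loss, so training stops at the first bad iteration instead of continuing to log NaN.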