Closed iMountTai closed 1 year ago
Hello, I met the same error on my own code but not CoFi. You can set CUDA_LAUNCH_BLOCKING=1 to get more stacktrace to help debug :smile:
When layer_distill_version=3 no longer reports an error, it may be that some layers are lost.But another problem arises, when I fine-tune, the following code needs to be changed is this correct?
Which layer_distill_version
did you use when you encountered the error? Sorry for the late reply, I am happy to help with debugging!
layer_distill_version=4,Now the problem is solved, thank you!
Great job. However, when I train to the 3rd epoch in the QNLI task, I encounter the following problem, but the CoLA or Squad tasks do not encounter this problem. Do you have any suggestions? I will be very grateful!
The error may appear in the following code block in the file: https://github.com/princeton-nlp/CoFiPruning/blob/main/trainer/trainer.py,Lines 680-685