Hi,
First of all, thanks a lot for open-sourcing your code and models!
I've been trying to use your code to generate predictions with CoFi models (with --do_predict on for example test-split of GLUE tasks) but unfortunately the prediction loop always fails with CUDA OOM exception (even on the 80GB A100 GPU). Could you also please try and let me know if I did something wrong?
Hi, First of all, thanks a lot for open-sourcing your code and models!
I've been trying to use your code to generate predictions with CoFi models (with
--do_predict
on for example test-split of GLUE tasks) but unfortunately the prediction loop always fails with CUDA OOM exception (even on the 80GB A100 GPU). Could you also please try and let me know if I did something wrong?