Spongeorge opened this issue 1 year ago (status: Open)
Output from tests/correctly_installed.sh:
pytorch 1.10.1+cu102
cuda 10.2
Apex not installed
smatch installed
pytorch-scatter installed
fairseq works
[OK] correctly installed
I also tried the wiki25 dataset that tests/minimal_test.sh downloads and got the same result: infinite loss in both training and validation, so I don't think it's an issue with my input. When I run tests/minimal_test.sh itself, though, the loss isn't infinite.
I'm trying to fine-tune the AMR3.0 large SBART checkpoint on another dataset, but during training I get the following warnings:
In my config I set the fairseq-preprocess arguments as:
and train args as:
Any ideas as to what I'm doing wrong? Thanks in advance.