facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
MIT License

Fine tuning NLLB #4870

Closed ibtiRaj closed 1 year ago

ibtiRaj commented 2 years ago

❓ Questions and Help

Before asking:

  1. search the issues.
  2. search the docs.

My question:

I am trying to fine-tune the NLLB model on Moroccan Arabic and English. I prepared my data with the "Filtering and Preparing the Data" pipeline described in the README. The "Prepare Data" step produced the following directory: [screenshot of the prepared data directory]

The second step is to fine-tune the model. I used this command:

```shell
DROP=0.1 python examples/nllb/modeling/train/train_script.py \
    cfg=nllb200_dense3.3B_finetune_on_fbseed \
    cfg/dataset=bilingual \
    cfg.dataset.lang_pairs="ary_Arab-eng_Latn" \
    cfg.fairseq_root=/home/docker/notebooks/khadija/Translation_with_NLLB-200/fairseq \
    cfg.output_dir=model_fine_tuned \
    cfg.dropout=$DROP \
    cfg.warmup=10 \
    cfg.finetune_from_model=/home/docker/notebooks/khadija/Translation_with_NLLB-200/checkpoint.pt
```
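The arguments above are Hydra-style dotted overrides (`cfg.dataset.lang_pairs=...` sets a nested config value; `cfg/dataset=bilingual` selects a config group). As a rough illustration of how `key.path=value` pairs compose into a nested config tree, here is a minimal hand-rolled sketch; fairseq's `train_script.py` actually delegates this to Hydra/OmegaConf, and `apply_overrides` below is a hypothetical helper, not part of fairseq:

```python
def apply_overrides(overrides):
    """Build a nested dict from Hydra-style dotted key=value overrides.

    Illustrative sketch only: real Hydra also handles config-group
    selection (the slash form, e.g. cfg/dataset=bilingual) and typed
    values, which this toy version does not.
    """
    config = {}
    for item in overrides:
        key, _, value = item.partition("=")
        parts = key.split(".")
        node = config
        for part in parts[:-1]:
            # descend, creating intermediate dicts as needed
            node = node.setdefault(part, {})
        node[parts[-1]] = value
    return config

overrides = [
    "cfg.dataset.lang_pairs=ary_Arab-eng_Latn",
    "cfg.dropout=0.1",
    "cfg.warmup=10",
]
cfg = apply_overrides(overrides)
print(cfg["cfg"]["dataset"]["lang_pairs"])  # → ary_Arab-eng_Latn
```

This is only meant to show why a typo in a dotted path (or a path that does not exist in the chosen config) makes Hydra reject the override, which is a common source of errors with this training script.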

The `bilingual.yaml` file is as follows:

[screenshot of bilingual.yaml]

But I get this error when I run the command:

[screenshot of error traceback]

[screenshot of error traceback, continued]

Can you help me, please? Is there anything I've missed?

What's your environment?

qunash commented 1 year ago

Hi, could you share your insights on this issue? Were you able to fine-tune the model?