System Info
Standard trn1n.32xlarge instance with the Hugging Face AMI.
Who can help?
No response
Information
[X] The official example scripts
[ ] My own modified scripts
Tasks
[X] An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
[ ] My own task or dataset (give details below)
Reproduction (minimal, reproducible, runnable)
Use the notebook provided in the repo on fine-tuning Llama 2, but with Llama 3 instead, and add 3 custom tokens. During the pre-compile stage it fails with a division-by-8 error: Llama 3's vocabulary size is 128,256, and adding 3 tokens makes it 128,259, which is not divisible by the tensor parallel degree of 8. Bug report #175 seems related, but I'm not sure how to modify the provided notebook so that it will work here.
I understand that this is a tensor parallelism setting and that I can modify it, but what is the standard or proper way of dealing with token embedding sizes like this?
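For context, here is a minimal sketch of the fix as I currently understand it, using plain transformers and assuming the `pad_to_multiple_of` argument of `resize_token_embeddings` available in recent versions (the model ID and token names below are placeholders, not the ones from the notebook):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder checkpoint; the notebook's actual model ID may differ.
model_id = "meta-llama/Meta-Llama-3-8B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Add the 3 custom tokens (illustrative names).
tokenizer.add_tokens(["<custom_1>", "<custom_2>", "<custom_3>"])

# 128256 + 3 = 128259 is not divisible by 8, so pad the embedding
# matrix up to the next multiple of 8 (128264) so that each of the
# 8 tensor-parallel ranks gets an equally sized shard.
model.resize_token_embeddings(len(tokenizer), pad_to_multiple_of=8)
```

Is `pad_to_multiple_of=8` the intended approach here, or does optimum-neuron expect this to be handled differently before compilation?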
Expected behavior
The notebook runs as intended with Llama 3 and the 3 added custom tokens.