Open DevonPeroutky opened 5 months ago
Hi @DevonPeroutky,
Thank you for your interest in our work. Did you try to upgrade the transformers to the latest version? Please note that LLaMA-3 based trainings are only supported with "transformers==4.41+" which you can install as follows,
pip install git+https://github.com/huggingface/transformers@a98c41798cf6ed99e1ff17e3792d6e06a2ff2ff3
Let me know if it helps. Good Luck!
Environment
Issue
I'm seeing random and sudden loss spikes during training, if there is a simpler way of debugging this, I'm open to a new approach. However, I attempted to reproduce the training loop in pytorch such that I could log out abnormal gradients during the training process to detect any erroneous examples in my training data.
However, I'm always getting
AttributeError: 'NoneType' object has no attribute 'device'
in the forward pass (Full stacktrace below).I built the model exactly how its done in train.py and my training loop looks like
The
model.forward
always fails with the below stacktrace. I've tried the forward pass with and without labels, similar results. Afterprepare_inputs_labels_for_multimodal
call, the inputs look like the following:Below is the full stacktrace and the model layers. What am I missing?
Model
Full StackTrace