Open BeachWang opened 9 months ago
Yeah model weights are not available . Authors of repo please release it , i am excited to see.
I find the link to huggingface is available. But I test it on MME and only get 1110.96 score.
Hi,
I try to reproduce your work with the reported codes. The loss in pretrain and finetune meets expectations. But I get ridiculous answer in eval. For example:
Q: Is there a snowboard in the image? Answer the question using a single word or phrase.
A: 50
So can you upload the model weights to help me debug. Thank you very much!
The bug may be in the preprocess_v0
in train.py
. It should be
# instruction_len = len(tokenizer(parts[0]).input_ids)
instruction_len = len(tokenizer(parts[0]).input_ids) - 1
, since the last space will be tokenized in parts[0]
but be tokenized with other contents in target
.
Hi, I try to reproduce your work with the reported codes. The loss in pretrain and finetune meets expectations. But I get ridiculous answer in eval. For example: Q: Is there a snowboard in the image? Answer the question using a single word or phrase. A: 50 So can you upload the model weights to help me debug. Thank you very much!
The bug may be in the
preprocess_v0
intrain.py
. It should be# instruction_len = len(tokenizer(parts[0]).input_ids) instruction_len = len(tokenizer(parts[0]).input_ids) - 1
, since the last space will be tokenized in
parts[0]
but be tokenized with other contents intarget
.
I also got 1110.96 on MME, have you fixed the performance sir?
Hi,
I try to reproduce your work with the reported codes. The loss in pretrain and finetune meets expectations. But I get ridiculous answer in eval. For example:
Q: Is there a snowboard in the image? Answer the question using a single word or phrase.
A: 50
So can you upload the model weights to help me debug. Thank you very much!