Open sgjheywa opened 1 year ago
i have the same issue about video inference, please sort it out
Same here! Hope to get the solutions as soon as possible. Thank you so much for your great works and efforts!
Meanwhile, does anyone know any baselines method could be comparable with the mPLUG-Owl in videoQA?
It looks like that one of weights in the official checkpoint has NaN value.
See #101. Also we have update the checkpoint in HF.
Hi,
Thanks for sharing this repo!
I am trying to test the video model and I keep getting the same error:
Looking at similar issues online it looks like a bfloat16/fp32 issue but I've tried running the model at both precisions and both on the CPU and GPU and get the same error. somewhere in the inference a tensor turns to NaNs, this is only occuring during videoQA, image is working fine.
Can you help me understand where this error might be occurring? Here is a colab recreating the error (I'm running it on a V100): https://colab.research.google.com/drive/1znwvEgSYoqbA67BH3S1ppmWEWIy8cDDt?usp=sharing