Open UmarIgan opened 1 year ago
cc @amyeroberts and @rafaelpadilla
Hi @UmarIgan
Thank you for bringing this to our attention!
I've tested your code and indeed, I've encountered the same error. I'm on it and will work towards a solution.
Thanks @rafaelpadilla As I understand vision transformers also can't encode grayscale of images as well, I tried to wrap around the dataset - tried to transform image to add a new channel but no go. Is there a way to overcome this?
I was facing the same error. Fixed this by converting images to RGB mode. image = image.convert('RGB')
I was facing the same error. Fixed this by converting images to RGB mode. image = image.convert('RGB')
Thank you, works for me!
But if you convert from 16bit single channel to 8bit RGB, you will lose resolution. It does not seem like a solution.
System Info
I am facing an issue during encoding image dataset using facebook/dino-vits16, I faced this issue with grayscale images before too but it worked well with Bingsu/Human_Action_Recognition dataset. Versions
The error:
Who can help?
@amyeroberts
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Expected behavior
Expected behavior was to obtaining embeddings.