TXH-mercury / COSA

Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
https://arxiv.org/abs/2306.09085
MIT License
39 stars 3 forks source link

Warning each batch: Attempting to cast a BatchEncoding to type torch.float32. This is not supported. #3

Open mgwillia opened 1 year ago

mgwillia commented 1 year ago

As the title says, when I run zero-shot inference with, for example, ActivityNet, from pre-trained weights, I get this warning once per batch. The final retrieval results seem plausible, so I'm wondering if this warning message is to be expected, or if it indicates some broken environment on my end.

TXH-mercury commented 1 year ago

Hi@mgwillia , If the evaluation metric are normal, then I think this warning could be ignored.