Embedding running out of GPU memory

Hi, first of all: thanks for creating MedCLIP. It seems to be an amazing library. I'd like to embed several hundreds of images with the MedCLIPProcessor. However my GPU memory filled up rather fast. That's why I needed to copy each and every embedding to the CPU memory. This is of course rather slow. I tried to start the embedding on the CPU, but the input tensors (cuda.tensors) and weight tensors(torch.tensors) are not compatible with each other.

Is there a way to run the MedCLIPProcessor on batches of images? Is there a way to force the input tensors to normal torch.tensors? Is there a way to actually run the embedding process on a CPU?

Best, Michael

RyanWangZf / MedCLIP

Embedding running out of GPU memory #33