Closed Gvaihir closed 5 years ago
Hi, Reading the message error it seems that faiss doesn't realize that the unoccupied cached memory currently held by the caching allocator of pytorch is free to use. I've had a similar issue. To solve this, in this version of the code (pytorch 0.2), I was dedicating a GPU only for PIC clustering in order to avoid conflict with pytorch.
With more recent versions of pytorch, this function is very helpful and allows to fix the issue. Hope that helps
Hi! I'm training AlexNet with PIC on NVIDIA Tesla M60 GPU (AWS g3.4xlarge instance), 800e3 images. After 2-3 epochs I get the following:
I saw the issue to originate from Faiss library. Can you advice anything from your side? Thanks!