I think the program load the model to ram, and then move it to GPU memory, and after that, the model can run on its own, with a few ram supporting the program.
But now I'm facing the fact that the ram usage is about the same as GPU memory usage, any ideas on how to solve my problem?
Thanks in advance.
I think the program load the model to ram, and then move it to GPU memory, and after that, the model can run on its own, with a few ram supporting the program. But now I'm facing the fact that the ram usage is about the same as GPU memory usage, any ideas on how to solve my problem? Thanks in advance.