Closed kevintsq closed 3 weeks ago
Some tensors were created on the CPU, causing high CPU usage. This PR fixes this issue by creating them directly on the GPU, lowering CPU usage and thereby improving training speed.
Some tensors were created on the CPU, causing high CPU usage. This PR fixes this issue by creating them directly on the GPU, lowering CPU usage and thereby improving training speed.