Suodislie closed this 7 years ago
The GTX 980 Ti has 6GB of VRAM, less than the 8GB on the reference GTX 1080 I used, which may explain why you had to use a smaller batch size. As for the loss, try resuming training from your last checkpoint with a lower learning rate.
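
For what it's worth, here is a rough, framework-agnostic sketch of what "resume with a lower learning rate" could look like. The thread doesn't say which framework or checkpoint format is in use, so everything here is an assumption for illustration: the JSON checkpoint layout, the helper names `save_checkpoint` and `resume_with_lower_lr`, and the 0.1 scaling factor are all hypothetical.

```python
import json
import os
import tempfile


def save_checkpoint(path, step, learning_rate, weights):
    """Write a toy checkpoint (hypothetical format: plain JSON)."""
    with open(path, "w") as f:
        json.dump(
            {"step": step, "learning_rate": learning_rate, "weights": weights}, f
        )


def resume_with_lower_lr(path, lr_scale=0.1):
    """Load the checkpoint and scale its learning rate down.

    lr_scale=0.1 is an assumed starting point, not a recommendation
    from the thread; tune it for your own run.
    """
    with open(path) as f:
        state = json.load(f)
    state["learning_rate"] *= lr_scale
    return state


# Usage: save a checkpoint at step 1000, then resume with a 10x smaller rate.
checkpoint_path = os.path.join(tempfile.mkdtemp(), "checkpoint.json")
save_checkpoint(checkpoint_path, step=1000, learning_rate=1e-3, weights=[0.5, -0.2])
resumed = resume_with_lower_lr(checkpoint_path)
```

The step counter and weights are carried over unchanged, so training continues from where it stopped and only the learning rate differs.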