resolve OOM errors in transfer-learning or warn that it needs a GPU

lukas / ml-class

Machine learning lessons and teaching projects designed for engineers

GNU General Public License v2.0

2.34k stars 1.17k forks source link

On my reasonably-equipped home cpu-machine and on the (cpu) hub, the transfer-learning example causes OOM errors -- sometimes even before getting to the model.fit call.

They're pretty scary-looking, if you're not expecting them, and depending on the exact system parameters, they can sometimes be triggered inside of the wandb.init call, making it look like our fault.

Potential solutions:

cut the dataset size in half
shrink the images to the minimum acceptable size for ResNet50 (80% of current size)

lukas / ml-class

resolve OOM errors in transfer-learning or warn that it needs a GPU #67