AMI-system / species_classifier

This repository contains the code to create on-device machine learning models for species classification.
MIT License
2 stars 0 forks source link

Run models on baskerville with GPU/CUDA #17

Closed KatrionaGoldmann closed 1 year ago

KatrionaGoldmann commented 1 year ago

Running the models on Baskerville crashes when requesting GPUs, despite torch.is_cuda_available() returning True and the set-up selecting multiple GPUs.

Using:

#SBATCH --gpus-per-task 3
#SBATCH --tasks-per-node 1
#SBATCH --nodes 1 

Returns error RuntimeError: CUDA error: no kernel image is available for execution on the device

KatrionaGoldmann commented 1 year ago

Need to load compatible versions of CUDA, Pytorch and Python on the server.

Currently running:

For more info see: