michellab / Cluster

This repository is used for tracking any issues regarding the cluster
2 stars 0 forks source link

node005 nvidia device file #6

Closed ppxasjsm closed 9 years ago

ppxasjsm commented 9 years ago

No device files: /dev/nvidia0 etc.

slurmd daemon does not run because of this.

ppxasjsm commented 9 years ago

Fix with this script: cat dev_script

#!/bin/bash
for i in 0 1 2 3; do
  node="/dev/nvidia$i"
  rm -f $node
  mknod $node c 195 $i  || echo "mknod \"$node\""
  chmod 0660 $node      || echo "chmod \"$node\""
  chown :video $node    || echo "chown \"$node\""
done

node="/dev/nvidiactl"
rm -f $node
mknod $node c 195 255   || echo "mknod \"$node\""
chmod 0666 $node        || echo "chmod \"$node\""
chown :video $node      || echo "chown \"$node\""

now the slurm daemon can be started again