Open consideRatio opened 5 years ago
Awesome! 🍰
@consideRatio Hi, do you think swapping in pytorch with tensorflow in the dockerfile will work? (changing conda channel and pytorch)
@koustuvsinha yepp, installing both would also work i think.
Cool. It sure will be fun to try to use GPUs on Azure AKS. Will report after having a chance to work on it.
The post is now updated, I think it is easier to read and has a more logical order to the steps taken. It also has some extra verification steps, but still not enough verification steps I think.
This is related to https://github.com/jupyterhub/zero-to-jupyterhub-k8s/issues/992 correct?
@jzf2101 This is #994 :D You meant #1021? Yeah those that gets their own GPU etc could certainly be the kind of users that would appreciate being abple to do sudo apt-get install ...
I'm glad you raised that issue, it is very relevant to me to get more knowledgeable about as well.
Correction- I meant #992
@jzf2101 ah! yepp thanks for connecting this
Made an update to the text, I added information about autoscaling the GPU nodes. Something resolved itself, I'm not sure what, now it "only" takes 9 minutes + image pulling to get a GPU node ready.
Which version of Ubuntu is in the Docker Images? I can't find it in the notes.
@jzf2101 the image I provide in this post is built from jupyter/datascience-notebook (1), built in top of scipy-notebook (2), on top of minimal-notebook (3), on top of base-notebook (4), on top of ubuntu 18.04 aka bionic.
@consideRatio Thank you for putting this together! I am currently stuck at Step #5. I get an error when I try to run kubectl logs
error: cannot get the logs from *extensions.DaemonSet
kubectl get -n kube-system ds/nvidia-driver-installer
gets me this:
NAME DESIRED CURRENT READY UP-TO-DATE AVAILABLE NODE SELECTOR AGE
nvidia-driver-installer 1 1 1 1 1
Suggestions?
@amanda-tan hmm clueless, but you could do a more explicit command:
kubectl logs -n kube-system nvidia-driver-installer-alskdjf
Where you would enter your actual pod name
also the container name as the ds uses initconatiners:
kubectl logs -n kube-system nvidia-driver-installer-alskdjf -c nvidia-driver-installer
Best,
clkao
On Fri, 7 Dec 2018 at 16:40, Erik Sundell notifications@github.com wrote:
@amanda-tan https://github.com/amanda-tan hmm clueless, but you could do a more explicit command:
kubectl logs -n kube-system nvidia-driver-installer-alskdjf
Where you would enter your actual pod name
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/jupyterhub/zero-to-jupyterhub-k8s/issues/994#issuecomment-445161778, or mute the thread https://github.com/notifications/unsubscribe-auth/AAEQaD3vxirY2utl8UFaNnN4Lj6GESJwks5u2ilsgaJpZM4X6ezZ .
@clkao Yes! That worked thank you!
Also, just wanted to add that I got this to work -- the profileListConfig did not work for me ; I probably made an error somewhere but just whittling it down to:
extraConfig: |- c.KubeSpawner.extra_resource_limits = {"nvidia.com/gpu": "1"}
worked like a charm. Thank you so much.
Has anyone tried provisioning pre-emptible GPU instances with this setup? I am having a hard time getting the beyond one instance of pre-emptible GPU.
Also, I am trying to use this setup for classroom and it seems extremely cost-ineffective; are there suggestions on how to lower the overall costs?
ETA: I guess there is also a Pre-emptible GPU quota which must be increased! That solved #1.
@amanda-tan yepp this will cost a lot. I don't know how to reduce the cost much, but the experience for the users can be improved greatly with user-placeholders as found in the z2jh 0.8-dev releases available already. Users would not have to wait for the scale up in best case with these. See the "optimizations" section of z2jh.jupyter.org for more info about such autoscaling optimizations. Requires k8s 1.11+ and Helm 2.11+.
Having multiple GPUs per node is also a reasonable idea, then the users could share some CPU even though they dont share the GPUs.
I ran a short course using Jupyterhub and Kubernetes with pre-emptible GPUs and scaled up to about 50 users. I ran the nodes for 8 hours with a total cost of about $75 on Google Cloud. Using 10 CPU/8 GPU clusters worked well for me so that each user had 1 CPU and 1 GPU available. You do need an extra 2 CPUs per node to manage the sub-cluster, otherwise you will have 1 GPU sitting idle per cluster. Use K80 GPUs to keep costs minimized and make sure you are running in a region and zone that has them available. Adding extra RAM to a node is really cheap, so don't be afraid to do that beyond the 6.5 GB per CPU standard for the highmem instances.
Make sure you have your quota increase requests in well before you need the nodes for the course because that was one of the more challenging parts for me to get through. You will need the GPUs (all regions) and regional GPU quotas increased. There are also separate quotas for preemptible GPUs versus regular GPUs, so be aware of those. You may also run into issues with quotas on the number of CPUs and the number of IP addresses you can have, so check on all those.
I'am looking forward to GPU integration ! How can I apply this without gke? on my own kubernetes cluster ? How should I handle nvidia-driver-installer?
@FCtj I don't know, but you would need to redo some work GKE have done as well if youd want to do this. I would consider this a very advanced topic.
GKE had one daemonset registering GPY nodes etc to kubernetes before the nvidia-driver-installer came into play, i think it was called nvidia-device-plugin. Also note that this regards NViDIA graphics cards as well specifically.
Thanks for this wonderful doc! This worked great for getting a GPU set up on our k8s cluster on google cloud. However, I have been having trouble getting it to work with tensorflow. I used your image with a Tesla P4 but got an issue:
In [2]: tf.Session()
2019-03-29 01:35:34.339396: I tensorflow/core/platform/cpu_feature_guard.cc:141] Your CPU supports instructions that this TensorFlow binary was not compiledto use: SSE4.1 SSE4.2 AVX AVX2 FMA
2019-03-29 01:35:34.439991: E tensorflow/stream_executor/cuda/cuda_driver.cc:300] failed call to cuInit: CUDA_ERROR_UNKNOWN: unknown error
2019-03-29 01:35:34.440053: I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:163] retrieving CUDA diagnostic information for host: jupyter-...
2019-03-29 01:35:34.440063: I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:170] hostname: jupyter-...
2019-03-29 01:35:34.440136: I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:194] libcuda reported version is: 410.79.0
2019-03-29 01:35:34.440166: I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:198] kernel reported version is: 410.79.0
2019-03-29 01:35:34.440174: I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:305] kernel version seems to match DSO: 410.79.0
Any idea why this could happen?
@jrmlhermitte thanks for the feedback! Hmmm sadly that error didnt help me much, what happens if you run nvidia-smi, the terminal command line tool? It is supposed to describe some info about the GPU situation in general. I found that getting that functional was an essential first step before Tf.
@jrmlhermitte What version of tensorflow-GPU did you install? The Tensorflow 1.13 binary is built with CUDA 10 while 1.12 and earlier is built with CUDA 9. Try pip install tensorflow-gpu==1.12
and see if that fixes the problem.
Thanks for the quick response @consideRatio . Here is my nvidia-smi
output:
Sat Mar 30 03:02:44 2019
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 410.79 Driver Version: 410.79 CUDA Version: 10.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 Tesla P4 Off | 00000000:00:04.0 Off | 0 |
| N/A 35C P8 7W / 75W | 0MiB / 7611MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| No running processes found |
+-----------------------------------------------------------------------------+
pip install tensorflow-gpu==1.12
didn't work either.
Could it have something to do with the GPU? I may try to switch to a k80.
Allright, never mind, I got it to work!!!!!! I have to make sure to patch the daemonset before starting the node. That did the trick for me. (I also needed the patch you mentioned). I need to do some more reading myself about all this, but I'm also interested to hear more about your developments.
I would suggest mentioning this detail in the README. Oh, and here is nvidia-smi
after a successful attempt:
jovyan@jupyter-jrmlhermitte:~$ nvidia-smi
Sat Mar 30 03:43:35 2019
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 384.145 Driver Version: 384.145 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 Tesla P4 Off | 00000000:00:04.0 Off | 0 |
| N/A 55C P0 24W / 75W | 7395MiB / 7606MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
+-----------------------------------------------------------------------------+
Thanks for the feedback!
I'm a bit confused on the cudatoolkit==9.0.0 pinning. According to here: https://github.com/NVIDIA/nvidia-docker/wiki/CUDA 10.1 should be compatible with the Keplers.
(I am new to this so perhaps missing something)
Like @beniz (https://github.com/jolibrain/docker-stacks/tree/master/jupyter-dd-notebook-gpu) I am building of the Nvidia images ARG BASE_CONTAINER=nvidia/cuda:10.1-cudnn7-devel-ubuntu18.04
but i notice he pins to 9.0.0 as well in the Dockerfile as well.
Finally if one builds of the nvidia base image should one install tensorflow-gpu or just plain tensorflow (the recipe here suggests tensorflow-gpu, @beniz suggests tensorflow)....
@rahuldave depending on what graphic card and drivers you have, you can use various versions of CUDA. This was done for NVIDIA K80's on Google Cloud a while back, the circumstances may have changed and not apply to you.
I'm planning to run on GKE with K80's. Was just confused with https://github.com/NVIDIA/nvidia-docker/wiki/CUDA saying that the toolkit version 10.1 is compatible with Keplers.
I'll try the standard nvidia image (though i have this feeling that the conda tensorflow-gpu and pytorch packages may override my 10.1 install. We'll see.
EDIT: the conda build installs cudatoolkit 10.0 based on the conda dependencies. Wondering then if i needed to start the base-notebook from nvidia's 18.04 image (as @beniz does) rather than the jupyter one...
@rahuldave hmmm this was the most troublesome part of the setup to get right to me. Perhaps K80 does not support the driver version that is required to use that version of CUDA then?
I'm using the standsrd jupyter taints:
prePuller:
continuous:
enabled: true
scheduling:
userScheduler:
enabled: true
podPriority:
enabled: true
userPlaceholder:
enabled: true
replicas: 1
userPods:
nodeAffinity:
matchNodePurpose: require
and setting up by node-pool with the appropriate taints
gcloud beta container node-pools create testgpu-gpu-pool
--accelerator type=nvidia-tesla-k80,count=2 --machine-type n1-standard-2 --num-nodes 1
--enable-autoscaling --min-nodes 0 --max-nodes 2
--node-labels hub.jupyter.org/node-purpose=user
--node-taints hub.jupyter.org_dedicated=user:NoSchedule
--zone us-east1-c --cluster univai-testgpu
But this gives me the error
Cannot schedule pods: node(s) didn't match node selector.
Learn more
Do i need to specifically match nvidia.com/gpu=present
in my config.yaml to get this to work? I thought that the user taint would do this for me... but i cant get the user-placeholder to get itself onto the machine...
@rahuldave your pods needs to request the GPU, the nodes with GPUs will get additional taints, and if you request GPU resources your pods will automatically get the toleration of such taint after drivers has been installed etc by the daemonset that does that.
It appears though that you have a "nodeSelector" on the pod you try to schedule, do you? That is not there by default. I don't grasp where this error is written out etc though, so I'm somewhat in the dark.
You will end up needing to learn about how tolerations and taints work with GPU requests on GKE etc I think, to debug this thoroughly.
(i reformatted the above message to be clear) The error is from the GKE page for the pod workload.
So the key question is, how do pods request the gpu?
the nodes in the cluster get
--node-labels hub.jupyter.org/node-purpose=user
--node-taints hub.jupyter.org_dedicated=user:NoSchedule
GKE adds nvidia.com/gpu=present
.
So if i am understanding you right, this:
userPods:
nodeAffinity:
matchNodePurpose: require
is the part where the pod gets a nodeSelector.
And this Does not match the nvidia.com/gpu=present
that GKE adds for me.
So i somehow need to additionally match on nvidia.com/gpu=present
in the config.yaml file?
OR
Is this done by something like you showed above in the thread for the profile: the additional kubespawner stuff:
kubespawner_override:
image: consideratio/singleuser-gpu:v0.3.0
extra_resource_limits:
nvidia.com/gpu: "1"
I'm not trying to provide any profile. I dont think it is this latter stuff. I'm thinking i somehow need to have the pod explicitly want nvidia.com/gpu=present
in the config.yaml. But I dont know how to achieve this. Am i making any sense?
The GKE docs seem to want a pod spec:
apiVersion: v1
kind: Pod
metadata:
name: my-gpu-pod
spec:
containers:
- name: my-gpu-container
image: nvidia/cuda:10.0-runtime-ubuntu18.04
resources:
limits:
nvidia.com/gpu: 2
So i guess the question is, where does this go in the jupyter singleuser yaml specs...
Ah, thats why then, because your pod will get a toleration when requesting a GPU for the taint that comes by adding GPUs to a nodepool.
So how to provide the additional resource request for your user pods through the helm chart configuration by default, rather than optionally through a choosable profile with the profile_list functionality of kubespawner? I found this: https://github.com/jupyterhub/zero-to-jupyterhub-k8s/blob/master/jupyterhub/values.yaml#L223
So, I think this would do it:
singleuser:
extraResources:
limits:
nvidia.com/gpu: 2
Thanks! It finally worked!
singleuser:
extraResource:
limits:
nvidia.com/gpu: 2
in addition to the user tolerations specified in the 0tojh docs works.
The default daemonset specified in the GCP docs works out of the bar with CUDA 10.1, no extra intervention is needed. Used:
https://github.com/rahuldave/docker-stacks
The GCP quota stuff taked about by https://github.com/jupyterhub/zero-to-jupyterhub-k8s/issues/994#issuecomment-457398750 are critical.
Next job is to get this working with a CPU node pool in a cluster and a GPU node pool in a cluster, and a profile (or i suppose i could just give one or other node pools a taint to have pods not go there...)
based on @jolibrain stacks.
User placeholders for GPU nodes: is it possible to do this now?
If i set up an autoscaling gpu cluster, not all nodes will have the nvidia daemonset installed right, unless i ask to place a pod on that node. So if i wanted to warm the "seats" of some gpu machines in the morning i would want to place some user-placeholders on them
kubectl scale statefulset user-placeholder --replicas=10
cannot target a particular profile yet though, right? So it does just depend on users waiting on a new node to have everything installed, every day.
Or am i missing something?
EDIT: would seem i need a toleration-specific stateful set?
For some reason the LD_LIBRARY_PATH environment variable was not carried forward to my notebook pods, even after whitelisting it.
I overcame this by adding a file cuda.conf containing the paths to the drivers to /etc/ld.so.conf.d/cuda.conf and then a bash cript that runs ldconfig in /usr/local/bin/before-notebook.d/. This got pytorch/fastai working with gpu's in notebooks.
@astrajohn I got that issue recently myself, and for me it was because:
jovyan
user in a sudo -E -u jovyan
command.This is the issue, the switch from the root user to the jovyan user will reset a certain set of paths as can be spotted with sudo sudo -V
. I've submitted a fix for this in https://github.com/jupyter/docker-stacks/pull/1052.
@rahuldave there are no GPU-specific user placeholder pods as part of the JupyterHub helm chart, but you can create it yourself by mimicing the statefulset called user-placeholder part of the helm chart and adjusting it slightly. Here is my attempt.
# Purpose:
# --------
# To have a way to ensure there is always X numbers of slots available for users
# that quickly needs some GPU.
#
# Usage:
# ------
# 1. Update metadata.namespace, spec.template.spec.priorityClassName,
# spec.template.spec.schedulerName with your namespace and helm release name
# respectively. Verify the namespace matches with where you deployed your
# JupyterHub helm chart by inspecting `kubectl get namespace`, and verify the
# helm release name with `kubectl get priorityclass`.
# 3. Optionally update spec.template.spec.affinity.nodeAffinity to match how
# your JupyterHub helm chart was configured.
# 4. Optionally configure your resource requests to align with what you
# provision your users in the Helm chart.
# 5. kubectl apply -f user-placeholder-gpu-daemonset.yaml
# 6. kubectl scale -n <namespace> sts/user-placeholder-gpu --replicas 1
# 7. You may want to ensure your continuous image puller have a toleration for
# `nvidia.com/gpu=present:NoSchedule` as well to prepull the images, assuming
# you have the same images for the GPU nodes as the CPU nodes.
#
# kubectl patch -n <namespace> ds/continuous-image-puller --type=json --patch '[{"op":"add", "path":"/spec/template/spec/tolerations/-", "value": {"effect":"NoSchedule", "key":"nvidia.com/gpu", "operator":"Equal", "value":"present"}}]'
#
apiVersion: apps/v1
kind: StatefulSet
metadata:
labels:
app: jupyterhub
component: user-placeholder-gpu
name: user-placeholder-gpu
namespace: jupyterhub
spec:
podManagementPolicy: Parallel
replicas: 0
selector:
matchLabels:
app: jupyterhub
component: user-placeholder-gpu
serviceName: user-placeholder-gpu
template:
metadata:
labels:
app: jupyterhub
component: user-placeholder-gpu
spec:
affinity:
nodeAffinity:
# Make this either requiredDuring... or preferredDuring... so it
# matches how you configured your Helm chart in
# scheduling.userPods.nodeAffinity.matchNodePurpose.
requiredDuringSchedulingIgnoredDuringExecution:
nodeSelectorTerms:
- matchExpressions:
- key: hub.jupyter.org/node-purpose
operator: In
values:
- user
containers:
- image: gcr.io/google_containers/pause:3.1
name: pause
resources:
limits:
# OPTIONALLY: Configure these to align with how you configure your
# users resource limits and requests in the JupyterHub Helm chart.
# This is only relevant if you will have a mix of gpu and non-gpu
# users on this GPU node though, as the limiting resource then could
# end up being something else than the GPUs.
nvidia.com/gpu: 1
requests:
nvidia.com/gpu: 1
priorityClassName: jupyterhub-user-placeholder-priority
schedulerName: jupyterhub-user-scheduler
tolerations:
- effect: NoSchedule
key: hub.jupyter.org_dedicated
operator: Equal
value: user
- effect: NoSchedule
key: hub.jupyter.org/dedicated
operator: Equal
value: user
updateStrategy:
rollingUpdate:
partition: 0
type: RollingUpdate
I've been playing with NVIDIA's helm chart for injecting GPU drivers etc as an alternative to Google's daemonset, worth looking into: https://github.com/NVIDIA/gpu-operator
I think this could provide an upstream helm chart dependency that could be included in response to a values.yaml setting to enable GPUs, which seems like a more z2jh approach?
Thanks for describing that as an option @snickell ! I think it is an approach that makes sense to document, but not to have as an optional chart dependency. Adding this would be a maintenance challenge too big to keep current i think.
Having a working example in docs with a timestamp on when it worked seems like a good path in between to me.
Totally makes sense to me to document the setup nicely and move on.
Seems like GPU support out of the box would be a "nice to have someday" in the core, or perhaps in a companion chart?
It's too bad the GPU setup story is so complex (at least on GKE) in 2020, its deceptively easy to click the "yes give me a GPU in my pool!" button and surprisingly hard to get it all going correctly. IMO A great starting point would be if GKE would allow the GPU node taints to be optional, but of course that's not in our control 🤷🏾♀️ It'd sure be nice if NVIDIA and AMD got together and released gpu-operator :-P
In case its helpful:
No pressure thought: On the companion chart front, one idea would be to have z2jh-experimental-gpu that pulls z2jh as its helm dependency, and sprinkles in 'best effort no guarantees' GPU bits. In my experience docs rot a lot faster than repos, because repos tend to get issues / PRs faster (for better or worse 🤣)
Having an experimental companion chart would also start the process of building a foundation for GPUs that could someday be folded into z2jh mainstream. e.g. if its 2023 and its been a couple years since z2jh-experimental-gpu had major change and a ton of people are using it, you have info on how good the setup is and might think "lets put that in the main chart". Whereas with a documented example, you never quite know who's doing what, and how well its working.
Side note: I'm curious to read about the Z2JH travis setup, is there somewhere I can look to learn more about that?
It's being refactored at the moment, see https://github.com/jupyterhub/zero-to-jupyterhub-k8s/pull/1664
On the companion chart front, one idea would be to have z2jh-experimental-gpu that pulls z2jh as its helm dependency, and sprinkles in 'best effort no guarantees' GPU bits
That sounds sensible, it's the model used by the BinderHub Helm chart.
This issue has been mentioned on Jupyter Community Forum. There might be relevant details there:
https://discourse.jupyter.org/t/cuda-and-images-problem-on-jupyterhub-k8s/5139/2
@meeseeksmachine Thank you for you attention! @consideRatio Dear my friend, I according your tutorial to build my singlegpuuser
such below:
Dockerfile
# For the latest tag, see: https://hub.docker.com/r/jupyter/datascience-notebook/tags/
FROM jupyter/datascience-notebook:5197709e9f23
RUN conda config --set channel_priority flexible
# GPU powered ML
# ----------------------------------------
RUN conda install --yes --quiet --force-reinstall \
tensorflow-gpu \
cudatoolkit=10.2 && \
conda clean -tipsy && \
fix-permissions $CONDA_DIR && \
fix-permissions /home/$NB_USER
# Allow drivers installed by the nvidia-driver-installer to be located
ENV LD_LIBRARY_PATH=${LD_LIBRARY_PATH}:/usr/local/nvidia/lib64
# Also, utilities like `nvidia-smi` are installed here
ENV PATH=${PATH}:/usr/local/nvidia/bin
But, unfortunately, I could still not find cuda
information when I begin notebook with my singlegpuuser
image. I could see nvidia driver information by using nvidia-smi
.
Is it because I did not complete step 5? I saw the nvidia information before I built and used this image. Just add below information into jhub
config.yaml
.
singleuser:
profileList:
- display_name: "Dedicated, 2 CPU cores & 13GB RAM, 1 NVIDIA Tesla K80 GPU"
description: "By selecting this choice, you will be assigned a environment that will run on a dedicated machine with a single GPU, just for you."
kubespawner_override:
image: consideratio/singleuser-gpu:v0.3.0
extra_resource_limits:
nvidia.com/gpu: "1
So, tell me how to do? Thanks!
Has anyone done something similar to this using AWS EKS?
I currently have GPU nodes running in my cluster with the official Amazon EKS optimized accelerated Amazon Linux AMIs, which from my understanding, has the NVIDIA drivers and the nvidia-container-runtime (as the default runtime) already installed.
I can successfully run a pod based off of the nvidia/cuda:9.2-devel
image, which gives me some output for nvidia-smi
.
However, I am having trouble constructing my own Docker image on top of one of the jupyter/docker-stacks images, as in this demonstration. I tried using the same Dockerfile shown above, but I cannot successfully run nvidia-smi
, which gives me the following output: Failed to initialize NVML: Unknown Error
. I noticed that I do not have a /usr/local/nvidia
directory. I do have /usr/bin/nvidia-smi
, however.
I don't believe there is an equivalent to the nvidia-driver-installer daemonset for AWS EKS, as described above, so that is one difference here, but as described in the AWS docs, it sounds like the official AMI that I'm using should take care of the drivers already.
It's hard to tell what's missing here. I'd really appreciate any help!
@jkovalski I was able to get this working on EKS.
There's an nvidia daemonset that you need to run on kubernetes in order for the kubernetes containers to use the nvidia GPUs. I think the AWS AMI includes the drivers, but not the kubernetes daemonset. I followed this tutorial: https://aws.amazon.com/blogs/compute/running-gpu-accelerated-kubernetes-workloads-on-p3-and-p2-ec2-instances-with-amazon-eks/
Then for the user notebook image, I used this: https://hub.docker.com/r/cschranz/gpu-jupyter/ which uses the NVIDIA CUDA image as the base, and installs the jupyter/docker-stacks dependencies on top.
@jeffliu-LL Hm, so I have the NVIDIA device plugin daemonset already running in my cluster, and the Jupyter singleuser notebook pods are successfully getting scheduled onto my GPU node (p3.2xlarge), so that part is working. I'm also using the official AWS AMI: amazon-eks-gpu-node-1.16-*
.
I also tried using that image that you linked, but when I try to run nvidia-smi
, I get: Failed to initialize NVML: Unknown Error
. Specifically, I am using cschranz/gpu-jupyter:v1.1_cuda-10.1_ubuntu-18.04_python-only
.
My issue seems to be related to the Docker image / the pod having access to the underlying GPU. Did you have to add anything to the Docker image to make it work?
@jeffliu-LL Following up - the image seems okay. I spun up a pod using that base image I referenced above, and I was able to successfully run the nvidia-smi
command, which I saw from the pod's logs. It only seems to not work when I'm running it in the context of JupyterHub. Is there anything that you had to add in the JupyterHub configuration, maybe for singleuser specifically?
GPU powered machine learning on GKE
To enable GPUs on GKE, this is what I've done. Note that this post is a Work In Progress and will be edited from time to time. To see when the last edit was made, see the header of this post.
Prerequisite knowledge
Kubernetes nodes, pods and daemonsets
A node is represents actual hardware on the cloud, a pod represents something running on a node, and a daemonset will ensure one pod running something is created for each node. If you lack knowledge about kubernetes, I'd recommend learning more at their concepts page.
Bonus knowledge:
This video provides a background allowing you to understand why additional steps is required for this to work: https://www.youtube.com/watch?v=KplFFvj3XRk
NOTE: Regarding taints. GPU nodes will get them on GKE, and pods requesting them will get tolerations, without any additional setup.
1. GKE Kubernetes cluster on a GPU enabled zone
Google has various zones (datacenters), some does not have GPUs. First you must have a GKE cluster coupled with a zone that has GPU access. To find out what zones has GPUs and what kind of GPUs it has, see this page. In overall performance and cost, K80 < P100 < V100. Note that there is also TPUs and that their availability is also zone dependant. This documentation will not address utilizing TPUs though.
Note that GKE Kubernetes clusters comes with a pre-installed with some parts needed for GPUs to be utilized:
nvidia-gpu-device-plugin
. I don't know fully what this does yet.nvidia.com/gpu: 1
properly.2. JupyterHub installation
This documentation assumes you have deployed a JupyterHub already by following the https://z2jh.jupyter.org guide on your Kubernetes cluster.
3. Docker image for the JupyterHub users
I built an image for a basic Hello World with GPU enabled Tensorflow. If you are fine to utilize this, you don't need to do anything further. My image is available as
consideratio/singleuser-gpu:v0.3.0
.About the Dockerfile
I build on top of a jupyter/docker-stacks image to allow JupyterHub to integrate well with. I also pinned
cudatoolkit=9.0
, it is a dependency oftensorflow-gpu
but would install with a even newer version that is unsupported by the GPUs I'm aiming to use, namely Tesla K80 or Tesla P100. To learn more about these compatibility issues see: https://docs.anaconda.com/anaconda/user-guide/tasks/gpu-packages/Dockerfile reference
NOTE: To make this run without a GPU available, you must still install an nvidia driver. This can be done using
apt-get install nvidia-384
, if you do, this must not conflict with thenvidia-driver-installer
daemonset later that still needs to run sadly afaik. This is a rabbithole and hard to maintain I think.3B. Create an image using repo2docker (WIP)
https://github.com/jupyterhub/team-compass/issues/96#issuecomment-447033166
4. Create a GPU node pool
Create a new node pool for your Kubernetes cluster. I choose a
n1-highmem-2
node with a Tesla K80 GPU. These instructions are written and tested for K80 and P100.Note that there is an issue of using autoscaling from 0 nodes, and that it is a slow process to scale up a GPU node as it needs to start, install drivers, and download the image file - each step takes quite a while. I'm expecting 5-10 minutes of startup for this. I recommend you start out with using a single fixed node while setting this up initially.
For details on how to setup a node pool with attached GPUs on the nodes, see: https://cloud.google.com/kubernetes-engine/docs/how-to/gpus#create
5. Daemonset: nvidia-driver-installer
You need to make sure the GPU nodes gets appropriate drivers installed. This is what the
nvidia-driver-installed
daemonset will do for you! It will install drivers and utilities in/usr/local/nvidia
, which is required for the conda packagetensorflow-gpu
for example to function properly.NOTE: Tensorflow have a pinned dependency on cudatoolkit, and a given cudatoolkit requires a minimum NVIDIA driver version.
tensorflow=1.11
andtensorflow=1.12
requirescudatoolkit=9.0
andtensorflow=1.13
will requirecudatoolkit=10.0
for example,cudatoolkit=9.0
requires a NVIDIA driver of at least version384.81
andcudatoolkit=10.0
requires a NVIDIA driver of at least version410.48
.Set a driver version for the nvidia-driver-installer daemonset to install
The default driver as of writing for the daemonset above, is
396.26
. I struggled with installing that without this daemonset, so I ended up using384.145
instead.Option 1: Use a one liner
Option 2: manually edit the daemonset manifest...
Reference: https://github.com/GoogleCloudPlatform/container-engine-accelerators/tree/master/cmd/nvidia_gpu
6. Configure some spawn options
Perhaps the user does not always need a GPU, so it is good to allow the user to choose instead. This can be done with the following configuration.
Result
Note that this displays a screenshot of the configuration I've utilized, which differs slightly from the example configuration and setup documented in this post.
7. Verify GPU functionality
After you have got a Jupyter GPU pod launched and running, you could verify your GPU works as intended by...
TensorFlow-Examples/notebooks/convolutional_network.ipynb
, and run all cells.Previous issues
Autoscaling - no longer an issue?
UPDATE: I'm not sure why this happened, but it doesn't happen any more for me.
I've had massive trouble autoscaling. I managed to autoscale from 1 to 2 nodes, but it took 37 minutes... Autoscale down worked as it should, with 10 minutes of a unused GPU node for the be scaled down.
To handle the long scale up time, you can configure a long timeout for kubespawner's spawning procedure like this:
Latest update (2018-11-15)
I got autoscaling to work, but it is slow still, it takes about 9 minutes plus the time for your image to be pulled to the new node. Some lessons learned:
The cluster autoscaler runs simulations using a hardcoded copy of kube-scheduler default configuration logic, so utilizing a custom kube-scheduler configuration with different predicates could cause issues. See https://github.com/kubernetes/autoscaler/issues/1406 for more info.
I stopped using a dynamically applied label as a label selector (
cloud.google.com/gke-accelerator=nvidia-tesla-k80
). I don't remember if this worked at all with the cluster autoscaler, and that it worked to scale from both 0->1 node and from 1->2 nodes. If you want to select a specific GPU from multiple node pools, I'd recommend adding your own pre-defined labels likegpu: k80
and using them to nodeSelector select on.I started using the default-scheduler instead of the jupyterhub-user-scheduler as I figure it would be safer to not risk there was a difference in what predicates they used even though they may have the exact same predicates configured. NOTE: a predicate is a function that takes information about a node in this case, and returns true or false if the node is a candidate to be scheduled on.
To debug the autoscaler:
kubectl describe pod -n jhub jupyter-erik-2esundell
Look for the node pool in the output, mine was named
user-k80
Inspect the status of your node-pool regarding
cloudProviderTarget
,registered
andready
.You want all to become
ready
.You can also inspect the node events with
kubectl describe node the-name-of-the-node
:Potentially related:
I'm using Kubernetes
1.11.2-gke.9
, but my GPU nodes apparently have1.11.2-gke.15
. Autoscaling from 0 nodes: https://github.com/kubernetes/autoscaler/issues/903User placeholders for GPU nodes
Currently the user placeholders can only go to one kind of node pool, and it would make sense to allow the admin to configure how many placeholders for a normal pool and how many for a GPU pool. They are needed for autoscaling ahead of arriving users to not force them to wait for a new node, and this could be extra relevant for GPU nodes as they may need to be created on the fly every time for an arriving real user without the user placeholders.
We could perhaps instantiate multiple placeholder deployment/statefulsets based on a template and some extra specifications.
Pre pulling images specifically for GPU nodes
Currently we can only specify one kind of image puller, pulling all kinds of images to a single type of node. It is pointless to pull and especially to wait for image pulling of unneeded images, so it would be nice to optimize this somehow.
This is tracked in #992 (thanks @jzf2101!)
The future - Shared GPUs
Users cannot share GPUs like they can share CPU, this is an issue. But in the future, perhaps? From what I've heard this is something that is progressing right now.