jupyterhub / team-compass

A repository for team interaction, syncing, and handling meeting notes across the JupyterHub ecosystem.
http://jupyterhub-team-compass.readthedocs.io
62 stars 33 forks source link

Supporting Development of JupyterHub for Scalability #338

Open whatnick opened 4 years ago

whatnick commented 4 years ago

My organization ( Geoscience Australia ) would be interested in funding jupyterhub / kubespawner activity to enhance

What would be the avenue to achieve this. We run a couple of 100's of users Jupyterhub deployments on EKS.

Our userpod image is here and deployed on.

DigitalEarth Australia DigitalEarth Africa

welcome[bot] commented 4 years ago

Thank you for opening your first issue in this project! Engagement like this is essential for open source projects! :hugs:
If you haven't done so already, check out Jupyter's Code of Conduct. Also, please try to follow the issue template as it helps other other community members to contribute more effectively. welcome You can meet the other Jovyans by joining our Discourse forum. There is also an intro thread there where you can stop by and say Hi! :wave:
Welcome to the Jupyter community! :tada:

betatim commented 4 years ago

Hey there 👋 ! This sounds great!

For HA related stuff I'd talk with @minrk directly. There are a few issues and forum posts(?) on the jupyterhub repository about ideas and things to keep in mind. I think the short summary is "this is a lot trickier than most people think with a lot of trade-offs to be decided".

For the second bullet point: could you explain a bit more what you have in mind?

minrk commented 4 years ago

This conversation is moving over from gitter because @whatnick brought up the possibility of being able to fund this work so I figured we should have people here who can talk about receiving funding. For instance, Simula, where I work, can effectively contract-out my time via Simula Consulting, or pay me (as I currently am paid) via grants at Simula Lab. The technical discussion of HA is open at https://github.com/jupyterhub/jupyterhub/issues/1932. It's a big project, but one I've been thinking about a lot and would love to be able to focus on.

choldgraf commented 4 years ago

This might also be something we can help with via 2i2c if that is an attractive option for the community. I know that @yuvipanda has been interested in HA for a while as well.

whatnick commented 4 years ago

@betatim re the second point we have difficult time performing userpod environment management and have an idea to use overlayfs capabilites / bidirectional sync in Kubernetes to allocate a section of the user PVC to store user-specific site packages, also include a safe-mode profile while the overlay is disabled if the user manages to pollute their environment. This way we can get persistence of user installed packages and keep userpod docker images lightweight. However the current Kubespawner code does not support special modes for volume mounts to allow this to work.

consideRatio commented 3 years ago

Oh that sounds interesting @whatnick!

Is it correct that there is some part of the k8s specification within volumes or volumeMounts of a pod specification that would need to be supported by KubeSpawner, that currently isn't supported?

whatnick commented 3 years ago

We are looking to implement this pattern : https://itnext.io/using-overlay-mounts-with-kubernetes-960375c05959 in jupyterhub userpods.