Closed rabernat closed 6 years ago
To summarize how we got here.
Skipping a deployment with the script provider because this is not a tagged commit
.Help needed! cc @jacobtomlinson, @yuvipanda
On a separate but related note, it is still not entirely clear to me what a separate helm chart in another repo gets us. To make pangeo work we still need to include many configuration changes in a config.yaml (and a secrets.yaml), and so a separate helm chart just splits some of the required configurations into another repo, making changes to pangeo more complicated. I'm probably just not understanding it. What am I missing?
That helm chart is also used in other deployments, particularly the UK Met's deployment. The stuff in this repository is specific to the deployment in pangeo.pydata.org
That being said, there is clearly more sorting out that should be done to make things work cleanly for everyone.
On Sun, Apr 29, 2018 at 11:04 AM, Tim Crone notifications@github.com wrote:
On a separate but related note, it is still not entirely clear to me what a separate helm chart in another repo gets us. To make pangeo work we still need to include many configuration changes in a config.yaml (and a secrets.yaml), and so a separate helm chart just splits some of the required configurations into another repo, making changes to pangeo more complicated. I'm probably just not understanding it. What am I missing?
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/pangeo-data/helm-chart/issues/13#issuecomment-385257765, or mute the thread https://github.com/notifications/unsubscribe-auth/AASszIknEBp5ZY8VCQaFkXFZnqqwcjOyks5ttdaTgaJpZM4Tq_Da .
There are definitely many ways of doing things! I am currently experimenting with stripping almost everything out of the notebook image and using pip --user installs on the first user login (through minor mods to prepare.sh). I am also working on giving the worker pods the same home directory as the user. This is allowing me to massively reduce the size of the images which has greatly reduced the time required to launch workers and notebook server pods, especially when deploying to a new node. I have increased the singleuser storage capacity slightly to deal with this. Just another way of doing things. I agree that a discussion of the overall structure of Pangeo might be a good thing to have at some point in the future.
Sharing the home directory does sound very nice. I think that the UK met deployment does this to good effect. CC @jacobtomlinson
It would also be nice to give up controlling the environment entirely to the user
On Sun, Apr 29, 2018 at 11:31 AM, Tim Crone notifications@github.com wrote:
There are definitely many ways of doing things! I am currently experimenting with stripping almost everything out of the notebook image and using pip --user installs on the first user login (through minor mods to prepare.sh). I am also working on giving the worker pods the same home directory as the user. This is allowing me to massively reduce the size of the images which has greatly reduced the time required to launch workers and notebook server pods, especially when deploying to a new node. I have increased the singleuser storage capacity slightly to deal with this. Just another way of doing things. I agree that a discussion of the overall structure of Pangeo might be a good thing to have at some point in the future.
— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/pangeo-data/helm-chart/issues/13#issuecomment-385259569, or mute the thread https://github.com/notifications/unsubscribe-auth/AASszIHUe1uB4LEEdEdVW8Zkf0UZeWfWks5ttdy-gaJpZM4Tq_Da .
I agree this would be great, and I am looking into various options in Azure, which does have options for ReadWriteMany PersistantVolume access modes. It's not clear how easy this would be for GCE, although there was some discussion a while back in pangeo-data/pangeo#88. I feel like I have pulled this thread off topic, but maybe it would be worth picking it back up over there, or even starting a new issue focused on linking worker/notebook pods more generally or giving more control to users.
I think the issue is to do with it being run from a PR. Only a branch build can decode the encrypted info for security reasons. Also it looks like there is a bug with checking our the right refs.
To answer @tjcrone the idea behind a separate repo is to allow people (like me) to use it as a basis for my own customized pangeo stack. helm has the concept of dependencies and I wanted to create a custom chart which depends on pangeo. As @mrocklin says the pangeo-data
repo is for https://pangeo.pydata.org deployment specific stuff, this repo is for generic stuff.
We are still looking into shared home directories. Our main block is using ReadWriteOnce
volumes backed by EBS. I'm planning on switching home directories to ReadWriteMany
volumes backed by NFS, which means they can be mounted by the workers too.
I totally agree that having the environment installed in the user home would save a lot of headaches, however moving away from conda
is a big step and will be a problem for some users as not everything is available from pip
.
Also just to add the deploy script has never worked. The version available now was deployed locally.
I'm just working on it now to get it deploying automatically.
I have updated the deployment script to successfully deploy and also added an index page based on the jupyterhub one to make it easier to find the latest chart.
Thank you so much for sorting this out! Sorry for making a mess of the repo. On the upside, now I understand better how chartpress works.
I know I am making a mess here. But I can't understand how it is possible that the original deployment script ever actually worked.
With my new modifications (#12), the deploy now fails.
This is the error it gives: