berkeley-dsep-infra / datahub

JupyterHubs for use by Berkeley enrolled students
https://docs.datahub.berkeley.edu
BSD 3-Clause "New" or "Revised" License
66 stars 39 forks source link

Semester End Clean Up Tasks! #4102

Open balajialg opened 1 year ago

balajialg commented 1 year ago

Summary

At the end of every semester, we need to perform the following housekeeping tasks. Collating them here so that we can prioritize these activities after the end of every semester.

Important information

Spring 23 semester ends May 12th!

Any other activity I am missing?

ryanlovett commented 1 year ago

R libraries may be tagged with comments mentioning the course and term for which they are requested. Should we remove them during maintenance windows after every term and require instructors to request them the next time they’re needed? This would help reduce the size of the image, but could lead to more CI builds at the beginning of the term if people don’t prep in advance. I’m in favor of removal but it should be discussed and perhaps raised with users (instructors).

We do need an R popularity dashboard.

balajialg commented 1 year ago

@ryanlovett I am thinking of adding a question "Whether the package requested has an end date for removal from the image" or something related to the template for package request. What do you think?

I am all in favor of building a R popularity dashboard as highlighted in this issue #2942. We should plan some dev cycles in the next few months if possible.

ryanlovett commented 1 year ago

@balajialg That is logical, but my guess is that instructors would want to specify no end date more often than not. Other infra devs may feel differently, but I think at least for the non-core courses, libraries should be opt-in every term. Smaller images means faster node start up which means faster scaling.

And yes, an R popularity dashboard is crucial. If we have that, we can feel better about removing libraries.

balajialg commented 1 year ago

@ryanlovett Sounds good. It will be a good idea to analyze the hubs we want to prune down the image (I am assuming the generic hubs). I will plan to blast out an email at the end of the semester highlighting the image pruning activity and request instructors to raise github issues for packages (Added this to the to-do list)