berkeley-dsep-infra / datahub

JupyterHubs for use by Berkeley enrolled students
https://docs.datahub.berkeley.edu
BSD 3-Clause "New" or "Revised" License

Document process for data retrieval. #2686

Open felder opened 2 years ago

felder commented 2 years ago

I sent signed URLs to the people who requested their data and typed up some notes as I went along, which I shared with @balajialg.

balajialg commented 2 years ago

Thanks, @felder for the detailed write-up! Appreciate it!

A couple of questions:

  1. In general, how much time does it take to share the data for a student request?
  2. As a back-of-the-envelope estimate, how hard would it be to automate such requests with scripts in the future? Is this feasible, and can it be added to our backlog? @yuvipanda Please feel free to chime in!

felder commented 2 years ago

@balajialg

  1. About 15 minutes. It's a pretty quick task.
  2. I think this could be automated to be self-serve. Ideally it would be CalNet authenticated, which would remove the need for us to verify requests while also ensuring that a student can only request their own files. However, that might present some edge cases, such as a student who no longer has hub access needing to retrieve their data.
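
A minimal sketch of the self-serve idea above, to make the two requirements concrete: the requester may only ask for their own files, and the link they receive expires. Everything here is an assumption for illustration: the secret, the host, the `/archives/<uid>.tar.gz` path layout, and the function names are hypothetical, and the HMAC scheme is a generic stand-in rather than the signed URL mechanism actually used for these requests. A real deployment would more likely rely on the storage provider's own signed URL feature, with the authenticated user ID coming from CalNet.

```python
import hashlib
import hmac
import time
from urllib.parse import parse_qs, urlencode, urlparse

# Hypothetical values; none of these come from the datahub deployment.
SECRET_KEY = b"replace-with-a-real-secret"
BASE_HOST = "https://example.invalid"


def _signature(path: str, expires: int) -> str:
    """HMAC-SHA256 over the path and expiry, so neither can be altered."""
    message = f"{path}\n{expires}".encode()
    return hmac.new(SECRET_KEY, message, hashlib.sha256).hexdigest()


def make_signed_url(authenticated_uid, requested_uid, ttl_seconds=3600, now=None):
    """Return a time-limited URL, but only for the requester's own archive."""
    if authenticated_uid != requested_uid:
        raise PermissionError("students may only request their own files")
    now = int(time.time()) if now is None else now
    expires = now + ttl_seconds
    path = f"/archives/{requested_uid}.tar.gz"  # assumed layout
    query = urlencode({"expires": expires, "signature": _signature(path, expires)})
    return f"{BASE_HOST}{path}?{query}"


def verify_signed_url(url, now=None) -> bool:
    """Check that the URL has not expired and has not been tampered with."""
    parsed = urlparse(url)
    params = parse_qs(parsed.query)
    try:
        expires = int(params["expires"][0])
        signature = params["signature"][0]
    except (KeyError, ValueError):
        return False
    now = int(time.time()) if now is None else now
    if now > expires:
        return False
    expected = _signature(parsed.path, expires)
    # Constant-time comparison on bytes avoids leaking timing information.
    return hmac.compare_digest(signature.encode(), expected.encode())
```

The edge case mentioned above (a student who no longer has hub access) is not covered by this sketch; such requests would still need a manual path.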

balajialg commented 2 years ago

@felder Thanks for the info! We can definitely handle edge cases the way we handle them right now. I can create a GitHub issue for automating such requests and add it to the project board backlog!