geneontology / pipeline

Declarative pipeline for the Gene Ontology.
https://build.geneontology.org/job/geneontology/job/pipeline/
BSD 3-Clause "New" or "Revised" License

Consider (and attempt) blessing snapshot runs to "release" status #352

Open kltm opened 5 months ago

kltm commented 5 months ago

Look at blessing snapshots to release, to:

No new libraries or technologies. The only "interesting" additions would likely be:

kltm commented 4 months ago

Noting that we already have a week-long holding pen for snapshots built in for debugging, during the "Publish" step. If I switch these over to auto-cleaning by bucket policy, they would give us a clean jump-off point for the manual publication we're already doing because of the Zenodo instability. This holding pen could be arbitrarily extended from a week to however long we want.

While this very much falls short of a full after-the-fact "blessing" system, it is very much in line with current practices, and I believe that by changing a few lines of the current manual release SOP, we could bring up a successful snapshot.

@pgaudet What are the minimum indicators you need to know whether a snapshot is worthwhile? Would you be able to look at the stats and, if they look okay, let me know so I could put it out on the experimental AmiGO for you to take a closer look? How would letting you know work? Could I just sign you up for all successful snapshot run emails, and you get back to me when the timing feels right? If this kind of thing might work for you, I think I have a fairly quick way forward:

kltm commented 4 months ago

7-day existence rule added; we should see results very soon.

kltm commented 4 months ago

The dailies now auto-clean. Moving forward, we can use these as a clean base, within a week, to create a release.
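For illustration, the auto-clean could be implemented as an S3 lifecycle rule applied once from a pipeline step. This is only a sketch: the bucket name, prefix, and the decision to manage the rule from Jenkins (rather than directly in AWS) are all assumptions, not the pipeline's actual configuration.

```groovy
// Hypothetical sketch: apply a 7-day expiry rule to the daily
// holding-pen prefix. Bucket and prefix names are invented.
stage('Apply holding-pen lifecycle') {
    steps {
        sh '''
            aws s3api put-bucket-lifecycle-configuration \
              --bucket go-pipeline-holding-pen \
              --lifecycle-configuration '{
                "Rules": [{
                  "ID": "expire-dailies-after-7-days",
                  "Filter": {"Prefix": "daily/"},
                  "Status": "Enabled",
                  "Expiration": {"Days": 7}
                }]
              }'
        '''
    }
}
```

With a rule like this in place, anything under the prefix is deleted by S3 itself seven days after creation, so no cleanup step needs to run in the pipeline.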

pgaudet commented 3 months ago

@kltm

What are the minimum indicators you need to know whether a snapshot is worthwhile? Would you be able to look at the stats and, if they look okay, let me know so I could put it out on the experimental AmiGO for you to take a closer look? How would letting you know work? Could I just sign you up for all successful snapshot run emails, and you get back to me when the timing feels right?

The same procedure as we have now for the release seems appropriate:

  1. I get a notification that a release/snapshot is ready to be checked. Note that having the data on some experimental AmiGO is required for the checks to be carried out.
  2. I look at the stats, and if all is OK, I notify you. Right now this communication is by email; we can change that if needed.

Does that answer all the questions?

Thanks, Pascale

kltm commented 3 months ago

Talking to @pgaudet this morning: until we've run through this a couple of times to work out the kinks (or have machinery that gets us back to where we were), we'll:

kltm commented 3 months ago

Okay, after a little consideration, I think I have some "easy" ways forward, although any one might take a day or so to put together. Essentially, the issue is a bad Docker/Jenkins interaction. I can now see a few ways to bypass it:

  1. Break the pipeline into two pieces, pre-index and post-index, and do the middle part (essentially) manually. While labor-intensive, this is nearly guaranteed to be tractable.
  2. Set a pipeline (snapshot) to use a single standing Docker instance to build the index. The possible issue here is that "remote controlling" Docker may be a big PITA, but we'd bypass the interaction bug and keep full automation.
  3. Break the Solr load into smaller pieces that individually shouldn't have the footprint to stop things. I think this would likely work, but it would be slow to test.

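As a rough sketch, option 3 could turn the single large load into a loop of smaller ones. The batch names and the loader invocation below are assumptions for illustration, not the real pipeline's commands.

```groovy
// Hypothetical sketch of option 3: load Solr in smaller batches so no
// single step has the footprint that triggers the Docker/Jenkins
// interaction bug. Batch names and the loader script are invented.
def batches = ['ontology', 'annotations-part-1', 'annotations-part-2', 'models']

stage('Load Solr in pieces') {
    steps {
        script {
            for (b in batches) {
                // Each piece is its own sh step, so a failure is
                // isolated to one batch rather than the whole load.
                sh "./load-solr.sh --batch ${b}"
            }
        }
    }
}
```
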
kltm commented 3 months ago

Actually, poking around in this, I think I'm going to try something else first:

  1. "Catch" the error, wait, and then continue. I'm going to take a look at the Jenkins docs, but IIRC this is supported.

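Jenkins does support this pattern inside a `script` block with `try`/`catch` (and the built-in `sleep` step). A minimal sketch, where the failing step is a stand-in shell command rather than the pipeline's real load:

```groovy
// Hypothetical sketch: catch the failure, wait, then carry on.
stage('Load Solr') {
    steps {
        script {
            try {
                sh './load-solr.sh'
            } catch (err) {
                echo "Load failed once (${err}); waiting before retrying"
                sleep time: 5, unit: 'MINUTES'
                // Pick up again rather than failing the whole run.
                sh './load-solr.sh'
            }
        }
    }
}
```

A `retry(n) { ... }` wrapper is the other obvious shape for this, if a fixed number of re-attempts is acceptable.
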
kltm commented 3 months ago

Also, clarifying "3": to make this work, the whole image would have to be dropped and stood back up. If we go that way, there will be some temporary repetition, and we may have to introduce a template function to bypass the string limit we would almost immediately smack into.
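If the drop-and-restand has to be repeated per piece, the repetition could be factored into a Groovy function, which is the usual workaround for Jenkins' "method code too large" limit on generated pipeline code. All names below are hypothetical:

```groovy
// Hypothetical helper: drop and re-stand the Solr image, then load one
// piece. Factoring this out avoids repeating the block per piece and
// keeps the pipeline script under Jenkins' method size limit.
def reloadAndLoad(String piece) {
    sh 'docker rm -f golr || true'
    sh 'docker run -d --name golr our-golr-image'
    sh "./load-solr.sh --batch ${piece}"
}
```

Each piece in the pipeline then becomes a one-line call, e.g. `reloadAndLoad('annotations-part-1')`.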

kltm commented 3 months ago

Looking at the failure messages, and understanding that this is happening at a stage level (not a step level), I think I can change tack a little. I've created a new pipeline, snapshot-post-fail; it has the following properties:

I believe this should allow me to "hijack" the snapshot run with the new pipeline, picking up where the failed (but data-wise sound) run terminated.
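Abstractly, the hijack could look like the following. This is an illustrative shape only; the stage names, bucket paths, and scripts are invented, not taken from the actual snapshot-post-fail Jenkinsfile.

```groovy
// Hypothetical sketch of the "hijack": adopt the artifacts the failed
// (but data-wise sound) run already produced, then run only the
// post-failure stages.
pipeline {
    agent any
    stages {
        stage('Adopt snapshot data') {
            steps {
                // Pull down what the failed run already built.
                sh 'aws s3 sync s3://go-data/snapshot-in-progress/ ./work/'
            }
        }
        stage('Resume post-index steps') {
            steps {
                sh './publish.sh ./work/'
            }
        }
    }
}
```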

kltm commented 3 months ago

https://github.com/geneontology/pipeline/blob/snapshot-post-fail/Jenkinsfile

kltm commented 3 months ago

Cheers to @dustine32 for helping me out with a code review. Issues that I'll fix before proceeding:

kltm commented 3 months ago

@pgaudet I believe a snapshot has now gone through, using the modified pipeline. Would you be able to briefly review it? If it seems solid, we can either 1) attempt the new "promotion" procedure, where we try to take a snapshot and make it a release, or 2) do the same thing we did here for release, giving us a very high probability of success.

kltm commented 3 months ago

Noting that I'm now working toward something between the two options above. Essentially, I will take the release pipeline, remove the first part of it, and replace it with a "copy from snapshot" step. We can refine this model and the timing, but it's a huge improvement over what we have now (nothing). (@dustine32 I'll be hunting after you in the next day or so for a review of that change and a sanity check.)
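The "copy from snapshot" replacement could be as small as a single stage. A minimal sketch, assuming S3-to-S3 sync and invented bucket paths; the real source, destination, and any metadata rewriting would differ:

```groovy
// Hypothetical sketch: the release pipeline's build stages replaced by
// a straight copy from an already-vetted snapshot. Paths are invented.
stage('Copy from snapshot') {
    steps {
        sh 'aws s3 sync s3://go-data/snapshot/ s3://go-data/release/ --delete'
    }
}
```

The rest of the release pipeline (publication, notification, etc.) would then run unchanged against the copied data.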