Closed monarin closed 1 month ago
I'm going to reformat your description here to restore the "Description" header, approve the workflow run, and add a few people as reviewers
For reviews, I'm pinging Silke because she has most of the commits on the previous restartdaq
script, but she may not have time, so I'm also pinging Vincent because he has the most commits on this repo out of all the CDSOs.
HI all, sorry for my delayed response. I really appreciate a lot of comments and feedbacks here! Sorry if I can't reply directly to all these suggestions now, I hope to be sitting down and working with both chris and silke next week so we can work through these. Hopefully the next version will be more ready.
I should have mentioned this last week, but the core intent and implementation here (the main points in the PR description, such as switching to slurm) were good and didn't need any scrutiny. Aside from some questions for understanding, my comments are largely targeting maintainability (putting all the cli parsing in one place, avoiding confusing error tracebacks, etc.).
I hope I didn't come across as too critical and I'm sorry if I did.
For the old setup, I wanted to cd into the same directory so that the temporary files such as pid.txt would always end up in the same space.
I think this is good to merge if someone tries it out and confirms it does what they need
We decided to rename the slurm-for-daq manager from daqbatch to daqmgr. This is to avoid technical details in the name. We needed to complete this name change while it's still early enough. Unfortunately, this happened to be in the middle of this PR and I had to complete the work here too to verify the test at tmo and rix.
I'm looking at the pre-commit errors. Can anyone advise how to run pre-commit test interactively?
First, please ignore any pre-commit warnings from parts you did not edit. We're in the process of cleaning some things up in #204.
Some of the pre-commit jobs will fix the formatting for you, others will just complain.
To run pre-commit interactively:
source pcds_conda # /cds/group/pcds/pyps/conda/pcds_conda if not on PATH
pre-commit install # optional, makes pre-commit run before every commit for this repo
pre-commit run --files <filename> # make pre-commit run on a specific file
(for example, please ignore shellcheck for now- it's not on you to edit the entire restartdaq script)
Description
Motivation and Context
All slurm jobs launched by daqmgr are stored in slurm database. Daqutils added here is written in python to ease the job of reading of and parsing slurm job statuses. Some of these utilities are slimmed down version of what already exist in daqmgr making it easier to future maintenance work.
Replace procmgr with slurm.
How Has This Been Tested?
The three commands (restartdaq, stopdaq, and wheredaq) have been tested at tmo and rix.
Similar to the existing requirement, **/engineering_tools/scripts must be part of $PATH for -opr user accounts.
The changes are minimal. None of the preexisting code have been modified apart from the top part of the scripts, which allow overriding daq-related commands.
Where Has This Been Documented?