simonsobs-uk / data-centre

This tracks the issues in the baseline design of the SO:UK Data Centre at Blackett
https://souk-data-centre.readthedocs.io
BSD 3-Clause "New" or "Revised" License
2 stars 1 forks source link

Feedback on user documentation #14

Closed rwf14f closed 7 months ago

rwf14f commented 11 months ago

Just a few notes on the user documentation:

Wallclock Time:

Container support:

rwf14f commented 11 months ago

Condor now also has a container universe which can run jobs in an apptainer container, so doesn't need docker installed.

ickc commented 9 months ago

About wall clock time, when is the 2nd constraint not the same as the first constraint? How is it going to track CPU hours?

ickc commented 8 months ago

We will not document the use of container above as it is not a supported configuration from us (SO:UK Data Centre) where we only tell the users "welcome to try". The notes above probably would be useful, but as far as I understand, HTCondor's universes are mutually exclusive as explained in their documentation, so probably parallel universe won't work with container universe. Unless this can be done, we should discourage the users to use container as that would limited their workflows to single node usage.

The wallclock time needs to be documented, but I don't think we have all the information to add such page yet. I envisioned it to be something like https://docs.nersc.gov/jobs/policy/ where wall-clock time should be part of the constraints on the computer systems that the user should be aware of. C.f. #6.

Pushing back to next release.

ickc commented 7 months ago

Addressed by fe63759, 71aa29e, and continue to #35 for the remaining clarification needed.