Closed tomeichlersmith closed 2 years ago
May 16, 2022
Focus on simple solution that is long term supportable.
As hardware dies we can get rid of it.
In number of cores, the cluster is comparable to Europe Tier 2 sites. Storage capacity is the biggest issue - Hadoop is dying without new hardware.
Upgraded head-node would be very helpful.
Allows remote jobs to write to our area.
The first diagram on this page is a good top-level view. https://opensciencegrid.org/docs/compute-element/hosted-ce/
I'm looking over the Hosted CE requirements. Please review these:
Other pieces OSG Repository available to all nodes - Easy to set with Puppet EPEL Repository for Singularity - Already set in Puppet CVMFS - Already set in Puppet It looks like the worker node containers are not to configure the worker nodes but to send jobs to the queue as a container.
traydock-osg module in puppet
Reach out to Bryan Lim at UW to talk about Condor and OSG.