Open SteVwonder opened 5 years ago
Notes from the group meeting:
resource-query
to generate visualizations like ORNL's Jsrun visualizerWould be nice to extend resource-query to generate visualizations like ORNL's Jsrun visualizer
If there is already a visualizer that supports a reasonable input format and meets our requirement, this would be a matter of adding a new writer.
The three main use-cases for tiered IO (that we see so far) are:
/tmp
file system for the given node, job, or workflowWe want to make sure we can express all three of these use-cases in the jobspec. With existing systems, caching and scratch are mutually exclusive with each other but compatible with staging; that may not always be the case.
Idea 1
We could take a similar approach to how it is done already (Slurm @ NERSC link) by sticking all of this information under
attributes.system.tiered-io
(or similar). For example:I think this method starts to break down once you have multiple tiers of storage. We would have to embed locality information into the attributes section, which seems wrong.
Idea 2
I think a better idea would be to lift some of this information (i.e., capacity and locality) into the
resource
section: To avoid adding all of the information to the resource vertices, we can addlabel
s on the resources and reference them in theattributes
section:This second idea could be really powerful for hybrid architectures where there are SSDs in multiple storage tiers, each of which a job may want to configure differently. It gives us lots of flexibility and allows us to support hybrid architectures without requiring any modifications to the existing canonical jobpsec. It also provides a separation of concerns, the resource info that the scheduler cares about is up in the
resources
section and the configuration/staging info that the IMP/job-shell cares about is down in theattributes
section.Any thoughts?