ULHPC / puppet-slurm

A Puppet module designed to configure and manage SLURM(see https://slurm.schedmd.com/), an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters
Apache License 2.0
19 stars 25 forks source link

Allow for TRESBillingWeights #16

Closed Falkor closed 5 years ago

Falkor commented 5 years ago

See Trackable RESources (TRES).

Associated directives:

AccountingStorageTRES
PriorityWeightTRES
TRESBillingWeights
Falkor commented 5 years ago

Ex of hiera config:

# Which Trackable RESources (TRES) will be tracked in addition to
# billing, cpu, energy, memory and node
slurm::accountingstoragetres: gres/gpu,gres/gpu:volta
slurm::priorityweighttres:    CPU=1000,Mem=2000,GRES/gpu=300

slurm::partitions:
  'DEFAULT':
    TRESbillingweights: 'CPU=1.0,Mem=0.25G,GRES/gpu=15.0'
    content: State=UP AllowGroups=clusterusers AllowAccounts=ALL DisableRootJobs=YES OverSubscribe=NO
  'admin':
    priority: 100
    nodes: 'node-[1-2]'
    content: Hidden=YES DefaultTime=0-10:00:00 MaxTime=5-00:00:00 MaxNodes=UNLIMITED  AllowQos=qos-admin
  'interactive':
    priority: 20
    nodes: 'node-1'
    content: DefaultTime=0-1:00:00  MaxTime=0-4:00:00  AllowQos=qos-besteffort,qos-interactive DefMemPerCPU=400 MaxMemPerCPU=400
  'batch':
    priority: 20
    nodes: 'node-[2-2]'
    content: DefaultTime=0-2:00:00  MaxTime=5-00:00:00 MaxNodes=UNLIMITED Default=YES AllowQos=qos-besteffort,qos-batch DefMemPerCPU=400 MaxMemPerCPU=400
Falkor commented 5 years ago

Lint compliance, see 53aca5d4f9d97ea67a08d1a762087747f17dae9e