Closed stefaniereuter closed 12 months ago
Could you try printing the node list in a per-performance hook in your test?
@run_before('performance')
def print_job_nodelist(self):
print(self.job.nodelist)
I'm not sure that the problem is in the way we poll Slurm as we get the Slurm's reported nodespec every time we poll.
I think the problem comes from logging that triggers the evaluation of the nodelist here too early, before the final node list populated:
Hi I found this closed issue #2567 but I think I'm running into the same problem. Problem: if I run a run only regression test on multiple nodes I currently only get the first startup node logged in my reframe perflog. I read in the issue mentioned above that the issue was, that 'sacct' wasn't called multiple times. But by running in debug mode '-v -v -v' reframe seems to call 'sacct' multiple times during the run and if I call the same 'sacct' command in a second terminal I get the correct nodelist.
I have the following setup: scheduler: slurm, launcher:mpirun ReFrame-HPC 4.4.0