LLNL / ATS

ATS - Automated Testing System - is an open-source, Python-based tool for automating the running of tests of an application across a broad range of high performance computers.
BSD 3-Clause "New" or "Revised" License
7 stars 5 forks source link

same node flux usaage (--requires=-rank:1) not working #156

Open dawson6 opened 9 months ago

dawson6 commented 9 months ago

This was working,

But in pre-release testing for 7.0.116 it is not now.

That is I allocate 3 nodes ,and use the testing in the HelloSameNode test directory in the ATS repository, and the writes and reads which have the same --require=:rank:0 are not being run on the same node.

Need to follow up on this with the flux team.

Will not advertise this as ready in the 7.0.116 release of ATS.

dawson6 commented 9 months ago

Another data point, it does work if 'atsflux' is run on the login node. That is atsflux allocates the nodes and then works with that allocation to submit further flux commands.

But if one pre-allocates a node and runs 'ats' or 'atsflux' in that allocation then it fails.

This is a use case that we need to work.