Closed heliosfa closed 2 years ago
Not directly related to this issue, but an interesting observation to do with Ayres. The spurious traffic is not limited to receiving packets with bad application IDs or packets for external devices, but we are also apparently getting random BARRIER packets while running and also spurious Debuglink traffic. Maybe it needs a reboot?
gmb@ayres:~/Orchestrator$ ./orchestrate.sh -b calc.poets
POETS> 05:13:49.01: 20(I) The microlog for the command 'load /engine = "../Config/POETSHardwareOneBox.ocfg"' will be written to '../Output/Microlog/Microlog_2021_11_02T05_13_49p0.plog'.
POETS> 05:13:49.01: 140(I) Topology loaded from file ||../Config/POETSHardwareOneBox.ocfg||.
POETS> 05:13:49.01: 20(I) The microlog for the command 'call /file = "/home/gmb/Orchestrator/calc.poets"' will be written to '../Output/Microlog/Microlog_2021_11_02T05_13_49p1.plog'.
POETS> 05:13:49.01: 20(I) The microlog for the command 'exit /at = stop' will be written to '../Output/Microlog/Microlog_2021_11_02T05_13_49p2.plog'.
POETS> 05:13:49.01: 69(I) Exit trigger staged - will exit when the next application is completely stopped.
POETS> 05:13:54.88: 20(I) The microlog for the command 'load /app = +calc.xml' will be written to '../Output/Microlog/Microlog_2021_11_02T05_13_49p3.plog'.
POETS> 05:13:54.88: 235(I) Application file ../calc.xml loading...
POETS> 05:13:54.88: 65(I) Application file ../calc.xml loaded in 4881 ms.
POETS> 05:13:54.05: 20(I) The microlog for the command 'tlink /app = *' will be written to '../Output/Microlog/Microlog_2021_11_02T05_13_54p0.plog'.
POETS> 05:13:54.05: 234(I) Typelinking graph instance 'calc'...
POETS> 05:13:54.05: 249(I) Successfully typelinked graph instance 'calc'.
POETS> 05:13:54.26: 20(I) The microlog for the command 'place /spread = *' will be written to '../Output/Microlog/Microlog_2021_11_02T05_13_54p1.plog'.
POETS> 05:13:54.26: 309(I) Attempting to place graph instance 'calc' using the 'spre' method...
POETS> 05:13:54.26: 302(I) Graph instance 'calc' placed successfully.
POETS> 05:18:04.34: 20(I) The microlog for the command 'compose /app = *' will be written to '../Output/Microlog/Microlog_2021_11_02T05_13_54p2.plog'.
POETS> 05:18:04.34: 803(I) Composing graph instance 'calc'...
POETS> 05:18:04.34: 804(I) Graph instance 'calc' composed successfully.
POETS> 05:18:04.38: 20(I) The microlog for the command 'deploy /app = *' will be written to '../Output/Microlog/Microlog_2021_11_02T05_18_04p0.plog'.
POETS> 05:18:04.38: 184(I) Deployment of graph instance 'calc' staged. Waiting for Mothership(s) to acknowledge receipt in the background.
POETS> 05:18:04.38: 20(I) The microlog for the command 'initialise /app = *' will be written to '../Output/Microlog/Microlog_2021_11_02T05_18_04p1.plog'.
POETS> 05:18:04.38: 187(I) Initialisation of graph instance 'calc' staged. Waiting for Mothership(s) to acknowledge receipt in the background.
POETS> 05:18:04.38: 20(I) The microlog for the command 'run /app = *' will be written to '../Output/Microlog/Microlog_2021_11_02T05_18_04p2.plog'.
POETS> 05:18:04.38: 188(I) Run of graph instance 'calc' staged. Waiting for Mothership(s) to acknowledge receipt in the background.
POETS> 05:18:05.63: 529(I) Mothership (rank 2): Deployment of application 'Arith::calc' (to this Mothership) complete.
POETS> 05:18:05.63: 530(I) Mothership (rank 2): Initialising fully-defined application 'Arith::calc'.
POETS> 05:18:05.63: 186(I) Application 'Arith::calc' successfully deployed on all Motherships it is mapped to.
POETS> 05:18:17.17: 531(I) Mothership (rank 2): Initialisation of application 'Arith::calc' (to this Mothership) complete.
POETS> 05:18:17.17: 186(I) Application 'Arith::calc' ready to start on all Motherships it is mapped to.
POETS> 05:18:17.17: 532(I) Mothership (rank 2): Starting (running) fully-initialised application 'Arith::calc'.
POETS> 05:18:17.17: 186(I) Application 'Arith::calc' running on all Motherships it is mapped to.
POETS> 05:21:25.25: 513(W) Mothership: Received an unexpected 'BARRIER' packet for an application in state 'RUNNING'. Ignoring.
POETS> 05:21:27.85: 513(W) Mothership: Received an unexpected 'BARRIER' packet for an application in state 'RUNNING'. Ignoring.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '1'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f1'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '2e'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fb'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '1'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fb'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '44'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '44'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'df'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'df'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'df'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'eb'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'd4'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x20ed' with content 'c4'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x20ed' with content '30'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fe'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ed'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '17'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '17'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '17'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '17'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f7'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '81'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ed'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '17'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '17'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '17'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '17'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f7'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f7'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f7'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fe'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fe'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fe'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fe'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fe'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fe'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f3'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'df'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'df'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'de'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'de'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'de'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'de'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fe'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fe'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'df'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'bf'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'bf'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'da'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'da'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f4'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f4'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f4'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f4'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '7f'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '7f'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'df'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '7f'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'df'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f6'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '6d'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '1f'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f7'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f7'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '97'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f5'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'f6'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'fd'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '57'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '57'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'bf'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'bf'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'ff'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '34'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content '57'.
POETS> 05:21:41.41: 521(I) Debug byte received from thread '0x23ae' with content 'bf'.
POETS>
A note for later - maybe there's some hysteresis here. Maybe the tinsels are still working after the Orchestrator finishes?
A note for later - maybe there's some hysteresis here. Maybe the tinsels are still working after the Orchestrator finishes?
That is what I was thinking at first, but the range of spurious output does not make sense for that.
Just to update with the details from today's E-Mails with @mn416, Ayres has a fan failure on one of the boards leading to overheating and is out of commission for the time being.
That said, we should still not be seg faulting in response.
Preamble This occurs on the
BUGFIX-0285-HardwareIdleBarrier
branch withFEATURE-0242-HardwareIdle-Mothership
merged in locally. It may occur on other branches.I have only seen the packet issue that causes this on Ayres - there may be an underlying issue there, but it should not lead to us seg faulting..
Issue After a number of runs, the Orchestrator will occasionally report that a packet with an "Invalid application ID" has been received and seg faults. Sometimes it is a plain signal 11, other times it is an
std::bad_alloc
signal 6.Example application that inconsistently triggers the issue calc.xml, call file calc.poets and a bypass call file calcby.poets (remove .txt from all three. The bypass call file will only work after the normal one is used for a successful run)
Example output with the seg fault:
Example output of the
std::bad_alloc
: