xtuml / munin

Apache License 2.0
1 stars 0 forks source link

Multiple containers failing in longer runs #146

Closed FreddieMatherSmartDCSIT closed 8 months ago

FreddieMatherSmartDCSIT commented 8 months ago

Protocol Verifier containers are frequently failing and restarting. In longer runs with multiple AEReception containers and multiple AEOSVDC containers this seems to lead to AEReception failing completely and then large amounts of file processing errors in AEOrdering.

We have plotted metrics as the test progresses (grafana graphs show BST):

CPU usage image

Memory usage

image

AEOrdering file processing errors

image

Containers running

image

Steps to reproduce:

cortlandstarrett commented 8 months ago

fixed in main as of 30 oct 2023