Closed larrryfisherthefisherman closed 5 years ago
The first error is a timeout error. This means that the RCON connection ran into a timeout. For the second one I have to dig deeper into the code of the RCON library. In both cases something with your connection to the server is wrong. Are you sure that the servers are really running RCON and that your password is correct? What kind of servers (CSGO, TF2, L4D2, ...) are you monitoring?
Yes I am sure they are running RCON. They respond and then other times they throw out these errors.
They are all CSGO servers.
Okay, can you see a correlation between the times when this error occurs and what is going on on the servers? Maybe the server kills the connection or does not answer when it is changing the map?
Actually scratch all that it does seem to be related to map changes is there solution we can find so that this doesn't effect the graphs in Grafana?
You could tell your grafana to connect the missing points? If the connection fails it only returns the srcds_up 0 and no other metrics. This means that connecting the other metrics would smooth out your graphs.
While the script is running it will throw this out every so often. I am running the scraper every 5s like your example. I also have 14 servers set to scrape in this way in the prometheus.yml:
as well as: