Open NightowlKr opened 1 year ago
You hit the default max value for port bits. It has a max because bad values, either from the device or from network interruptions, can cause spikes.
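As a rough illustration of that trade-off: rrdtool stores any rate outside a data source's min/max range as unknown, so a bogus spike is dropped, but so is legitimate traffic above the max. Below is a minimal Python sketch of that behavior. The `apply_ds_max` helper and all numbers are hypothetical and are not LibreNMS code.

```python
from typing import Optional


def apply_ds_max(rate: float, ds_max: Optional[float]) -> Optional[float]:
    """Mimic how an RRD data source treats a rate above its maximum.

    Hypothetical helper for illustration only; None stands in for
    rrdtool's "unknown" value.
    """
    if ds_max is not None and rate > ds_max:
        return None  # out-of-range rates are stored as unknown, not clamped
    return rate


# Made-up example: a max sized for a 10 Gbps port, in bytes per second.
ds_max = 10 * 1000**3 / 8

print(apply_ds_max(500e6, ds_max))  # normal traffic is kept
print(apply_ds_max(5e12, ds_max))   # a bogus spike becomes unknown
```

The same rule explains gaps on fast ports: once real traffic exceeds the max, those samples are recorded as unknown too.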
@librenms/reviewers Thoughts on the max value? 10G is very common these days and 40G and 100G+ are becoming more common.
@murrant Following your comment, I set it to 800G (800 * 1000 * 1000 * 1000 / 8 = 100000000000), because the most recent standard is QSFP-DD800. http://www.qsfp-dd.com/specification/
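For anyone checking the arithmetic, here is a small Python sketch of the conversion from link speed to the bytes-per-second value. The function name `max_bytes_per_second` is made up for this example, and it assumes decimal units (1 Gbps = 1000 * 1000 * 1000 bits per second).

```python
def max_bytes_per_second(link_gbps: int) -> int:
    """Convert a link speed in Gbps (decimal units) to bytes per second."""
    bits_per_second = link_gbps * 1000 * 1000 * 1000
    return bits_per_second // 8  # 8 bits per byte


print(max_bytes_per_second(800))  # 100000000000
```

By the same formula, a 100G port works out to 12500000000.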
If we go with option 2, that would be fine for most people for awhile.
Agree, the value should probably be at least a few hundred gigs since 100G ports are becoming quite popular on newer hardware.
Sure, send a change to update the value to 100000000000. Remember that it will only affect new RRDs.
I guess the issue is which is more common: buggy SNMP implementations causing spikes, or hitting the default max? Going forward it will probably be more and more likely to hit the max, so bumping it seems like a good choice. However, a 5 Gbps spike on my VDSL uplink at home is still going to show up as a huge anomaly, so people will likely have to manually set the value lower on some ports anyway. I'd probably vote for bumping it to 100 Gbps by default so that people don't end up with bad data in their RRDs.
@paulgear I think LibreNMS could do a better job of detecting SNMP queries interrupted mid-query and preventing 0s from being written to the RRD. That would significantly reduce the chance of spikes.
Unfortunately, that involves refactoring the ports module which is a huge monster containing all kinds of black magic.
The point of this issue is that the graph data looks wrong, given that it is the sum graph for the two ports below.
Moreover, the total graph was also strange.
I checked that the collection was working fine.
So I tracked down the point of failure.
Eventually, I found that the RRD file had no data at that point.
When I looked at the structure of the RRD file, it seemed wrong, because according to the document below it should support up to 100 Gbps. https://docs.librenms.org/Extensions/RRDTune/
The result was the same even when I ran the corresponding PHP script manually.
Finally, to resolve this symptom, I changed the maximum manually by following the URL below. https://oss.oetiker.ch/rrdtool/doc/rrdtune.en.html#:~:text=disable%20this%20limit.-,%2D%2Dmaximum%7C%2Da%C2%A0ds%2Dname%3Amax,-alter%20the%20maximum
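A sketch of what that manual change could look like, written in Python so the command can be reviewed before it is run. It only builds an `rrdtool tune` invocation using the `--maximum ds-name:max` option from the page linked above. The file path and the data source names `INOCTETS` and `OUTOCTETS` are assumptions for this example; check the real names in your own file with `rrdtool info` first.

```python
from typing import List, Sequence


def build_rrdtune_command(
    rrd_path: str, ds_names: Sequence[str], new_max: int
) -> List[str]:
    """Build an `rrdtool tune` command that raises the max of each data source."""
    command = ["rrdtool", "tune", rrd_path]
    for ds in ds_names:
        # One --maximum option per data source, in ds-name:max form.
        command += ["--maximum", f"{ds}:{new_max}"]
    return command


# Hypothetical path and data source names, for illustration only.
cmd = build_rrdtune_command(
    "/opt/librenms/rrd/example-host/port-id1.rrd",
    ["INOCTETS", "OUTOCTETS"],
    100000000000,
)
print(" ".join(cmd))
```

To apply it, the list can be passed to `subprocess.run(cmd, check=True)` on a host where rrdtool is installed.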
Output of ./validate.php
What was the last working version of LibreNMS?
22.10.0
Anything in the logs that might be useful for us?
No response