travisghansen / hass-opnsense

OPNsense integration with Home Assistant

Intermittent timeout to specific opnsense host #134

Closed LorenKeagle closed 1 month ago

LorenKeagle commented 7 months ago

I have a primary and a backup OPNsense router on my network, and I installed this plugin on both to monitor traffic and status.

I'm running OPNsense 24.1.3_1-amd64. HA is running in a dedicated VM on my NAS.

Both integrations appear to work, but my primary device has gaps in its data. The HA logs show several timeout errors throughout the day, with no further information.

2024-03-16 01:31:46.312 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 02:22:06.313 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 04:05:33.622 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 04:45:57.624 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 05:58:57.622 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 08:04:27.623 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 08:24:57.624 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 09:27:27.623 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 09:47:57.623 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
...

The status dashboards look like this:

[Image: Screenshot 2024-03-18 at 10 16 20 AM]

All the permissions appear to be set correctly. Both OPNsense devices have the exact same configuration, but the secondary has neither the timeout errors nor the missing data.

Any suggestions on where I can investigate next?

LorenKeagle commented 7 months ago

I should add that all of the sensor data from the primary host has similar gaps. It is not limited to just the WAN status. There also doesn't appear to be any regular interval for either the timeout errors or the duration of the gaps, although on average I'd say it occurs approximately every 40 minutes or so.
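To put numbers on the spacing rather than eyeballing it, the timestamps can be pulled out of the log and the gaps between consecutive errors computed. The following is a minimal sketch, assuming the Home Assistant log line format shown in the excerpt above; the `timeout_gaps_minutes` helper and the `LOG` constant are illustrative names, not part of the integration.

```python
import re
from datetime import datetime

# The nine timeout lines copied from the log excerpt in this issue.
LOG = """\
2024-03-16 01:31:46.312 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 02:22:06.313 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 04:05:33.622 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 04:45:57.624 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 05:58:57.622 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 08:04:27.623 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 08:24:57.624 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 09:27:27.623 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
2024-03-16 09:47:57.623 ERROR (MainThread) [custom_components.opnsense] Timeout fetching Merlin.piccnet.org OPNsense state data
"""

# Capture the leading timestamp of any ERROR line that reports a timeout.
TIMESTAMP = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3}) ERROR .*Timeout fetching"
)


def timeout_gaps_minutes(log_text):
    """Return the minutes elapsed between consecutive timeout errors."""
    stamps = []
    for line in log_text.splitlines():
        match = TIMESTAMP.match(line)
        if match:
            stamps.append(
                datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S.%f")
            )
    # Pair each timestamp with the next one and take the difference.
    return [(b - a).total_seconds() / 60 for a, b in zip(stamps, stamps[1:])]


gaps = timeout_gaps_minutes(LOG)
print("gaps (min):", [round(g, 1) for g in gaps])
print("mean gap (min):", round(sum(gaps) / len(gaps), 1))
```

Run against the full log file instead of the truncated excerpt, the same function would show whether the gaps cluster around a fixed period or are as irregular as they look. In the nine lines quoted here they range from roughly 20 to roughly 125 minutes.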

twodrops commented 6 months ago

Is this somehow the same as https://github.com/travisghansen/hass-opnsense/issues/118?

LorenKeagle commented 6 months ago

I didn't post this in that issue because I don't have any related logs in opnsense, and it's an intermittent issue. #118 indicated that the issue was persistent.

Perhaps it's related, but I don't have any data in my systems to support that correlation.

LorenKeagle commented 6 months ago

I removed the opnsense plugins and HA integration for both routers, reinstalled everything from scratch, and I no longer see this issue. I wish there were more information to provide, sorry!