home-assistant / operating-system

:beginner: Home Assistant Operating System
Apache License 2.0
4.78k stars 957 forks source link

Host crashes and goes offline at random times #1336

Closed rousveiga closed 2 years ago

rousveiga commented 3 years ago

Hardware Environment

Home Assistant OS release:

outthereandaway commented 2 years ago

Unfortunately, HA has crashed again about 24h after upgrading the OS to 6.4. My entire network was not available. After unplugging the Raspberry Pi it went up immediately again. After rebooting HA, I could see that it crashed around 2:50 am (no more readings of CPU temp after that) and the first temp reading was at around 75 degC.

@rousveiga Can you please reopen the issue? Thanks

rousveiga commented 2 years ago

Sure!

outthereandaway commented 2 years ago

@rousveiga Quick question: Did you change anything in your physical setup (Raspy, SD, ...)? Do you use RaspBee 2 by any chance?

rousveiga commented 2 years ago

@rousveiga Quick question: Did you change anything in your physical setup (Raspy, SD, ...)?

This is hard to answer, because I have a spare identical Pi with an identical SD. I use the spare one for tests, and might have swapped them at some point. However, when this first started happening, I made sure to verify it happened on both Pi's and both SDs, so I don't think that's the reason it works now.

Another thing I did was switch from SD-only, to data partition in external HDD for a few months. Crashing behavior was exactly the same in 6.X, so I kept my OS version in 5.3, and that worked fine.

By this point my ISP changed some configuration, replaced my router, etc. so my entire installation stopped working. I didn't get to restore it until this past month, and that's when I swapped back to SD-only, upgraded to 6.4, and found that the RasPi didn't crash anymore.

One test I can do, though, is to downgrade to one of the offending versions and verify that it still fails with my current setup.

Do you use RaspBee 2 by any chance?

No, I don't.

agners commented 2 years ago

@outthereandaway there are various reasons for crashes. To avoid that we mix up different root causes and system configurations I'd rather prefer if you collect the logs/information for your system and open a new bug.

@rousveiga thanks for the updates! Would be interesting if you still can reproduce the issue when downgrading OS (e.g. to 6.2 or older versions). If so, then something in OS indeed fixed the problem. If not, then any other update/change in your system or Home Assistant fixed the problem.

rousveiga commented 2 years ago

@agners I will try the downgrade!

rizer1980 commented 2 years ago

Hello.

System Health

version core-2021.10.6
installation_type Home Assistant OS
dev false
hassio true
docker true
user root
virtualenv false
python_version 3.9.7
os_name Linux
os_version 5.10.17-v8
arch aarch64
timezone Europe/Moscow
Home Assistant Cloud logged_in | false -- | -- can_reach_cert_server | ok can_reach_cloud_auth | ok can_reach_cloud | ok
Home Assistant Supervisor host_os | Home Assistant OS 6.5 -- | -- update_channel | stable supervisor_version | supervisor-2021.10.6 docker_version | 20.10.7 disk_total | 116.7 GB disk_used | 6.7 GB healthy | true supported | true board | rpi4-64 supervisor_api | ok version_api | ok installed_addons | File editor (5.3.3), Mosquitto broker (6.0.1), Samba share (9.5.1), Terminal & SSH (9.0.2), Zigbee2mqtt (1.21.2-1)
Lovelace dashboards | 1 -- | -- resources | 0 views | 2 mode | storage

The same problem as many here, pi 4 hangs every 2-5 days. I connected a monitor, that's what was in the logs when it hung up, I hope it helps. ps. pi 4 with ssd only.

WhatsApp Image 2021-10-31 at 11 33 11 WhatsApp Image 2021-10-31 at 11 33 10

JaimeOlaneta commented 2 years ago

Hey guys! I've been having the same crashes in 4 to 6 days, I lost all control of all devices, which were offline (sonoff blinking) and sometimes I couldn't access the local ip using ssh, even with the HA dripping on the network.

Olá pessoal! estive tendo os mesmos travamentos em 4 à 6 dias, perdia todo o controle de todos os dispositivos, que ficavam off-line (sonoff piscando) e por vezes não conseguia acesso pelo ip local usando o ssh, mesmo com o HA pingando na rede.

I tried with this to get the version back to 2021.10.6 but I lost the history record after 24hrs, so I decided to risk uploading the version.

Tentei com isso voltar a versão para 2021.10.6 mas perdi o registro do histórico após 24hrs, com isso decidi arriscar subir a versão.

JaimeOlaneta commented 2 years ago

System Health

version core-2021.12.0
installation_type Home Assistant OS
dev false
hassio true
docker true
user root
virtualenv false
python_version 3.9.7
os_name Linux
os_version 5.10.63-v8
arch aarch64
timezone America/Sao_Paulo
Home Assistant Cloud logged_in | true -- | -- subscription_expiration | 9 de janeiro de 2022 21:00 relayer_connected | true remote_enabled | true remote_connected | true alexa_enabled | false google_enabled | false remote_server | us-east-1-1.ui.nabu.casa can_reach_cert_server | ok can_reach_cloud_auth | ok can_reach_cloud | ok
Home Assistant Supervisor host_os | Home Assistant OS 7.0 -- | -- update_channel | stable supervisor_version | supervisor-2021.12.1 docker_version | 20.10.9 disk_total | 109.3 GB disk_used | 44.5 GB healthy | true supported | true board | rpi4-64 supervisor_api | ok version_api | ok installed_addons | Home Assistant Google Drive Backup (0.105.2), Samba share (9.5.1), Mosquitto broker (6.0.1), File editor (5.3.3), Terminal & SSH (9.2.1), Grafana (7.3.2), InfluxDB (4.3.0), MariaDB (2.4.0), TasmoAdmin (0.16.0)
Lovelace dashboards | 1 -- | -- resources | 0 views | 2 mode | storage
JaimeOlaneta commented 2 years ago

Hello, has anyone updated to OS 7.0? I'm using it with core 2021.12.0, so far it hasn't crashed or had the result locked in the history, I'm 9 days without problems. Let's wait!

Olá, alguém atualizou para o OS 7.0? Estou usando com core 2021.12.0, até o momento não travou nem teve resultado travado no histórico, estou 9 dias sem problemas. Vamos aguardar!

agners commented 2 years ago

Forgot about this thread: FYI, #1119 got solved meanwhile, see https://github.com/home-assistant/operating-system/issues/1119#issuecomment-1004120863. Since symptoms sound very similar, testing using 7.1 is definitely worth a try.

github-actions[bot] commented 2 years ago

There hasn't been any activity on this issue recently. To keep our backlog manageable we have to clean old issues, as many of them have already been resolved with the latest updates. Please make sure to update to the latest Home Assistant OS version and check if that solves the issue. Let us know if that works for you by adding a comment 👍 This issue has now been marked as stale and will be closed if no further activity occurs. Thank you for your contributions.