coreos / bugs

Issue tracker for CoreOS Container Linux
https://coreos.com/os/eol/
146 stars 30 forks source link

Frequent kernel panics on Dell PowerEdge R730 #2325

Open hpio opened 6 years ago

hpio commented 6 years ago

Issue Report

Bug

Container Linux Version

cat /etc/os-release
NAME="Container Linux by CoreOS"
ID=coreos
VERSION=1576.5.0
VERSION_ID=1576.5.0
BUILD_ID=2018-01-05-1121
PRETTY_NAME="Container Linux by CoreOS 1576.5.0 (Ladybug)"
ANSI_COLOR="38;5;75"
HOME_URL="https://coreos.com/"
BUG_REPORT_URL="https://issues.coreos.com"
COREOS_BOARD="amd64-usr"
...
BUG_REPORT_URL="https://issues.coreos.com"

Environment

What hardware/cloud provider/hypervisor is being used to run Container Linux?

Dell PowerEdge R730 server

Expected Behavior

OS runs without issues

Actual Behavior

I've noticed 2 different behaviors, sometimes a host reboots and sometimes it just sits there

Other Information

dmesg-erst-6514387398485344261.txt

megastallman commented 6 years ago

@hpio , please look here: https://github.com/coreos/bugs/issues/1862 While I'm waiting for microcode delivery to the production branch, I've turned off hyperthreading on my servers. Looks like a workaround for the moment.

fabiorauber commented 6 years ago

I'm seeing this issue in CoreOS 1632.3.0 Stable, running on oVirt 4.2.1 (kvm), which runs in IBM x240 nodes (Intel Sandybridge IBRS family). They are serving 1000+ containers, orchestrated by Rancher