mchehab / rasdaemon

Rasdaemon is a RAS (Reliability, Availability and Serviceability) logging tool. It records memory errors using the EDAC tracing events. EDAC is a Linux kernel subsystem which handles detection of ECC errors from memory controllers for most chipsets on i386 and x86_64 architectures. EDAC drivers for other architectures, such as arm, also exist.
GNU General Public License v2.0

ras-mc-ctl: drivers not loaded. #137

Open panorain opened 10 months ago

panorain commented 10 months ago

Hi, I have a Lenovo M57p 9088 desktop here. Are the errors below displayed because the machine has no EDAC capability? They appear after running the following commands:

Thinkcentre-M57p:/etc/sysconfig # ras-mc-ctl --status
ras-mc-ctl: drivers not loaded.
Thinkcentre-M57p:/etc/sysconfig # 
Thinkcentre-M57p:/etc/sysconfig # rasdaemon --status
Segmentation fault (core dumped)
paul-Thinkcentre-M57p:/etc/sysconfig # 
Thinkcentre-M57p:/etc/sysconfig # rasdaemon --enable
rasdaemon: ras:mc_event event enabled
rasdaemon: ras:aer_event event enabled
rasdaemon: mce:mce_record event enabled
rasdaemon: ras:extlog_mem_event event enabled
rasdaemon: ras:non_standard_event event enabled
rasdaemon: ras:arm_event event enabled
rasdaemon: devlink:devlink_health_report event enabled
rasdaemon: block:block_rq_error event enabled
rasdaemon: ras:memory_failure_event event enabled
rasdaemon: Can't write to set_event
rasdaemon: Can't write to set_event
rasdaemon: Can't write to set_event
rasdaemon: Can't write to set_event
rasdaemon: Can't write to set_event
rasdaemon: Can't write to set_event
rasdaemon: Can't write to set_event
rasdaemon: Can't write to set_event
Thinkcentre-M57p:/etc/sysconfig # 
Thinkcentre-M57p:/etc/sysconfig # systemctl status rasdaemon
● rasdaemon.service - RAS daemon to log the RAS events
     Loaded: loaded (/usr/lib/systemd/system/rasdaemon.service; enabled; **preset: disabled**)
     Active: active (running) since Sat 2024-01-13 13:41:43 CST; 5h 36min ago
    Process: 1195 ExecStartPost=/usr/sbin/rasdaemon --enable (code=exited, status=0/SUCCESS)
   Main PID: 1194 (rasdaemon)
      Tasks: 1 (limit: 4915)
        CPU: 142ms
     CGroup: /system.slice/rasdaemon.service
             └─1194 /usr/sbin/rasdaemon -f -r

Jan 13 18:04:41 Thinkcentre-M57p rasdaemon[1194]:            <...>-955   [002] .....     0.000534 block_rq_error 2024-01-13 15:10:26 -0600
Jan 13 18:04:41 Thinkcentre-M57p rasdaemon[1194]: rasdaemon: diskerror_event store: 0x55cedf2742e0
Jan 13 18:04:41 Thinkcentre-M57p rasdaemon[1194]: rasdaemon: register inserted at db
Jan 13 18:04:41 Thinkcentre-M57p rasdaemon[1194]:            <...>-955   [000] .....     0.000534 block_rq_error 2024-01-13 15:10:26 -0600
Jan 13 18:04:42 Thinkcentre-M57p rasdaemon[1194]: rasdaemon: diskerror_event store: 0x55cedf2742e0
Jan 13 18:04:42 Thinkcentre-M57p rasdaemon[1194]: rasdaemon: register inserted at db
Jan 13 18:04:42 Thinkcentre-M57p rasdaemon[1194]:            <...>-955   [000] .....     0.000534 block_rq_error 2024-01-13 15:10:27 -0600
Jan 13 18:04:42 Thinkcentre-M57p rasdaemon[1194]: rasdaemon: diskerror_event store: 0x55cedf2742e0
Jan 13 18:04:42 Thinkcentre-M57p rasdaemon[1194]: rasdaemon: register inserted at db
Jan 13 18:04:42 Thinkcentre-M57p rasdaemon[1194]:            <...>-955   [000] .....     0.000534 block_rq_error 2024-01-13 15:10:27 -0600
Thinkcentre-M57p:/etc/sysconfig # 

What are your thoughts on this?

-Thanks
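
One way to look into the EDAC question independently of ras-mc-ctl is to check sysfs directly. The sketch below assumes the standard EDAC sysfs location, /sys/devices/system/edac/mc, where a loaded EDAC driver registers its memory controllers as mc0, mc1, and so on. The helper name edac_mc_count is made up for illustration, and this is not necessarily the same test that ras-mc-ctl --status performs.

```shell
#!/bin/sh
# Hypothetical helper (not part of rasdaemon): count the memory controllers
# registered under an EDAC sysfs root. The default is the standard EDAC
# sysfs path; the optional argument only exists so the function can be
# pointed at another directory.
edac_mc_count() {
    root="${1:-/sys/devices/system/edac/mc}"
    count=0
    for d in "$root"/mc[0-9]*; do
        # If nothing matches, the glob stays literal and fails the -d test.
        if [ -d "$d" ]; then
            count=$((count + 1))
        fi
    done
    echo "$count"
}

n=$(edac_mc_count)
if [ "$n" -eq 0 ]; then
    echo "no EDAC memory controllers registered"
else
    echo "$n EDAC memory controller(s) registered"
fi
```

If this reports no controllers, `lsmod | grep -i edac` shows whether any EDAC module is loaded at all. A count of zero would fit the "drivers not loaded" message, but it does not by itself explain the segmentation fault or the "Can't write to set_event" lines.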