Bootstrap

EDAC sbridge MC0: HANDLING MCE MEMORY ERROR导致宕机

crash日志

[112624028.710701] EDAC MC0: 1 CE memory read error on CPU_SrcID#0_Ha#0_Chan#0_DIMM#1 (channel:0 slot:1 page:0xafbb3f offset:0x800 grain:32 syndr
ome:0x0 -  area:DRAM err_code:0001:0090 socket:0 ha:0 channel_mask:1 rank:5)
[112624028.710721] EDAC MC0: 1 CE memory read error on CPU_SrcID#0_Ha#0_Chan#0_DIMM#1 (channel:0 slot:1 page:0x9f4d2f offset:0xc80 grain:32 syndr
ome:0x0 -  area:DRAM err_code:0001:0090 socket:0 ha:0 channel_mask:1 rank:5)
[112624029.231467] EDAC sbridge MC0: HANDLING MCE MEMORY ERROR
[112624029.231479] EDAC sbridge MC0: CPU 0: Machine Check Event: 0 Bank 5: 8c00004000010090
[112624029.231481] EDAC sbridge MC0: TSC 0 
[112624029.231484] EDAC sbridge MC0: ADDR b072ecf80 
[112624029.231485] EDAC sbridge MC0: MISC 42188886 
[112624029.231487] EDAC sbridge MC0: PROCESSOR 0:206d7 TIME 1687586205 SOCKET 0 APIC 0
[112624029.231491] EDAC sbridge MC0: HANDLING MCE MEMORY ERROR
[112624029.231493] EDAC sbridge MC0: CPU 0: Machine Check Event: 0 Bank 8: 8800004600800090
[112624029.231495] EDAC sbridge MC0: TSC 0 
[112624029.231496] EDAC sbridge MC0: ADDR 0 
[112624029.231498] EDAC sbridge MC0: MISC 5229743c343c0c8c 
[112624029.231500] EDAC sbridge MC0: PROCESSOR 0:206d7 TIME 1687586205 SOCKET 0 APIC 0
[112624029.289617] EDAC sbridge MC0: HANDLING MCE MEMORY ERROR
[112624029.289633] EDAC sbridge MC0: CPU 0: Machine Check Event: 0 Bank 5: cc00008000010090
[112624029.289635] EDAC sbridge MC0: TSC 0 
[112624029.289637] EDAC sbridge MC0: ADDR 7788bb900 
[112624029.289639] EDAC sbridge MC0: MISC 20169686 

查看监控发现宕机重启后内存少了16G,判断为内存损坏或内容插槽损坏

;