Here is output from MCE log, for that one in ML
marekr2@queeg:~$ echo CPU 0: Machine Check Exception: 4 Bank 4: fe28a001fd080813 TSC 2eefd49369 ADDR f0050 MISC c0090e7e00000000 | /usr/sbin/mcelog --ascii --k8 mcelog: Cannot open /dev/mem for DMI decoding: Permission denied HARDWARE ERROR. This is *NOT* a software problem! Please contact your hardware vendor CPU 0 4 northbridge Northbridge RAM Chipkill ECC error Chipkill ECC syndrome = fd51 bit32 = err cpu0 bit45 = uncorrected ecc error bit57 = processor context corrupt bit59 = misc error valid bit61 = error uncorrected bit62 = error overflow (multiple errors) bus error 'local node origin, request didn't time out generic read mem transaction memory access, level generic' STATUS fe28a001fd080813 MCGSTATUS 4
And from Ward: CPU 0 4 northbridge Northbridge RAM Chipkill ECC error Chipkill ECC syndrome = fd51 bit32 = err cpu0 bit45 = uncorrected ecc error bit57 = processor context corrupt bit59 = misc error valid bit61 = error uncorrected bit62 = error overflow (multiple errors) bus error 'local node origin, request didn't time out generic read mem transaction memory access, level generic' STATUS fe28a001fd080813 MCGSTATUS 4
Seems something went wrong with ECC. Maybe because the memory is not cleared anymore?
Can someone test if http://tracker.coreboot.org/trac/coreboot/changeset/4099/trunk/coreboot-v2/s...
this change is reverted problem goes away?
Rudolf