re: [Hardware Error]: Machine check events logged

Top Page

Reply to this message
Author: Hervé DE DIANOUS
Date:  
To: guilde
Subject: re: [Hardware Error]: Machine check events logged
 Ecrit par le webmail orange :(

>
> Au démérage normal de Xubuntu-14-04, le boot s'interompt  sur l'apparition du fond d'écran après le login puis plus rien.
> dmesg indique :
> [Hardware Error]: Machine check events logged
> sensors n'indique rien d'anormal, température et tensions normales.
>
> Je reboote avec systemrescueCD qui démarre normalement :) mais
> mcelog --client reste muet !
> Je vois bien que SystemrescueCD utilise Zsh
> Comment bien utiliser mcelog dans cet environement ?
> Il n'y a pas de /var/log/mcelog  !

OK !

/var/log/mcelog existe bien :

----------------------------------------

Hardware event. This is not a software error.
MCE 0
CPU 3 THERMAL EVENT TSC 16bd41e790
TIME 1405779002 Sat Jul 19 16:10:02 2014
Processor 3 heated above trip temperature. Throttling enabled.
Please check your system cooling. Performance will be impacted
STATUS 8800002f MCGSTATUS 0
MCGCAP 806 APICID 3 SOCKETID 0
CPUID Vendor Intel Family 6 Model 15
Hardware event. This is not a software error.
MCE 1
CPU 0 THERMAL EVENT TSC 16d8c7fcee
TIME 1405779002 Sat Jul 19 16:10:02 2014
Processor 0 heated above trip temperature. Throttling enabled.
Please check your system cooling. Performance will be impacted
STATUS 8802000f MCGSTATUS 0
MCGCAP 806 APICID 0 SOCKETID 0
CPUID Vendor Intel Family 6 Model 15
Hardware event. This is not a software error.
MCE 2
CPU 0 THERMAL EVENT TSC 16d8d9c307
TIME 1405779002 Sat Jul 19 16:10:02 2014
Processor 0 below trip temperature. Throttling disabled
STATUS 8802000a MCGSTATUS 0
MCGCAP 806 APICID 0 SOCKETID 0
CPUID Vendor Intel Family 6 Model 15
Hardware event. This is not a software error.
MCE 0
CPU 3 THERMAL EVENT TSC 12a462f9134
TIME 1405779494 Sat Jul 19 16:18:14 2014
Processor 3 heated above trip temperature. Throttling enabled.
Please check your system cooling. Performance will be impacted
STATUS 8802002f MCGSTATUS 0
MCGCAP 806 APICID 3 SOCKETID 0
CPUID Vendor Intel Family 6 Model 15
Hardware event. This is not a software error.
MCE 1
CPU 3 THERMAL EVENT TSC 12a46417187
TIME 1405779494 Sat Jul 19 16:18:14 2014
Processor 3 below trip temperature. Throttling disabled
STATUS 8802002a MCGSTATUS 0
MCGCAP 806 APICID 3 SOCKETID 0
CPUID Vendor Intel Family 6 Model 15
(END)

--------------------------------

 sensors :

-------------------------------

radeon-pci-0100
Adapter: PCI adapter
temp1:        +58.0°C 

atk0110-acpi-0
Adapter: ACPI interface
Vcore Voltage:       +1.11 V  (min =  +0.85 V, max =  +1.60 V)
 +3.3 Voltage:       +3.17 V  (min =  +2.97 V, max =  +3.63 V)
 +5 Voltage:         +4.90 V  (min =  +4.50 V, max =  +5.50 V)
 +12 Voltage:       +11.93 V  (min = +10.20 V, max = +13.80 V)
CPU FAN Speed:      2766 RPM  (min =  600 RPM, max = 7200 RPM)
CHASSIS1 FAN Speed:    0 RPM  (min =  600 RPM, max = 7200 RPM)
CHASSIS2 FAN Speed:    0 RPM  (min =  600 RPM, max = 7200 RPM)
POWER FAN Speed:       0 RPM  (min =  600 RPM, max = 7200 RPM)
CPU Temperature:     +58.0°C  (high = +60.0°C, crit = +95.0°C)
MB Temperature:      +43.0°C  (high = +45.0°C, crit = +95.0°C)'y a pas

coretemp-isa-0000
Adapter: ISA adapter
Core 0:       +70.0°C  (high = +82.0°C, crit = +100.0°C)
Core 1:       +69.0°C  (high = +82.0°C, crit = +100.0°C)
Core 2:       +71.0°C  (high = +82.0°C, crit = +100.0°C)
Core 3:       +71.0°C  (high = +82.0°C, crit = +100.0°C)  ALARM (CRIT)

-------------------------------

Comme vous voyez, il n'y a pas de dépassement de température critique contrairement à ce qu'affirme mcelog.

Le système de fichier est OK

Je pige pas et google m'aide guère :(

kern.log m'indique que le CPU3 (coeur n°4) est "above threshold alors que sensors : va de 75° à 85° (

RV2D