
Intel S2600GZ User Manual

Intel S2600GZ
141 pages
Miscellaneous Events
System Event Log Troubleshooting Guide for EPSD Platforms Based on Intel® Xeon® Processor E5 4600/2600/2400/1600/1400 Product Families
Page 102, Intel order number G90620-002, Revision 1.1
Table 78: IPMI Watchdog Sensor Event Trigger Offset Next Steps

Event Trigger Offset
Hex    Description
00h    Timer expired, status only
01h    Hard reset
02h    Power down
03h    Power cycle
08h    Timer interrupt

Description (shared by the offsets above)
Our server systems support a BMC watchdog timer, which can check to see whether the OS is still responsive. The timer is disabled by default, and has to be enabled manually. It then requires an IPMI-aware utility in the operating system that will reset the timer before it expires. If the timer does expire, the BMC can take action if it is configured to do so (reset, power down, power cycle, or generate a critical interrupt).

Next Steps (shared by the offsets above)
If this event is being logged, it is because the BMC has been configured to check the watchdog timer.
1. Make sure you have support for this in your OS (typically using a third-party IPMI-aware utility such as ipmitool or ipmiutil along with the OpenIPMI driver).
2. If this is the case, it is likely your OS has hung, and you need to investigate OS event logs to determine what may have caused this.
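
When scanning decoded SEL output, the offsets in Table 78 can be mapped back to the action the BMC took. The following is a minimal Python sketch and is not part of the guide. It assumes the trigger offset is carried in bits [3:0] of Event Data 1, which is the layout shown for other sensors in this guide; the names WATCHDOG_OFFSETS and describe_watchdog_offset are hypothetical.

```python
# Offsets and descriptions copied from Table 78.
# The dictionary and function names are illustrative, not from the guide.
WATCHDOG_OFFSETS = {
    0x00: "Timer expired, status only",
    0x01: "Hard reset",
    0x02: "Power down",
    0x03: "Power cycle",
    0x08: "Timer interrupt",
}


def describe_watchdog_offset(event_data1: int) -> str:
    """Return the Table 78 description for an Event Data 1 byte.

    Assumption: the event trigger offset is the low nibble (bits [3:0])
    of Event Data 1; the upper bits are ignored here.
    """
    offset = event_data1 & 0x0F
    return WATCHDOG_OFFSETS.get(offset, f"Unknown offset {offset:02X}h")


if __name__ == "__main__":
    # Example byte with upper bits set and offset 1h in the low nibble.
    print(describe_watchdog_offset(0xC1))
```

Any offset other than 00h suggests the BMC was configured to act on expiry, so the OS event logs around the SEL timestamp are the next place to look.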
11.2 SMI Timeout
An SMI (system management interrupt) is generated so that the processor can service server management events (typically memory or PCI errors, or other forms of critical interrupts) and log them to the SEL. If this interrupt times out, the system is frozen. The BMC resets the system after logging the event.
Table 79: SMI Timeout Sensor Typical Characteristics

Byte   Field                            Description
11     Sensor Type                      F3h = SMI Timeout
12     Sensor Number                    06h
13     Event Direction and Event Type   [7] Event direction
                                            0b = Assertion Event
                                            1b = Deassertion Event
                                        [6:0] Event Type = 03h (“digital” Discrete)
14     Event Data 1                     [7:6] 00b = Unspecified Event Data 2
                                        [5:4] 00b = Unspecified Event Data 3
                                        [3:0] Event Trigger Offset = 1h = State Asserted
15     Event Data 2                     Not used
16     Event Data 3                     Not used
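
The byte layout in Table 79 can be checked against a raw SEL record with a few bit masks. The following Python sketch is an illustration, not the guide's own tooling. It assumes a 16-byte SEL record whose bytes are numbered from 1 as in the table, so byte 11 is index 10. The function name decode_smi_timeout and the sample record are hypothetical, and the sample's first ten bytes and last two bytes are placeholders.

```python
from typing import Optional

SMI_TIMEOUT_SENSOR_TYPE = 0xF3    # byte 11 in Table 79
SMI_TIMEOUT_SENSOR_NUMBER = 0x06  # byte 12 in Table 79


def decode_smi_timeout(record: bytes) -> Optional[dict]:
    """Decode bytes 11-14 of a SEL record per Table 79.

    Returns None when the record is not from the SMI Timeout sensor.
    Assumption: `record` is one 16-byte SEL entry, and table byte N
    corresponds to record[N - 1].
    """
    if len(record) != 16:
        raise ValueError("expected a 16-byte SEL record")

    sensor_type = record[10]    # byte 11
    sensor_number = record[11]  # byte 12
    dir_and_type = record[12]   # byte 13
    event_data1 = record[13]    # byte 14

    if (sensor_type != SMI_TIMEOUT_SENSOR_TYPE
            or sensor_number != SMI_TIMEOUT_SENSOR_NUMBER):
        return None

    offset = event_data1 & 0x0F  # bits [3:0]
    return {
        # bit [7]: 0b = assertion, 1b = deassertion
        "direction": "deassertion" if dir_and_type & 0x80 else "assertion",
        # bits [6:0]: 03h is the "digital" discrete event type
        "event_type": dir_and_type & 0x7F,
        # bits [7:6] and [5:4]: 00b means Event Data 2/3 are unspecified
        "event_data2_code": (event_data1 >> 6) & 0x03,
        "event_data3_code": (event_data1 >> 4) & 0x03,
        "offset": offset,
        "state_asserted": offset == 0x01,
    }


if __name__ == "__main__":
    # Placeholder header bytes (zeros) followed by the Table 79 fields.
    sample = bytes(10) + bytes([0xF3, 0x06, 0x03, 0x01, 0xFF, 0xFF])
    print(decode_smi_timeout(sample))
```

Returning None for non-matching records lets the same function be run over every entry in a SEL dump and keep only the SMI Timeout events.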


Intel S2600GZ Specifications

General
Product Type: Server Motherboard
Form Factor: SSI EEB
Chipset: Intel C602
Max Memory: 512 GB
SATA Ports: 8
RAID Support: RAID 0, 1, 10, 5
Network Controller: Intel 82574L
CPU Socket: LGA 2011
Supported Processors: Intel Xeon E5-2600 series
USB Ports: 2 x USB 3.0
Network: 2 x Gigabit Ethernet
Power Connector: 24-pin ATX

Summary

Introduction to System Event Log Troubleshooting

Purpose of the Guide

States the document's aim to list Intel platform events.

Industry Standards Overview

Covers industry standards for platform management.

Basic Decoding of SEL Records

Default SEL Values

Explains default values in SEL entries.

Collecting SEL Information

Guidance on capturing and understanding SEL logs.

Sensor Cross Reference List

BMC Owned Sensors

Details sensors managed by the BMC.

BIOS POST Owned Sensors

Details sensors managed by BIOS POST.

Microsoft OS Owned Events

Details records generated by the Microsoft OS.

Linux Kernel Panic Events

Details records generated by Linux kernel panics.

Power Subsystems

Threshold-based Voltage Sensors

Monitors main voltage sources in the system.

Power Supply Monitoring

Monitors the power supply subsystem for various conditions.

Cooling Subsystem

Fan Sensors Overview

Covers fan speed, presence, and redundancy sensors.

Temperature Sensors

Details various types of temperature sensors and their characteristics.

Processor Subsystem

Processor Status Monitoring

Monitors status information for each processor slot.

Catastrophic Error Reporting

Reports critical hardware errors via the CATERR# signal.

Memory Subsystem

RAS Configuration Status

Logs memory RAS configuration status after AC power-on.

ECC and Address Parity Errors

Covers memory data and address error detection.

PCI Express* and Legacy PCI Subsystem

PCI Express* Errors

Details correctable or fatal PCIe error events and their sources.

System BIOS Events

POST Error Logging

Logs POST errors and provides information on their causes.

Chassis Subsystem

Physical Security Monitoring

Monitors chassis intrusion and LAN leash status for security.

Miscellaneous Events

IPMI Watchdog Timer

Checks OS responsiveness using a watchdog timer for system stability.

BMC Firmware Health

Reports failures in BMC sensors and firmware health status.

Manageability Engine (ME) Events

ME Firmware Health

Reports health information for ME firmware, including upgrade and application errors.

Microsoft Windows* Records

Bug Check / Blue Screen Events

Records bug check or blue screen events for failure analysis.

Linux* Kernel Panic Records

Kernel Panic Event Format

Describes the format for Linux kernel panic events logged to the SEL.
