Intel S2600GL

141 pages
System Event Log Troubleshooting Guide for EPSD Platforms Based on Intel® Xeon® Processor E5 4600/2600/2400/1600/1400 Product Families
Intel order number G90620-002
Revision 1.1
September 2013
Enterprise Platforms and Services Division Marketing

Intel S2600GL Specifications

General
Socket Type: LGA 2011
CPU Socket: LGA 2011
Max CPU Support: 2
Chipset: Intel C602
Max Memory: 512 GB
Number of Memory Slots: 16
Expansion Slots: 7
CPU Support: Intel Xeon E5-2600 series
Storage Interface: SATA 6Gb/s
RAID Support: Yes
Network Interface: 2 x Gigabit Ethernet
Power Connector: 24-pin ATX
Form Factor: SSI EEB 3.61

Summary

Introduction

1.1 Purpose

Defines the scope and purpose of this troubleshooting guide for Intel platforms.

1.2 Industry Standard

Explains the industry standards behind server management, such as IPMI, and the role of the Baseboard Management Controller (BMC).

Basic Decoding of a SEL Record

2.1 Default Values in the SEL Records

Details default values and formats for System Event Log (SEL) records.
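
As an illustration of what decoding a SEL record involves, the following is a minimal sketch that unpacks one 16-byte system event record. The field layout follows the standard IPMI system event record format (record type 02h); the function name `decode_sel_record` and the sample bytes are made up for this example and are not taken from the guide.

```python
import struct
from datetime import datetime, timezone


def decode_sel_record(raw: bytes) -> dict:
    """Unpack a 16-byte IPMI system event record into named fields."""
    if len(raw) != 16:
        raise ValueError("a SEL record is exactly 16 bytes")
    # All multi-byte fields are stored least significant byte first.
    # Layout: record ID (2), record type (1), timestamp (4), generator ID (2),
    # EvM revision (1), sensor type (1), sensor number (1),
    # event direction/type (1), event data 1 to 3 (3).
    (record_id, record_type, timestamp, generator_id, evm_rev,
     sensor_type, sensor_number, dir_type, ed1, ed2, ed3) = struct.unpack(
        "<HBIHBBBBBBB", raw)
    return {
        "record_id": record_id,
        "record_type": record_type,
        "timestamp": timestamp,  # seconds since 1970-01-01
        "time_utc": datetime.fromtimestamp(timestamp, tz=timezone.utc),
        "generator_id": generator_id,
        "evm_rev": evm_rev,
        "sensor_type": sensor_type,
        "sensor_number": sensor_number,
        # Bit 7 is the event direction, bits 6:0 are the event type.
        "event_dir": "deassertion" if dir_type & 0x80 else "assertion",
        "event_type": dir_type & 0x7F,
        "event_data": (ed1, ed2, ed3),
    }


# Hypothetical record bytes, invented for illustration only.
sample = bytes.fromhex("01 00 02 10 32 54 52 20 00 04 02 10 01 52 01 5a")
record = decode_sel_record(sample)
print(f"GID = {record['generator_id']:04X}h, "
      f"sensor type = {record['sensor_type']:02X}h, "
      f"{record['event_dir']}")
```

The Generator ID, sensor type, and sensor number are the fields needed to look up an event in the sensor cross reference list.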

2.2 Notes on SEL Logs and Collecting SEL Information

Provides guidance on capturing and interpreting SEL logs, including data formats.

Sensor Cross Reference List

3.1 BMC owned Sensors (GID = 0020h)

Cross-references BMC-owned sensors, their details, and next steps for troubleshooting.

3.2 BIOS POST owned Sensors (GID = 0001h)

Lists sensors owned by BIOS POST, including their details and troubleshooting steps.

3.3 BIOS SMI Handler owned Sensors (GID = 0033h)

Cross-references sensors managed by BIOS SMI Handler with troubleshooting guidance.

3.4 Node Manager ME Firmware owned Sensors (GID = 002Ch or 602Ch)

Details sensors managed by Node Manager/ME firmware with associated next steps.

3.5 Microsoft* OS owned Events (GID = 0041)

Lists Microsoft OS-owned events and their corresponding details and next steps.
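
The Generator IDs in the section headings of this chapter can be turned into a small lookup, sketched here under the assumption that every GID, including the Microsoft OS value 0041, is hexadecimal. The names `GENERATOR_OWNERS` and `sensor_owner` are hypothetical helpers, not part of the guide.

```python
# Generator ID (GID) to owner, as listed in the chapter's section headings.
# Assumption: all values are hexadecimal, including 0041 for Microsoft OS.
GENERATOR_OWNERS = {
    0x0020: "BMC",
    0x0001: "BIOS POST",
    0x0033: "BIOS SMI Handler",
    0x002C: "Node Manager ME Firmware",
    0x602C: "Node Manager ME Firmware",
    0x0041: "Microsoft OS",
}


def sensor_owner(generator_id: int) -> str:
    """Return the owner for a GID, or a marker string if it is not listed."""
    return GENERATOR_OWNERS.get(
        generator_id, f"unknown (GID = {generator_id:04X}h)")


print(sensor_owner(0x0020))
print(sensor_owner(0x602C))
```

Knowing the owner first narrows down which cross reference table applies to a given SEL record.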

Power Subsystems

4.1 Threshold-based Voltage Sensors

Details threshold-based voltage sensors, their characteristics, and event triggers.
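
For threshold-based sensors, the event trigger is carried in Event Data 1 of the SEL record. The sketch below decodes it using the generic IPMI threshold event offsets (event type 01h); the function name is hypothetical, and the guide's own tables remain the reference for platform-specific meanings.

```python
# Generic IPMI threshold event offsets (Event Data 1, bits 3:0).
THRESHOLD_OFFSETS = {
    0x0: "lower non-critical, going low",
    0x1: "lower non-critical, going high",
    0x2: "lower critical, going low",
    0x3: "lower critical, going high",
    0x4: "lower non-recoverable, going low",
    0x5: "lower non-recoverable, going high",
    0x6: "upper non-critical, going low",
    0x7: "upper non-critical, going high",
    0x8: "upper critical, going low",
    0x9: "upper critical, going high",
    0xA: "upper non-recoverable, going low",
    0xB: "upper non-recoverable, going high",
}


def decode_threshold_event(ed1: int, ed2: int, ed3: int) -> dict:
    """Decode Event Data 1 to 3 of a threshold-type (01h) event."""
    # Bits 7:6 = 01b: Event Data 2 holds the reading that triggered the event.
    has_reading = ((ed1 >> 6) & 0x3) == 0x1
    # Bits 5:4 = 01b: Event Data 3 holds the threshold that was crossed.
    has_threshold = ((ed1 >> 4) & 0x3) == 0x1
    return {
        "event": THRESHOLD_OFFSETS.get(ed1 & 0x0F, "reserved"),
        "raw_reading": ed2 if has_reading else None,
        "raw_threshold": ed3 if has_threshold else None,
    }


print(decode_threshold_event(0x52, 0x01, 0x5A))
```

The reading and threshold are raw sensor values; converting them to volts requires the conversion factors from the sensor's data record, which this sketch does not attempt.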

4.2 Voltage Regulator Watchdog Timer Sensor

Explains the Voltage Regulator Watchdog Timer sensor and its typical characteristics.

4.3 Power Unit

Monitors the power state of the system and logs state changes in the SEL.

4.4 Power Supply

Covers power supply status, power input, current output, and temperature sensors.

Cooling Subsystem

5.1 Fan Sensors

Details fan sensors: speed, presence, and redundancy, and their characteristics.

5.2 Temperature Sensors

Describes various temperature sensors like threshold-based, thermal margin, and discrete.

Processor Subsystem

6.1 Processor Status Sensor

Monitors status information for each processor slot and its event states.

6.2 Catastrophic Error Sensor

Reports when the Catastrophic Error signal (CATERR#) is asserted, indicating hardware issues.

6.3 CPU Missing Sensor

Detects and reports if a CPU is not installed or is in the incorrect socket.

6.4 Quick Path Interconnect Sensors

Covers sensors related to the Intel Quick Path Interconnect (QPI) bus for processor communication.

Memory Subsystem

7.1 Memory RAS Configuration Status

Logs memory RAS configuration status and errors after AC power-on.

7.2 Memory RAS Mode Select

Records changes in memory RAS Mode, such as Mirroring or Lockstep modes.

7.3 Mirroring Redundancy State

Monitors memory mirroring mode and logs events when redundancy is lost.

7.4 Sparing Redundancy State

Tracks memory sparing mode and logs events when redundancy is lost due to errors.

7.5 ECC and Address Parity

Details memory data errors (correctable/uncorrectable) and address parity errors.

PCI Express* and Legacy PCI Subsystem

8.1 PCI Express* Errors

Details PCI Express errors, including fatal, correctable, and legacy types, for troubleshooting.

System BIOS Events

9.1 System Events

Covers BIOS events occurring during POST or sleep state transitions.

9.2 System Firmware Progress (Formerly POST Error)

Logs POST errors and system firmware progress, providing information on potential issues.

Chassis Subsystem

10.1 Physical Security

Monitors chassis intrusion and LAN leash status for physical security.

10.2 FP (NMI) Interrupt

Logs events from diagnostic interrupt button presses or IPMI Chassis Control commands.

10.3 Button Sensor

Logs front panel power and reset button presses for informational purposes.

Miscellaneous Events

11.1 IPMI Watchdog

Checks OS responsiveness using an IPMI watchdog timer and logs expiry actions.

11.2 SMI Timeout

Logs SMI timeout events, which are raised when a System Management Interrupt (SMI) is not serviced in time and typically mean the system is frozen.

11.3 System Event Log Cleared

Logs when the System Event Log (SEL) is cleared, either manually or during manufacturing.

11.4 System Event - PEF Action

Logs events where BMC takes action based on Platform Event Filter (PEF) configuration.

11.5 BMC Watchdog Sensor

Reports BMC resets due to BMC Watchdog feature actions or BMC CPU resets.

11.6 BMC FW Health Sensor

Tracks BMC firmware health and reports sensor failures, indicating hardware access layer errors.

11.7 Firmware Update Status Sensor

Generates SEL events related to embedded firmware updates for BMC, BIOS, and ME.

11.8 Add-In Module Presence Sensor

Indicates whether add-in modules/boards are installed in dedicated server board slots.

11.9 Intel Xeon Phi Coprocessor Management Sensors

Provides limited manageability for Intel Xeon Phi Coprocessor adapters, including thermal and status sensors.

Hot-Swap Controller Backplane Events

12.1 HSC Backplane Temperature Sensor

Measures ambient temperature using a thermal sensor on the Hot-Swap Backplane.

12.2 Hard Disk Drive Monitoring Sensor

Monitors Hard Disk Drive status, including drive presence and faults.

12.3 Hot-Swap Controller Health Sensor

Indicates the health of the Hot-Swap Controller (HSC), reporting offline or degraded states.

Manageability Engine (ME) Events

13.1 ME Firmware Health Event

Reports health information for ME firmware, including upgrade and application errors.

13.2 Node Manager Exception Event

Logs events when the maintained policy power limit is exceeded over the correction time limit.

13.3 Node Manager Health Event

Provides runtime error indications about Intel Intelligent Power Node Manager's health.

13.4 Node Manager Operational Capabilities Change

Indicates changes in Node Manager's operational capabilities like policy interface and monitoring.

13.5 Node Manager Alert Threshold Exceeded

Logs events when maintained policy power limits are exceeded, affecting power management.

Microsoft Windows* Records

14.1 Boot up Event Records

Logs boot-up and OEM events when the system starts in Microsoft Windows OS.

14.2 Shutdown Event Records

Records OS stop/shutdown events, shutdown reason codes, and comments during system shutdown.

14.3 Bug Check Blue Screen Event Records

Logs bug check (blue screen) events, including OS stop/shutdown and OEM codes for failure analysis.

Linux* Kernel Panic Records

Details Linux kernel panic events captured in the system event log.
