Fault Detection and Diagnostics Overview
Monitoring Server Inventory and Health 81
2.
Use the hwmgmtcli list command:
hwmgmtcli list subsystem
Where subsystem is one of the following: all, server, cooling, processor, memory, power,
storage, network, firmware, device, bios, or iomodule
Related Information
■
Displaying Hardware Information (hwmgmtcli), Oracle Server CLI Tools User's Guide at
http://www.oracle.com/goto/ohmp/docs
Fault Detection and Diagnostics Overview
The server supports multiple fault detection and diagnostics tools. Fault detection tools, such
as the Oracle ILOM Fault Manager, automatically poll the system to detect hardware faults and
adverse environmental conditions. Diagnostics tools, such as Oracle VTS must be run manually
and can assist you in troubleshooting server issues. The following table provides an overview of
the fault detection and diagnostics tools supported by the server.
Tool Description Documentation
Oracle ILOM Fault
Manager
The Oracle ILOM Fault Manager is part of the Oracle ILOM
firmware embedded on the server service processor (SP). The
fault manager automatically detects system hardware faults and
environmental conditions on the server. If a problem occurs on the
server, Oracle ILOM identifies the problem in the Open Problems
table and logs information about the fault in the Event log.
Refer to Protecting Against Hardware
Faults: Oracle ILOM Fault Manager,
Oracle ILOM User's Guide for System
Monitoring and Diagnostics, Firmware
Release 3.2.x at:
http://www.oracle.com/goto/ilom/
docs
Oracle Linux
Fault Management
Architecture (FMA)
Oracle Linux FMA software can be optionally installed on the
server through Oracle Hardware Management Pack. Oracle Linux
FMA can be used to manage faults detected at the operating system
(OS) level in much the same way that you manage faults in Oracle
ILOM. Fault diagnosis messages from Linux FMA are maintained
on a fault management database, which is shared with Oracle
ILOM.
Refer to the Oracle Linux Fault
Management Architecture User's Guide
at:
http://docs.oracle.com/cd/
E52095_01
Oracle Solaris
Fault Management
Architecture (FMA)
Oracle Solaris FMA is included with the Oracle Solaris operating
system (OS). The fault manager receives data related to hardware
and software errors, automatically diagnoses the underlying
problem, and responds by trying to take faulty components offline.
Refer to Oracle Solaris Administration:
Common Tasks at:
http://docs.oracle.com/cd/
E23824_01/index.html
Auto Service Request
(ASR)
ASR is an optional support service for Oracle hardware. ASR
collects hardware telemetry data from telemetry sources (such as
Oracle ILOM) on ASR-enabled systems in your data center. ASR
filters this telemetry data and forwards what it determines to be
potential faults directly to Oracle, and then automatically initiates a
service request. You can configure features of the ASR service from
Oracle ILOM.
Go to:
http://www.oracle.com/us/support/
auto-service-request/index.html