Chapter 1 System Overview 19
If a power supply problem is detected, an error message is displayed on the system
console and logged in the /var/adm/messages file. The System Fault and Power
Fault LEDs on the status and control panel also light. LEDs on the back of
each power supply indicate the source and nature of the fault.
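To review logged faults after the fact, the messages file can be searched from a shell. The following is a minimal sketch, not a documented Sun procedure: the function name find_fault_messages is hypothetical, and the default keyword "power" is an assumption, because the exact wording of the error messages is not given in this section.

```shell
# Hypothetical helper: list log lines that mention a keyword.
# Defaults are assumptions: the keyword "power" and the log path named above.
find_fault_messages() {
    keyword=${1:-power}
    logfile=${2:-/var/adm/messages}
    # -i ignores case; grep exits non-zero when nothing matches
    grep -i "$keyword" "$logfile"
}
```

Running find_fault_messages with no arguments searches /var/adm/messages for "power"; pass a different keyword or file as the first and second arguments.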
For more information about error messages generated by the environmental
monitoring subsystem, see Sun Fire V890 Diagnostics and Troubleshooting. You can
find this document at: http://www.sun.com/documentation. For more
information about system LEDs, see Chapter 8.
Automatic System Recovery
The Sun Fire V890 system provides a feature called automatic system recovery (ASR).
The ASR feature isolates failures and provides for the automatic restoration of the
operating system after certain non-fatal hardware faults or failures cause an
interruption. ASR does not prevent the operating system from going down in the
event of a hardware problem.
For more information, see “About Automatic System Recovery” on page 109.
Note – To enhance system restoration and server availability, Sun has recently
introduced a new standard (default) OpenBoot firmware configuration. These
changes, which affect the behavior of servers like the Sun Fire V890, are described in
OpenBoot PROM Enhancements for Diagnostic Operation. This document is included on
the Sun Fire V890 Documentation CD.
Hardware Watchdog Mechanism
To detect and respond to system hang conditions, the Sun Fire V890 system features
a hardware watchdog mechanism: a hardware timer that is continually reset as long
as the operating system is running. If the system hangs, the operating system can
no longer reset the timer. The timer then expires and triggers an automatic system
reset, eliminating the need for operator intervention.
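The timer behavior described above can be modeled in a few lines of shell. This is a conceptual sketch only, not the V890 hardware or driver logic: the timeout value, the tick granularity, and the function names are all hypothetical, and a POSIX shell is assumed.

```shell
# Conceptual watchdog model (hypothetical names and values).
WATCHDOG_TIMEOUT=3            # ticks allowed between resets (assumed value)
watchdog_remaining=$WATCHDOG_TIMEOUT

# Called by a healthy operating system: restart the countdown.
watchdog_reset() {
    watchdog_remaining=$WATCHDOG_TIMEOUT
}

# Called once per timer tick: count down and report expiry.
watchdog_tick() {
    watchdog_remaining=$((watchdog_remaining - 1))
    if [ "$watchdog_remaining" -le 0 ]; then
        echo "timer expired: system reset"
        return 1
    fi
    echo "timer running: $watchdog_remaining ticks left"
}
```

In this model, a running system calls watchdog_reset before the countdown reaches zero, so watchdog_tick never reports expiry. A hung system stops calling it, and the third consecutive tick reports the reset.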
Note – The hardware watchdog mechanism is not activated until you enable it.
To enable this feature, you must edit the /etc/system file to include the
following entry:
set watchdog_enable = 1
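The edit above can also be scripted. The following is a minimal sketch, assuming a POSIX-compliant shell and grep (on some Solaris releases the POSIX versions may be under /usr/xpg4/bin). The function name enable_watchdog and its file argument are hypothetical and exist only for illustration. The function appends the entry only if it is not already present, so it can be tried on a scratch copy before it is run against /etc/system.

```shell
# Hypothetical helper: append the watchdog entry to a system file if missing.
# Pass a scratch copy for a dry run; pass /etc/system (as root) to apply.
enable_watchdog() {
    file=$1
    entry='set watchdog_enable = 1'
    # -x matches the whole line, -F treats the entry as a fixed string
    if grep -qxF "$entry" "$file"; then
        echo "already enabled"
    else
        printf '%s\n' "$entry" >> "$file"
        echo "entry added"
    fi
}
```

For example, enable_watchdog /etc/system prints "entry added" on the first run and "already enabled" afterward. Because /etc/system is read at boot, the setting takes effect after the next reboot.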