10 Netra T2000 Server Installation Guide • September 2006
The power subsystem is monitored in a similar fashion and any fault is indicated at
the front and rear panel LEDs. Additionally, LEDs located on each power supply
light to indicate failures.
Error Correction and Parity Checking
The UltraSPARC T1 multicore processor provides parity protection on its internal
cache memories, including tag parity and data parity on the D-cache and I-cache.
The internal 3MB L2 cache has parity protection on the tags, and ECC protection on
the data.
Advanced ECC, also called chipkill, corrects up to 4-bits in error on nibble
boundaries, as long as they are all in the same DRAM. If a DRAM fails, the DIMM
continues to function.
Fault Management and Predictive Self-
Healing
The server features the latest fault management technologies. With the Solaris 10
Operating System (OS), Sun is introducing a new architecture for building and
deploying systems and services capable of Predictive Self-Healing. Self-healing
technology enables Sun servers to accurately predict component failures and
mitigate many serious problems before they actually occur. This technology is
incorporated into both the hardware and software of the server.
At the heart of the predictive self-healing capabilities is the Solaris Fault Manager, a
new service that receives data relating to hardware and software errors, and
automatically and silently diagnoses the underlying problem. Once a problem is
diagnosed, a set of agents automatically responds by logging the event, and if
necessary, takes the faulty component offline. By automatically diagnosing
problems, business-critical applications and essential system services can continue
uninterrupted in the event of software failures, or major hardware component
failures.