Troubleshooting and problem solving
These procedures are intended to be used only during initial configuration, for the purpose of verifying that hardware setup is successful.
They are not intended to be used as troubleshooting procedures for configured systems using production data and I/O.
Topics:
• Overview
• Fault isolation methodology
• LEDs
• Troubleshooting 2U enclosures
• Troubleshooting 5U enclosures
• Temperature sensors
• Host I/O
Overview
The enclosure system includes a Storage Enclosure Processor (SEP) and associated monitoring and control logic to enable it to diagnose
problems with the enclosure’s power, cooling, and drive systems. Management interfaces allow for provisioning, monitoring, and managing
the storage system.
NOTE:
See Fault isolation methodology when conducting system diagnostics.
Fault isolation methodology
Dell EMC PowerVault ME4 Series Storage Systems provide many ways to isolate faults. This section presents the basic methodology used
to locate faults within a storage system, and to identify the pertinent CRUs affected.
Use the ME Storage Manager to configure and provision the system upon completing the hardware installation. As part of this process,
configure and enable event notification so the system will notify you when a problem occurs that is at or above the configured severity
(see the topic about configuring event notification within the Dell EMC PowerVault ME4 Series Storage System Administrator’s Guide).
With event notification configured and enabled, you can follow the recommended actions in the notification message to resolve the
problem, as further discussed in the options presented in the following section.
Fault isolation methodology basic steps
Following is a summary of the basic steps used to perform fault isolation and troubleshooting:
• Gather fault information, including using system LEDs as described in Gather fault information
• Determine where in the system the fault is occurring as described in Determine where the fault is occurring.
• Review event logs as described in Review the event logs.
• If required, isolate the fault to a data path component or configuration as described in Review the event logs .
Options available for performing basic steps
When performing fault isolation and troubleshooting steps, select the option or options that best suit your site environment. Use of any
option (four options are described below) is not mutually exclusive to the use of another option. You can use the ME Storage Manager to
check the health icons/values for the system and its components to ensure that everything is okay, or to drill down to a problem
component. If you discover a problem, either the ME Storage Manager or the CLI provide recommended-action text online. Options for
performing basic steps are listed according to frequency of use:
• Use the ME Storage Manager
• Use the CLI
• Monitor event notification
2
28 Troubleshooting and problem solving