EasyManua.ls Logo

Nvidia DGX H100 - Identifying the Failed DIMM; Replacing the DIMM

Nvidia DGX H100
146 pages
To Next Page IconTo Next Page
To Next Page IconTo Next Page
To Previous Page IconTo Previous Page
To Previous Page IconTo Previous Page
Loading...
NVIDIA DGX H100 Service Manual
10.2. Identifying the Failed DIMM
1. From the console, run the following nvsm command to identify memory alerts:
sudo nvsm show health
2. Determine the DIMM manufacturer.
sudo nvsm show memory
3. Request the replacement DIMM from NVIDIA Enterprise Support, specifying the manufacturer.
10.3. Replacing the DIMM
1. Power o the system.
2. Remove the motherboard tray. Refer to Motherboard Tray - Removal and Installation for more
information.
3. Pull the motherboard out of the system and place it on a solid, at surface and remove the lid
and air baes to expose the DIMMs.
4. Identify the failed DIMM on the motherboard. Use the label on the lid to identify the position of
the DIMM to be replaced. The names of the DIMMs also include the CPU numbering for easier
identication.
62 Chapter 10. DIMM Replacement

Table of Contents

Other manuals for Nvidia DGX H100

Related product manuals