EasyManua.ls Logo

Nvidia DGX-2 SYSTEM - 2.3 SSH Connection

Nvidia DGX-2 SYSTEM
93 pages
Print Icon
To Next Page IconTo Next Page
To Next Page IconTo Next Page
To Previous Page IconTo Previous Page
To Previous Page IconTo Previous Page
Loading...
DGX-2 System DU-09224-001 _v09|13
Chapter5. U.2 NVMe Cache Drive
Post-Installation Tasks
This chapter describes the tasks that are typically needed after replacing a U.2 NVME drive or
upgrading from 8 to 16 drives.
5.1. Recreating the Cache RAID 0 Volume
1. Stop cachefilesd.
$ sudo systemctl stop cachefilesd
2. Umount /raid and stop raid-0.
$ sudo umount –f /raid
$ sudo mdadm –-stop /dev/md1
3. Run the script to rebuild the RAID volume.
$ sudo /usr/bin/configure_raid_array.py –c –f
Press Y at any questions.
4. When completed, confirm that the /raid volume is mounted.
$ df -hl /raid
The /dev/md1 filesystem should be mounted on /raid with size 28 TB or 56 TB,
depending on whether 8 or 16 drives are installed.
5.2. Confirming the Volume is Ready
1. Confirm the storage devices and volumes in the system are healthy using the following
command.
$ sudo nvsm show systems/localhost/storage/volumes/md1
2. Verify Status_Health=OK and that the numbers of drives listed in Drives = is as
expected.
3. Confirm that the drives are now available.
$ sudo mdadm -D /dev/md1
If the drive manufacturer is Micron, perform the steps in Enabling the Temperature Sensor.

Table of Contents

Other manuals for Nvidia DGX-2 SYSTEM

Related product manuals