Maintaining and Servicing the NVIDIA DGX Station
DGX Station DU-08255-001 _v4.6|37
$ sudo nvhealth [-k output-file]
output-file
The name and the path of the file in which the raw state of the system is written. The
nvhealth command displays this file name at the end of the output from the command.
If you omit the output file, the information is written to the file /tmp/nvhealth-
log.random-string.jsonl, for example, /tmp/nvhealth-log.6wf3WriAC3.jsonl.
Note:
If you run the nvhealth command while the RAID array is being rebuilt after a change in RAID
level to RAID 5, nvhealth reports the status of the RAID volume as unhealthy. To avoid this
potentially misleading result, wait until RAID array is rebuilt before running nvhealth.
To check the progress of the rebuild and show the percentage complete and an estimate of the
time to completion, run this command:
# cat /proc/mdstat
Personalities : [raid6] [raid5] [raid4] [linear] [multipath] [raid0] [raid1]
[raid10]
md0 : active raid5 sdb[0] sdc[1] sdd[2]
181764096 blocks super 1.2 level 5, 512k chunk, algorithm 2 [4/3] [UUU_]
[===>.................] recovery = 17.2% (10426232/60588032)
finish=45.8min speed=18238K/sec
4.6. Replacing the System and
Components
Be sure to familiarize yourself with the NVIDIA Terms & Conditions documents before
attempting to perform any modification or repair to the DGX Station. These Terms &
Conditions for the DGX Station can be found through the NVIDIA DGX Systems Support (http://
www.nvidia.com/object/dgxsystems-support.html) page.
Contact NVIDIA Enterprise Support to obtain an RMA number for any system or component
that needs to be returned for repair or replacement. When replacing a component, use only
the replacement supplied to you by NVIDIA unless you are directed otherwise.
The following components are customer-replaceable:
‣
Solid State Drives (SSDs)
Note: If you want to add SSDs for data storage to the DGX Station, obtain the SSDs from
NVIDIA Enterprise Support.
‣
DIMMs
Note: DIMMs are customer replaceable if a DIMM fails or to increase the system memory
capacity to 512 GB. If you want to increase the system memory capacity to 512 GB, obtain
the replacement DIMMs from NVIDIA Enterprise Support.