Getting GPU Health Information from Within the VM
:~$ sudo nvme list
Node SN Model Namespace Usage Format FW Rev
------------ -------------- -------------------------- -- -------------------- ---------- --------
/dev/nvme0n1 S2X6NX0K501953 SAMSUNG MZ1LW960HMJP-00003 1 61.79 GB / 960.20 GB 512 B + 0 B CXV8601Q
<snip> ...
/dev/nvme9n1 18141C246847 Micron_9200_MTFDHAL3T8TCT 1 3.84 TB / 3.84 TB 512 B + 0 B 101008R0
:-$ sudo nvme smart-log /dev/nvme9n1