Chapter 7. Managing Power Capping
The GPU has three sources of power limits:
▶ VBIOS: defines the maximum possible TGP (Total Graphics Power) value.
▶ The nvidia-smi tool: lets users set the GPU power limit from the host.
▶ SMBPBI: sets the power limit of the GPU via an out-of-band channel.
The GPU Performance Monitoring Unit (PMU) selects the most conservative policy to cap a system’s
power consumption.
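As a rough illustration of what "most conservative" could mean, the sketch below assumes the policy amounts to taking the lowest of the limits that are currently set. That interpretation, the function name effective_power_limit, and the wattages in the example call are assumptions for illustration, not documented PMU behavior.

```python
from typing import Optional


def effective_power_limit(vbios_max_tgp: float,
                          nvidia_smi_limit: Optional[float] = None,
                          smbpbi_limit: Optional[float] = None) -> float:
    """Return the lowest power limit (in watts) among the sources that are set.

    Assumption: "most conservative" means the minimum of the active limits.
    The VBIOS maximum TGP is always present; the other two are optional.
    """
    limits = [vbios_max_tgp]
    for limit in (nvidia_smi_limit, smbpbi_limit):
        if limit is not None:
            limits.append(limit)
    return min(limits)


# Hypothetical values: VBIOS allows 700 W, nvidia-smi set 500 W, SMBPBI set 600 W.
print(effective_power_limit(700, nvidia_smi_limit=500, smbpbi_limit=600))  # prints 500
```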
7.1. Querying the Current GPU Power Limit
Use the following curl command to query the current GPU power limit:
curl -k -u <username>:<password> https://<bmc>/redfish/v1/Systems/HGX_Baseboard_0/Processors/GPU_SXM_<id>/EnvironmentMetrics
Where
▶ <bmc> is the BMC IP address.
▶ <id> is the GPU instance number, from 1 to 8.
As shown in the following example output, the Reading field indicates the current power usage, and
the SetPoint field indicates the current GPU power limit.
...
"PowerLimitWatts": {
    "AllowableMax": 700,
    "AllowableMin": 200,
    "ControlMode": "Automatic",
    "DefaultSetPoint": 700,
    "Reading": 64.388,
    "SetPoint": 700
}
...
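For scripted monitoring, the same query can be issued from Python. The following is a minimal sketch using only the standard library. The helper names (metrics_url, parse_power_limit, query_power_limit) are hypothetical, and the sketch assumes that PowerLimitWatts is a top-level key of the response, which the truncated example output does not confirm.

```python
import base64
import json
import ssl
import urllib.request


def metrics_url(bmc: str, gpu_id: int) -> str:
    """Build the EnvironmentMetrics URL for a GPU instance number from 1 to 8."""
    if not 1 <= gpu_id <= 8:
        raise ValueError("gpu_id must be between 1 and 8")
    return (f"https://{bmc}/redfish/v1/Systems/HGX_Baseboard_0/"
            f"Processors/GPU_SXM_{gpu_id}/EnvironmentMetrics")


def parse_power_limit(body: str) -> dict:
    """Extract the current power usage and power limit from a response body.

    Assumption: PowerLimitWatts sits at the top level of the JSON response.
    """
    limit = json.loads(body)["PowerLimitWatts"]
    return {"usage_watts": limit["Reading"], "limit_watts": limit["SetPoint"]}


def query_power_limit(bmc: str, gpu_id: int, username: str, password: str) -> dict:
    """Fetch and parse the power limit for one GPU.

    Certificate verification is disabled to mirror curl's -k option.
    """
    context = ssl.create_default_context()
    context.check_hostname = False  # must be cleared before setting CERT_NONE
    context.verify_mode = ssl.CERT_NONE
    token = base64.b64encode(f"{username}:{password}".encode()).decode()
    request = urllib.request.Request(
        metrics_url(bmc, gpu_id),
        headers={"Authorization": f"Basic {token}"},
    )
    with urllib.request.urlopen(request, context=context) as response:
        return parse_power_limit(response.read().decode())
```

Calling query_power_limit("<bmc>", 1, "<username>", "<password>") with real values would return a dictionary with the Reading and SetPoint values for GPU 1. Keeping the parsing step separate from the network call makes it possible to check the parsing against saved output without a BMC.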