Fault
Symptom
Diagnosis Procedure Quick Recovery Method
A
communic
ation
error
occurs on
a network
port.
1. Check whether the network
cable is connected properly to
the network port.
2. Use the Computing Product
Compatibility Checker to
check whether the NIC type is
compatible with the server
board. Use Computing
Product Firmware and Driver
Mapping Checker to check
whether the NIC
rmware and
driver versions match the OS.
If they do not match, upgrade
the NIC
rmware and driver
rst.
3. To check whether the network
ports are up, run the
ifcong
eth
N
up command in Linux
(the command may vary in
dierent OSs). To check
whether IP addresses are set
for the required network ports,
run the ethtool eth
N
command.
4. Run the ethtool -p eth
N
command in Linux (the
command may vary in other
OSs) to check whether the
information in the network
port
conguration le of the
rack server/Atlas 800 AI
inference server (model 3010)/
Atlas 800 AI training server
(model 9010) is consistent
with the actual physical
network ports, and check
whether the network port
status indicators are on and
whether the network ports on
the switch are up.
NOTE
The ethtool -p eth
N
command
applies only to plug-in PCIe cards.
5. Check the network port
conguration of the switch
module by referring to E9000
Blade Server Mezzanine
Module-Switch Module
1. Use the ping command to
check whether the server or
other servers on the
network have network
faults.
● If the fault occurs on
more than one server,
check whether the
external switching
network is normal.
● If the fault occurs only
on one server, go to 2.
2. Check the indicator to see
the NIC port status. If the
indicator is
o, switch the
optical module, optical
cable, and uplink switch
port related to the faulty
NIC port with those of a
normal NIC port if any of
these components are
faulty. Then replace them.
3. If the NIC is causing the
fault, restart the server
when interruption will not
aect services, and check
whether the communication
is normal. If the fault
persists, power the server
o and on. If the fault still
persists, replace the NIC.
Huawei Servers
Troubleshooting 5 Diagnosing and Rectifying Faults
Issue 20 (2020-09-25) Copyright © Huawei Technologies Co., Ltd. 99