EasyManua.ls Logo

Avaya S8700 - Page 68

Avaya S8700
2618 pages
To Next Page IconTo Next Page
To Next Page IconTo Next Page
To Previous Page IconTo Previous Page
To Previous Page IconTo Previous Page
Loading...
Initialization and Recovery
555-233-143
3-6 Issue 1 May 2002
syslogd Linux system log daemon (manages logging from Linux
services)
xntpd Network Time Protocol daemon (manages clock synchronizations
across the network)
Watchdogs HiMonitor
The Watchdogs HiMonitor checks for run-away processes and terminates them.
HiMonitor deals with an infinitely looping process that is preventing lower-priority
processes from running. More specifically, the high-priority HiMonitor process
periodically (interval set in watchd.conf) looks for responses from the
low-priority LoMonitor process. If present, HiMonitor resets Watchdogs timer. If
not, HiMonitor issues and logs a top command to determine which processes
are taking up CPU resources. HiMonitor then takes one of three recovery actions
in this order:
1. If a process within Watchdogs or the Process Managers Linux process
group, is consuming too high a percentage (percentage set in
/etc/opt/ecs/watchd.conf) of CPU occupancy, HiMonitor kills the
process.
2. If no process is using too high a percentage, but more than 100 instances
of the same monitored process is running, HiMonitor reboots Linux.
3. Does nothing and waits for the system to recover on its own.
If LoMonitor does not respond to a preset threshold (currently 5 of 7) of HiMonitor
checks, then (as a final recovery action) HiMonitor reboots Linux.
!
CAUTION:
Escalate to an Avaya engineer for explicit guidance with this recovery, since
it is potentially disruptive. A process can legitimately occupy abnormally
high amounts of processor time due to server load, and killing it could make
the server totally unavailable.
However, with an engineers guidance, recovery can be disabled by setting
the sampling-interval or occupancy-threshold values to 0. More likely, the
sampling-interval and CPU-occupancy thresholds will need to be fine-tuned
to values that dont cause erroneous recovery attempts.
NOTE:
The value of the sampling interval must be greater or equal to 0. If set
to 0, then the top command is not run, and no recovery is performed.
Also, the threshold CPU-occupancy percentage must be between 0 and
100. If set to 0, then no recovery is performed, but the top commands
output is logged. Setting these values to 0 may help achieve stability by
obtaining useful data without disrupting the processes.

Table of Contents

Other manuals for Avaya S8700

Related product manuals