mO SharemO Share

SONARPLEX Monitoring's Guide

Welcome to the azeti SONARPLEX Monitoring's Guide. This document describes what happens while monitoring with the SONARPLEX appliance.

Introduction

With a SONARPLEX appliance as the central device you can monitor several local SONARPLEX appliances from a central location (distributed monitoring).

Distributed monitoring gives an immediate overview of the status of a host (UP or DOWN) or of the services (OK, WARNING, CRITICAL, UNKNOWN) in the entire network. Via the SONARPLEX appliance as the central device, you can access the User Web Interface of other SONARPLEX appliances to confirm a problem or comment an existing problem. You can also disable notifications and service checks via the host/service commands.

By using the User Web Interface, also called monitoring web interface, of the central SONARPLEX appliance you can also monitor the network structure. All occurred fault status are clearly displayed.

The possible fault status of hosts and services with corresponding color mean the following:

  • Service:
    • WARNING (yellow): The monitoring system detects a fault for the first time. Depending on the threshold value, the problem is further checked until a new status is detected.
      WARNING can also be executed when a threshold value is reached (e.g. 80 % of CP load reached).
    • CRITICAL (red): The service status changes from WARNING to CRITICAL if it has not improved according to the defined threshold values.
      CRITICAL can also be triggered if a second threshold value is reached (e.g. 90 % of the CPU load reached).
    • UNKNOWN (blue): The service does respond but the response does not correspond to the defined return value of the check. The host does also work. In this case, the service reacts in an unusual way and its status changes to UNKNOWN.
    • RECOVERY/OK (green): The monitored service is functional again.
    • PENDING
  • Host:
    • DOWN (red): The host does not respond anymore (due to a serious fault, for example).
    • UNREACHABLE (red): Due to infrastructure problems, the host cannot be reached. Its status cannot be determined.
    • RECOVERY/UP (green): The host is again available after a fault. Normal monitoring is again enabled.
    • PENDING

The monitor web interface delivers for example information about hosts or services with scheduled downtime and also the host or service comments (see User Web Interface > Information ). These comments are helpful when many people work on a host or use it. Just click on "Add a new host/service comment" in User Web Interface > Information > Comments to start writing a host or service comment. How to write a comment or rather which fields must be filled are described in the concerned page.

Recent updates

The following macros are not currently supported in the footer:
  • style