View Alarms Data

This article describes the alarm view for devices and interfaces and how to change the view to display the information that you need.
uim902
This article describes the alarm view for devices and interfaces and how to change the view to display the information that you need.
 
 
Overview
CA UIM monitors the system for system availability and operating parameters. If configured operating thresholds are exceeded, Infrastructure Management generates an error message called an alarm. Alarms represent changes in device performance levels or complete device failures: for instance, high system loads or a loss of access. Knowing when specific system elements need attention, you can correct performance problems before they affect system availability.
The Alarms view in Infrastructure Management displays a list of active alarms returned by the system. You can view details for individual alarms and identify the devices that created them. You can distinguish between critical and non-critical alarms to prioritize alarm resolution. Also, you can easily access dashboards that display performance metrics over time to address patterns in low system performance.
The Alarm Lifecycle
Infrastructure Management collects metrics on all monitored devices. If an operating parameter exceeds the threshold, Infrastructure Management:
  1. Generates an alarm.
  2. Assigns the alarm an associated severity.
Alarms that meet default parameters for severity are automatically sent to the email account configured in the emailgtw probe.
If the alarm condition persists, the time of the latest event is maintained internally. The calculated duration of the alarm represents the persistence of the alarm condition.
If the alarm level changes (another alarm threshold is exceeded or the current threshold is no longer met), the alarm view displays the new severity. When the alarm condition is corrected, the alarm condition is cleared and the alarm is removed from the alarm display.
How the Alarms View Works
The Alarms view gives you a list of current alarms and details on those alarms. To open the Alarms view, click on the Alarms icon ( Alarms view icon ) at the left of the Home view or the alarm count in the navigation card in the Home view for a displayed group or inventory.
  Alarm view links in the Home view
The view is context-sensitive. If you open the Alarms view without navigating to a specific group in the Home view, the Alarms view displays all system alarms. If you navigate to a group within the Home view, the Alarms view displays alarms for all devices in the group. If you navigate within the Home view to a specific device within a group, the Alarms view displays alarms for the selected device.
The alarm view is dynamic: On opening, the alarm list is automatically updated to show the latest alarms. Alarms that are cleared by the system are removed from the view.
An alarm is displayed so long as its device is operating outside of operating parameters. A condition that persists over time is represented by a single alarm, and the view displays the duration of the first instance of the alarm. Subsequent identical alarms are suppressed from the view. Any change in an alarm other than severity (for example, an alarm message revision) generates a new alarm.
The Alarms view opens as a list. Columns in the view are:
  • Severity icons (!)
  • Device Name
  • Device Type
  • Alarm Type
  • Alarm Message
  • Duration
     To see the creation date and time for the alarm, hover the cursor over the duration period.
  Alarms view columns  
Icons represent the alarm severities that you defined in monitoring configuration. Currently, you can set only two severity levels.
  Critical alarm icon  - Critical
  Major alarm icon  - Major
  Minor alarm icon  - Minor
  Warning Icon  - Warning
  Information Icon  - Info
The number of alarms in the list is displayed in the lower-right corner of the window. Alarm counts show the alarm number for the page and the total number of alarms in the list. When no filter is applied, the view displays the number of alarms for the view context. When a filter is applied, the view displays the number of alarms in the filtered list.
Alarm Filters
For each column in the Alarms view, you can use the additional filter options to narrow your search. Move the mouse pointer to the column to view the filter options.
Column Name
Filter
Description
 
Alarm Severity
 
  alarm_severity.png  
Filter alarms based on the alarm severity:
  • View a sorted list based on the alarm count (shown as 1 in the image).
  • Use specific alarm severity to view the related list of alarms (shown as 2 in the image).
 
Device Name
 
  alarms_device_name.png  
Filter alarms based on the device name:
  • View a sorted list based on the device name (shown as 1 in the image).
  • Use specific device name text to view the filtered list (shown as 2 and 3 in the image).
 
Device Type
 
  alarms_device_type.png  
Filter alarms based on the device type:
  • View a sorted alarms list based on the device type (shown as 1 in the image).
  • Use specific device type text to view the filtered list (shown as 2 in the image).
 
Alarm Type
 
  alarms_alarm_type.png  
Filter alarms based on the source of the alarm:
  • View a sorted alarms list based on the alarm type (shown as 1 in the image).
  • Use specific alarm type text to view the filtered list (shown as 2 and 3 in the image).
 
Alarm Message
 
  alarms_alarm_message.png  
Filter alarms based on the alarm message:
  • View a sorted alarms list based on the alarm message (shown as 1 in the image).
  • Use specific alarm message text to view the filtered list (shown as 2 and 3 in the image).
 
Duration
 
  alarms_duration.png  
Filter alarms based on the time when they were raised:
  • View a sorted alarms list based on the duration (shown as 1 in the image).
  • Use specific duration to view the filtered list (shown as 2 in the image).
You can control the display of alarms and their details. The following items are common questions and their answers.
How can I control the number of alarms in my window?
The number of alarms that are displayed on the page depends on the 'Rows per page' selection at the bottom of the page, the resolution setting of the monitor, and the available space in the browser window. "Auto" displays the number of alarms that fit in the browser window at the selected resolution. Other settings may generate a scroll bar to display all items in the page. Change any of these settings to change the alarm list view.
What happens to the view when a new alarm occurs?
The list of alarms is "paused" by default: that is, new alarms are held apart from the list. Pausing keeps the alarms list static while you view the current alarm data. When a new alarm is logged or an alarm is cleared, a message appears in the header of the view. You can update the alarm list by clicking the update message to view the new alarms and drop the cleared or expired alarms. The page contents changes accordingly but keep a currently selected alarm visible.
Can I sort the alarms list? Can I filter alarms?
You can sort the view by any column by clicking the column title and then the sort icon for an ascending or descending sort order. Strings are sorted alphabetically. If no alarm is selected, the sort opens to the first page of results.
You can filter alarms. Click next to the Filter icon ( Filter icon ) at the upper left of the list to enter a quick filter. The alarm list is filtered on device names. Filters are case-sensitive.
How do I view alarm details, such as device information?
Select an alarm to open the alarm details pane below the list. If the monitoring probe collects metrics, the details pane contains alarm, device, and metric details. Opening the details pane automatically adjusts the length of the list to accommodate the pane.
What happens to the list when an alarm condition is corrected?
Alarms are displayed until the monitoring probe clears the condition. The alarm condition is cleared when the next monitoring interval no longer reports the condition. Each probe has its own monitoring interval, so different types of alarms can take different amounts of time to clear.
View Alarm Details
Select an alarm in the list to view details for the alarm. The details identify device and alarm attributes to help identify performance issues. For example, knowing what probe monitors the device might prompt you to check the probe status and log through the Admin Console view. Such details can help you determine whether the issue is in the device or in the probe.
The details view contains two tabs: Overview and Device Details.
  • The Overview tab displays alarm details.
  • The Device Details tab displays attributes of the device.
  • The Metrics tab displays performance metrics for the device. This tab only exists when an alarm is based on a metric.
The details view remains open for any selected alarm on the page but disappears when another page of alarms is selected, a filter removes the device from the display, or the alarm clears during viewing. You can close the details view and return to the list of alarms by clicking the 
X
 at the upper-right corner of the view.
View Device Metrics
You can view performance metrics for devices through the Alarms view. Metrics let you see the performance of the device for the last 24 hours.
Device metrics are available in two places: under the Metrics tab in the Alarms view and in the CABI dashboard for the device, accessible through the CA Business Intelligence (CABI) dashboards view. The Metrics tab is only available for alarms that are associated with metrics. Alarms for some conditions, such as an expired license, do not produce metrics, and no Metrics tab is generated.
A link in the device name under the Device Details tab opens the dashboard view for the device. The dashboard view provides additional information about the device details. You can also open this view through the Dashboards view by drilling down through the group cards to the device.
  Performance Metrics for Devices through Alarms View  
For information about viewing dashboards for groups and devices, see View Your Dashboards.
Clear Alarms
Infrastructure Management clears alarms when you correct the condition that created the event or the alarm expires due to inactivity. After you correct an alarm condition, the monitoring probe will clear the alarm at the next monitoring interval and Infrastructure Management removes the alarm from the Alarms view. If an alarm is not updated for 72 hours, Infrastructure Management clears the alarm automatically and removes the alarm from the Alarms view.
Refer to the following displays to identify possible causes of an alarm. Be aware that an alarm condition for one device—for instance, a power supply—can cause dependent devices to register alarm conditions too. Be selective about the priority of alarms to restore system operation quickly.
  • View the metrics for the alarm in by clicking the Metrics tab and viewing the metrics graph to identify any pattern in device performance that might result in an out-of-range condition. The graph displays the last 24 hours of device operation. The graph might indicate an hourly or daily pattern of heavy use.
  • View the alarm metrics for the device in the 
    Dashboard
     ( Dashboard icon ) view for the same information in context of the device. In the dashboard, you can change the displayed period to view metrics patterns to days or weeks. Changing the display period can be useful if periods of high use occur on specific days of the week or month.
  • If a minimum value is required for normal operation and no metrics are being returned, the monitoring probe may not be active. Check the status of the monitoring probe through the 
    Admin Console
     to verify that the monitoring probe is running. If the probe status shows that it is inactive, restart the probe. For instructions, see Activate, Deactivate, or Restart a Probe.
  • If an alarm is being returned for otherwise an acceptable operating condition, alarm threshold settings may be set too low.
    Follow these steps:
    1. Select the group for the device in the Home ( Home icon ) view.
    2. Click the Settings ( Settings Icon ) icon at the upper right of the window to view the group monitoring profile.
    3. Check for an alarm threshold value that should be raised or a metric that should be turned off.
  • Review the monitoring probe log file for the device through the 
    Admin Console
     view to identify potential sources of the alarm condition. The alarm condition may be a one-time occurrence or a pattern of performance problems. If a configuration parameter (for example, a memory partition or the number of CPUs available) causes the condition, you might need to reconfigure or replace the device. Refer to internal troubleshooting procedures to proceed. For instructions on viewing the log file, see View a Probe Log File.
Consult with your IT administrator for troubleshooting steps and configuration changes that are needed to correct the alarm condition.
Troubleshooting
 
Symptom:
 
There may be instances where the alarms are not generated though alarm policy is active.
 
Solution:
 
Alarm policy works only for enhanced templates. Ensure that the template that generates the metrics is an enhanced template.