Configure Threshold Profiles

Threshold profiles let users monitor specific devices or components, such as interfaces and IPSLA tests. Different components often require different threshold configuration. Creating threshold profiles lets users determine a threshold for a given metric or set of metrics. Failing to meet the threshold produces a violation event, and returning to acceptable operation produces an event which indicates that the violation has been cleared.
capm320
HID_Configure_Threshold_Profiles
Threshold profiles let users monitor specific devices or components, such as interfaces and IPSLA tests. Different components often require different threshold configuration. Creating threshold profiles lets users determine a threshold for a given metric or set of metrics. Failing to meet the threshold produces a violation event, and returning to acceptable operation produces an event which indicates that the violation has been cleared.
Example
You want to monitor the utilization of an interface, and trigger a violation when the utilization is above 75 percent. When the utilization drops below 75 percent, you want the violation to clear.
  • Event Rules
    Event rules let users define logic, which uses metric data to determine when a violation of a threshold occurs. Each event rule uses metrics from a single metric family, and various operators that compare the threshold to the actual value. Up to five conditions can be added to an event rule, and a violation is produced when the threshold is exceeded. In the example on this page, at least two rules are needed. One rule determines when the violation occurs, while the other determines when it is cleared.
  • Event Condition
    Event conditions automatically appear in a view when all the conditions in a certain event rule are met. Conditions include "Violation" and "Cleared".
The following video shows the threshold profile configuration process:

2
Threshold Best Practices
Consider the following best practices when you configure thresholding:
  • Apply threshold profiles to groups with specific components instead of devices. This group structure increases the granularity and flexibility of thresholding.
  • Expand threshold monitoring slowly. Start with a small group of components and verify that the monitoring engine does not become degraded. For more information, see Threshold Event Processing Self-Monitoring Metrics.
  • Thresholds on components with 1-minute polling have a high resource cost to the system.
  • Threshold evaluations might slow down after a Data Aggregator restart while cached poll data from the the Data Collectors is processed.
Create Threshold Profile
Log in as a user or administrator to create a threshold profile. Users with the Create DA Threshold Profile or the Administer DA Threshold Profile role right can create threshold profiles.
You can edit or delete exiting threshold profiles.
Follow these steps:
  1. Select
    Administration
    , and click the data source.
  2. Expand
    Monitoring Configuration
    , and click
    Threshold Profiles
    .
  3. Create a folder, or select an existing folder.
  4. Click
    New Profile
    .
  5. Specify the required information.
  6. (Administrator Only) Select an owner. Only the owner or a user with the Administer DA Threshold Profile role right can edit the profile.
  7. Add event rules to the profile.
  8. Click
    Save
    .
    The threshold profile is added to the system. To generate events, assign the profile to a group.
Add Event Rules
Each event rule is based on a single metric family, and determines the conditions that cause or clear a violation. Each threshold profile requires at least one event rule.
You can edit or delete existing event rules.
To use an existing event rule as a template, select an event rule, and click Copy.
Follow these steps:
  1. Create or edit a threshold profile.
  2. In the Event Rules pane, click
    New
    .
  3. Specify the required information for the event rule. The following fields require explanation:
    • Duration
      Specifies the total amount of time a given condition must be true within the specified Window to generate an event. The poll cycles that trigger the condition do not need to be consecutive.
    • Window
      Specifies the overall range of time to evaluate the rule condition.
    • Aggregation
      Specifies whether the threshold applies to an aggregate value of all components for the device. This field appears only when you select a supported metric family.
      Currently, only the Utilization (%) Metric for the CPU and Memory metric families are supported for aggregation. When you select this option, the event rule must use Fixed Value for the Condition Type.
  4. Save the event rule.
  5. Save the threshold profile.
Duration and Window Example:
A monitored device has a poll rate of 5 minutes. An associated threshold profile has an event rule with a duration of 600 (10 minutes) and a window of 3600 (1 hour). An event does not occur when the conditions are triggered for a single poll result because the 5-minute poll does not reach the 10-minute duration. The event occurs only if the conditions of the rule are triggered for a second poll result within one hour of the first triggering poll.
When a threshold is breached, an alarm is created. When the event clears, the threshold is rechecked with the next poll cycle. If the threshold is breached again, a new alarm is created.
Standard Deviation Event Rules
Event rules that use standard deviation compare the poll results to the baseline for the device or component. The baseline and the standard deviation value are calculated for the specific hour of the day of the week. For more information about these calculations, see Baseline Calculations.
Standard deviation rules are triggered when the value of the metric differs from the baseline by the specified number of standard deviations. For rules with the Above operator, the rule is triggered when the value of the metric exceeds the baseline value plus the number of standard deviations. For rules with the Below operator, the rule is triggered when the value of the metric is lower than the baseline value minus the number of standard deviations.
Example
The baseline is 65% and the standard deviation is 10%. The rules states that an event triggers when CPU utilization is above 2 standard deviations. This condition triggers when the CPU utilization is greater than 85%.
Assign Groups to Threshold Profile
To generate violation events, associate groups to the threshold profile. Assigning groups to a profile identifies the devices or components that the profile monitors. When a device is in the group, the threshold profile applies to each component of the device that supports the selected metric family.
Threshold profiles that are assigned to collections apply the event rule only to devices in the collection. Components and interfaces in the collections are not analyzed. Use custom groups to generate events for components and interfaces.
Follow these steps:
  1. Select a threshold profile from the Folder View or Table View.
  2. Click the
    Groups
    tab in the right-hand pane.
  3. Click 
    Manage
    at the bottom of the screen.
  4. Select the groups from the
    Available Groups
    tree, and click the right arrow to add it to the Selected list.
  5. Click OK.
    The groups in the Selected list are assigned to the threshold profile.
View Threshold Profile Events
Event rules that you create trigger events when a threshold is violated and when the violation is cleared. The Threshold Profiles screen shows events that have occurred as a result of specific event rules. In contrast, the Events Display dashboard shows threshold events that relate to all event rules in all threshold profiles.
Follow these steps:
  1. (Administrator) Select
    Administration
    ,
    Threshold Profiles
    .
    (User) Select
    My Settings
    ,
    Threshold Profiles
    .
  2. Select a threshold profile.
  3. Click the
    Events
    tab.
  4. (Optional) Click the
    Details
    button.
  5. (Optional) Click
    Change
    next to the time range, and select a default time range.
    You can also select
    Custom Time Range
    to set a different time range.