rsp Metrics

This article describes the metrics that can be configured for the Remote System Probe (rsp) probe. 
uimpga-ga
This article describes the metrics that can be configured for the Remote System Probe (rsp) probe.
 
QoS Metrics
The following table describes all the QoS metrics generated by the rsp probe.
CPU
Monitor Name
Units
Description
Version
QOS_CPU_MULTI_USAGE
Percent
Difference between highest and lowest CPU usage on Multi-CPU systems. The metric calculates data in percentage for the individual CPU idle time, user time, system time, wait time, and CPU usage.
2.8
QOS_CPU_USAGE
Percent
CPU usage. The metric calculates data in percentage for total usage, user time, system time, wait time, and idle time.
2.8
Disk
Monitor Name
Units
Description
Version
QOS_DISK_USAGE
Megabytes
Aggregated disk I/O rate in megabytes.
2.8
QOS_DISK_USAGE_PERC
Percent
Disk Usage (%)
2.8
If the rsp probe 5.10 or later is migrated using threshold_migrator version 2.01 or later, the threshold values for the Disk alarms are altered. The probe will now send the used space in the QoS value, instead of the free space. For example:
  • QOS_DISK_USAGE (MB): If the threshold is set as 1000, and the disk size of the host is 5000, the threshold value sent will be 4000 (5000-1000). 
  • QOS_DISK_USAGE_PERC: If the threshold is set as 40%, the threshold value sent will be 60 (100-40).
If during migration, the threshold_migrator probe is unable to read the disk size of the host, the Disk QoS will not be migrated .
Load
Monitor Name
Units
Description
Version
QOS_PROC_QUEUE_LEN
Processes
Processor Queue Length. The current calculated average load of the system.
2.8
Memory
Monitor Name
Units
Description
Version
QOS_MEMORY_PAGING
Kilobytes/Second
Memory paging in kilobytes per second.
2.8
QOS_MEMORY_PAGING_PGPS
Pages/Second
Memory paging in pages per second.
2.8
QOS_MEMORY_PERC_USAGE
Percent
Total memory usage in percent.
2.8
QOS_MEMORY_PHYSICAL
Megabytes
The size of the physical memory available in the system in megabytes.
2.8
QOS_MEMORY_SWAP
Megabytes
Swap memory usage.
2.8
QOS_MEMORY_USAGE
Megabytes
Total memory usage in megabytes.
2.8
QOS_SWAP_MEMORY_PERC_USAGE
Percent
Swap memory usage in percent.
5.1
QOS_PHYSICAL_MEMORY_PERC_USAGE
Percent
Physical memory usage in percent.
5.1
Ntevents
Monitor Name
Units
Description
Version
QOS_NTEVENT_COUNT
Count
Number of events in interval. This metric sends the count of matched events, irrespective of the profile.
2.8
Processes
Monitor Name
Units
Description
Version
QOS_PROC_QUEUE_LEN
Processes
Processor queue length. The current calculated average load of the system.
2.8
QOS_PROCESS_CPU_USAGE
Percent
Process CPU usage.
2.8
QOS_PROCESS_INSTANCE
Number
Number of process instances.
2.8
QOS_PROCESS_STATE
Running
Process availability.
2.8
QOS_PROCESS_THREADS
Number
Number of process threads.
2.8
QOS_PROCESS_MEMORY_USAGE
Kilobytes
Process memory usage.
2.8
Services
Monitor Name
Units
Description
Version
QOS_NTSERVICE_STATE
State
NT Service Availability
2.8
WMI_Object
Monitor Name
Units
Description
Version
QOS_WMI_OBJECT_VAL
Variant
Value of WMI object.
4.0
 
 
Individual CPU metrics (User, System, Wait, and Idle) are not supported on Windows platform. These metrics would not return a value even if they are selected during configuration.
Alert Metrics Default Settings
The following table contains the alert metrics and default settings for the rsp probe.
Alarm Metric
Warning Threshold
Warning Severity
Error Threshold
Error Severity
Description
Generic
connection error
-
-
-
Major
Alarm to be issued when connection to host failed or login refused.
connection timeout
-
-
-
Major
Alarm to be issued when connection to host timed out.
data collection failure
-
-
-
Major
Alarm to be issued when data collection failure for checkpoint on host happened.
database integrity
-
-
-
Major
Alarm to be issued when database integrity problem occurred with host.
duplicate data
-
-
-
Major
Alarm to be issued when possibly duplicated data in QoS series in the database data collection failure occurred for checkpoint as CDM probe is also running on the same host.
duplicate series
-
-
-
Minor
Alarm to be issued when possibly duplicated data series in the QoS database as CDM probe is also running on the same host.
lua load error
-
-
-
Major
Alarm to be issued when error in loading file occurred.
wmi class missing
-
-
-
Major
Alarm to be issued when WMI class missing on host.
CPU Usage and Thresholds
Total CPU Usage (%)
50
Minor
80
Major
Alarm to be issued when total CPU Usage is above the configured thresholds.
Total CPU Data not found
-
-
-
Major
Alarm to be issued when CPU data not found.
Single CPU Usage (%)
50
Minor
80
Major
Alarm to be issued when single CPU usages is above the configured thresholds.
Single CPU Data not found
-
-
-
Major
Alarm to be issued when cpuid data not found.
Disk Usage and Thresholds
Disk usage (%) free
20
Minor
10
Major
Alarm to be issued when Disk usage (%) free is below the configured thresholds.
Disk usage (Mb) free
20
Minor
10
Major
Alarm to be issued when Disk usage (Mb) free is below the configured thresholds.
Disk File System data unavailable
-
-
-
Major
Alarm to be issued when no data for file system currently available.
Memory Usage and Thresholds
Physical Memory usage
70
Minor
80
Major
Alarm to be issued when Physical Memory is above the configured thresholds.
Swap Memory usage
70
Minor
80
Major
Alarm to be issued when Swap Memory is above the configured thresholds.
Total Memory usage
50
Minor
80
Major
Alarm to be issued when Total Memory is above the configured thresholds.
Memory Data unavailable
-
-
-
Major
Alarm to be issued when no Memory type data currently available.
Memory Paging and Thresholds
Memory Paging
150
Warning
500
Major
Alarm to be issued when Memory Paging is above the configured thresholds.
Memory Paging data unavailable
-
-
-
Major
Alarm to be issued when Paging data is not currently available.
Processor Queue Length and Thresholds
Processor Queue Length
5
Warning
10
Major
Alarm to be issued when Load is above the configured thresholds.
Load Data unavailable
-
-
-
Major
Alarm to be issued when Load data is currently available.
Processes and Threshold
Process Owner
-
-
Authority/ System
Minor
Alarm to be issued when the configured threshold is satisfied.
CPU usage
-
-
90
Minor
Alarm to be issued when the CPU usage satisfied the configured threshold.
Process size
-
-
-
Minor
Alarm to be issued when Process size satisfied the configured threshold.
Thread Count
-
-
-
Minor
Alarm to be issued when Thread Count satisfied the configured threshold.
Instances
-
-
-
Minor
Alarm to be issued when instances satisfied the configured threshold.
Process when up
-
-
-
Minor
Alarm to be issued when Process is running.
Process when down
-
-
-
Minor
Alarm to be issued when Process is not running.
Events and Threshold
NTEvents Count
-
-
-
from eventlog
Alarm to be issued when event generates and configured parameter matches.
Services and Threshold
Windows Services State
-
-
-
Major
Alarm to be issued when Expected state is not same as service_state.
WMI
WMI Counter Thresholds
 
 
 
 
Alarm to be issued when value of WMI object satisfied the configured threshold.