aws Metrics

This article describes the metrics that you can configure using the Amazon Web Services Monitoring (aws) probe.
Metrics for CABI for UIM Dashboards
The following metrics must be enabled for the predefined AWS CABI for UIM Dashboards. These metrics are automatically enabled if the probe is configured with the default template.
QoS Monitors:
  • QOS_AWS_AUTO_SCALING_CPU_UTILIZATION
  • QOS_AWS_AUTO_SCALING_STATUSCHECK
  • QOS_AWS_BUCKET_SIZE
  • QOS_AWS_CPU_UTILIZATION
  • QOS_AWS_DISK_READ_BYTES
  • QOS_AWS_DISK_WRITE_BYTES
  • QOS_AWS_ELASTICACHE_CPU_UTILIZATION
  • QOS_AWS_ELASTICACHE_FREEABLE_MEMORY
  • QOS_AWS_ELASTICACHE_SWAP_USAGE
  • QOS_AWS_ELB_HEALTHY_HOST_COUNT
  • QOS_AWS_ELB_LATENCY
  • QOS_AWS_ELB_REQUEST_COUNT
  • QOS_AWS_FILE_READ_TIME
  • QOS_AWS_FILE_WRITE_TIME
  • QOS_AWS_NETWORK_IN
  • QOS_AWS_NETWORK_OUT
  • QOS_AWS_NUMBER_OF_OBJECTS
  • QOS_AWS_RDS_CPU_UTILIZATION
  • QOS_AWS_RDS_FREE_STORAGE_SPACE
  • QOS_AWS_RDS_SWAP_USAGE
  • QOS_AWS_SNS_NUMBER_OF_MESSAGES_PUBLISHED
  • QOS_AWS_SNS_NUMBER_OF_NOTIFICATION_DELIVERED
  • QOS_AWS_SNS_NUMBER_OF_NOTIFICATION_FAILED
  • QOS_AWS_SNS_PUBLISHED_SIZE
  • QOS_AWS_SQS_APPROXIMATE_NUMBER_OF_MESSAGES_DELAYED
  • QOS_AWS_SQS_APPROXIMATE_NUMBER_OF_MESSAGES_NOT_VISIBLE
  • QOS_AWS_SQS_APPROXIMATE_NUMBER_OF_MESSAGES_VISIBLE
  • QOS_AWS_SQS_NUMBER_OF_EMPTY_RECEIVES
  • QOS_AWS_SQS_NUMBER_OF_MESSAGES_DELETED
  • QOS_AWS_SQS_NUMBER_OF_MESSAGES_RECEIVED
  • QOS_AWS_SQS_NUMBER_OF_MESSAGES_SENT
  • QOS_AWS_SQS_SENT_MESSAGE_SIZE
  • QOS_AWS_VOLUME_QUEUE_LENGTH
  • QOS_AWS_VOLUME_READ_OPS
  • QOS_AWS_VOLUME_THROUGHPUT_PERCENTAGE
  • QOS_AWS_VOLUME_WRITE_OPS
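As an illustration, the following sketch compares a set of enabled QoS monitors against part of the list above and reports which required monitors are missing. The helper name `missing_monitors` and the way the enabled set is obtained are assumptions made for this example; the probe does not provide such a function, and only a subset of the list is shown for brevity.

```python
# Hypothetical helper: not part of the aws probe.
# Subset of the required monitors listed above; extend it with the full list as needed.
REQUIRED_MONITORS = frozenset({
    "QOS_AWS_CPU_UTILIZATION",
    "QOS_AWS_DISK_READ_BYTES",
    "QOS_AWS_DISK_WRITE_BYTES",
    "QOS_AWS_NETWORK_IN",
    "QOS_AWS_NETWORK_OUT",
})


def missing_monitors(enabled):
    """Return the required monitors that are absent from `enabled`, sorted by name."""
    return sorted(REQUIRED_MONITORS - set(enabled))


if __name__ == "__main__":
    # Example input: an assumed list of monitors that are currently enabled.
    enabled = ["QOS_AWS_CPU_UTILIZATION", "QOS_AWS_NETWORK_IN"]
    for name in missing_monitors(enabled):
        print("not enabled:", name)
```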
QoS data for the AWS Geographical Health
The names of the monitors are the same as the names of the AWS services in the region.
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_HEALTH
AWS Service Name
State
This metric is the availability status of individual AWS services in each AWS geographical region. You can set up the threshold for status using a numeric value between 0 and 1. Each number is assigned a status value, as follows:
  • 0: The system is unavailable. You can check the alarm for more information.
  • 1: The system is available
3.0
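The value-to-status mapping for QOS_AWS_HEALTH can be sketched as a small lookup. The function name `describe_health` and the "unknown" fallback for values outside the documented range are assumptions for this example.

```python
# Value-to-status mapping taken from the QOS_AWS_HEALTH description.
HEALTH_STATES = {0: "unavailable", 1: "available"}


def describe_health(value):
    """Translate a QOS_AWS_HEALTH sample (0 or 1) into a readable status."""
    # Values outside the documented range are reported as "unknown" (assumption).
    return HEALTH_STATES.get(value, "unknown")


if __name__ == "__main__":
    for sample in (0, 1, 2):
        print(sample, describe_health(sample))
```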
QoS data for the AWS Service Instances
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_AUTO_SCALING_TOTAL_INSTANCES
Total Auto Scale Instances
Count
This metric is the total number of Auto Scaling instances.
5.25
QOS_AWS_DYNAMODB_TOTAL_INSTANCES
Total DynamoDB Instances
Count
This metric is the total number of DynamoDB instances.
5.25
QOS_AWS_EBS_TOTAL_INSTANCES
Total EBS Instances
Count
This metric is the total number of EBS instances.
5.25
QOS_AWS_EC2_TOTAL_ACTIVE_INSTANCES
Total Active EC2 Instances
Count
This metric is the number of EC2 instances that are currently active.
5.25
QOS_AWS_ECS_TOTAL_INSTANCES
Total ECS Instances
Count
This metric is the total number of ECS instances.
5.25
QOS_AWS_ELASTICACHE_TOTAL_INSTANCES
Total ElastiCache Instances
Count
This metric is the total number of ElastiCache instances.
5.25
QOS_AWS_ELB_TOTAL_INSTANCES
Total ELB Instances
Count
This metric is the total number of ELB instances.
5.25
QOS_AWS_RDS_TOTAL_INSTANCES
Total RDS Instances
Count
This metric is the total number of RDS instances.
5.25
QOS_AWS_ROUTE53_TOTAL_INSTANCES
Total Route53 Instances
Count
This metric is the total number of Route53 instances.
5.25
QOS_AWS_S3_TOTAL_INSTANCES
Total S3 Instances
Count
This metric is the total number of S3 instances.
5.25
QOS_AWS_SNS_TOTAL_INSTANCES
Total SNS Instances
Count
This metric is the total number of SNS instances.
5.25
QOS_AWS_SQS_TOTAL_INSTANCES
Total SQS Instances
Count
This metric is the total number of SQS instances.
5.25
QOS_AWS_VPC_TOTAL_INSTANCES
Total VPC Instances
Count
This metric is the total number of VPC instances.
5.25
QoS data for the AWS Auto Scaling service
Service Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_AUTO_SCALING_CPU_UTILIZATION
CPUUtilization
Percent
This metric is the percentage of allocated EC2 compute units that are currently in use on the instance. You can use the information to identify the processing power required to execute an application on the selected instance.
3.5
QOS_AWS_AUTO_SCALING_DISK_READ_OPS
DiskReadOps
Count
This metric is the number of completed read operations from all ephemeral disks available to the instance during the specified time period. The metric can be used to determine the rate at which an application reads data from a disk.
3.5
QOS_AWS_AUTO_SCALING_DISK_WRITE_OPS
DiskWriteOps
Count
This metric is the number of completed write operations to all ephemeral disks available to the instance during the specified time period. The metric can be used to determine the rate at which an application writes data to a disk.
3.5
QOS_AWS_AUTO_SCALING_DISK_READ_BYTES
DiskReadBytes
Bytes
This metric is the number of bytes read from all ephemeral disks available to the instance during the specified time period. The metric can be used to determine the volume of data that the application reads from the disk of the instance.
3.5
QOS_AWS_AUTO_SCALING_DISK_WRITE_BYTES
DiskWriteBytes
Bytes
This metric is the number of bytes written to all ephemeral disks available to the instance during the specified time period. The metric can be used to determine the volume of data that the application writes to the disk of the instance.
3.5
QOS_AWS_AUTO_SCALING_NETWORK_IN
NetworkIn
Bytes
This metric is the number of bytes received on all network interfaces by the instance during the specified time period. The metric identifies the volume of incoming network traffic to an application on the instance.
3.5
QOS_AWS_AUTO_SCALING_NETWORK_OUT
NetworkOut
Bytes
This metric is the number of bytes sent out on all network interfaces by the instance during the specified time period. The metric identifies the volume of outgoing network traffic from an application on the instance.
3.5
QOS_AWS_AUTO_SCALING_STATUSCHECK
StatusCheckFailed
State
This metric is the combination of StatusCheckFailed_Instance and StatusCheckFailed_System that reports if either of the status checks has failed. You can set up the threshold for status using a numeric value between 0 and 1. Each number is assigned a status value, as follows:
  • 0: The status check has passed and both the instance and the system are available
  • 1: The status check has failed and the instance, the system, or both are unavailable
3.5
QOS_AWS_AUTO_SCALING_STATUSCHECK_INSTANCE
StatusCheckFailed_Instance
State
This metric reports whether the instance has passed the EC2 instance status check in the last minute. You can set up the threshold for status using a numeric value between 0 and 1. Each number is assigned a status value, as follows:
  • 0: The status check has passed and the instance is available
  • 1: The status check has failed and the instance is unavailable
3.5
QOS_AWS_AUTO_SCALING_STATUSCHECK_SYSTEM
StatusCheckFailed_System
State
This metric reports whether the instance has passed the EC2 system status check in the last minute. You can set up the threshold for status using a numeric value between 0 and 1. Each number is assigned a status value, as follows:
  • 0: The status check has passed and the system is available
  • 1: The status check has failed and the system is unavailable
3.5
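The relationship between the three status check metrics can be sketched as a logical OR: the combined StatusCheckFailed value is 1 when either the instance check or the system check reports a failure. The function name below is an assumption for this example and is not part of the probe.

```python
def status_check_failed(instance_failed, system_failed):
    """Combine StatusCheckFailed_Instance and StatusCheckFailed_System.

    Each argument is 0 (check passed) or 1 (check failed).
    Returns 1 if either check failed, otherwise 0.
    """
    return 1 if (instance_failed or system_failed) else 0


if __name__ == "__main__":
    for instance in (0, 1):
        for system in (0, 1):
            print(instance, system, "->", status_check_failed(instance, system))
```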
Group Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_AUTO_SCALING_GROUP_MINIMUM_SIZE
Minimum Size
Count
This metric is the minimum size of the Auto Scaling group during the specified time period.
5.1
QOS_AWS_AUTO_SCALING_GROUP_MAXIMUM_SIZE
Maximum Size
Count
This metric is the maximum size of the Auto Scaling group during the specified time period.
5.1
QOS_AWS_AUTO_SCALING_GROUP_DESIRED_CAPACITY
Desired Capacity
Count
This metric is the number of instances that the Auto Scaling group maintains.
5.1
QOS_AWS_AUTO_SCALING_GROUP_IN_SERVICE_INSTANCES
In Service Instances
Count
This metric is the number of instances that are currently executing in the Auto Scaling group. The metric does not include instances that are in the pending or terminating state.
5.1
QOS_AWS_AUTO_SCALING_GROUP_PENDING_INSTANCES
Pending Instances
Count
This metric is the number of instances that are currently pending in the Auto Scaling group. Amazon defines a pending instance as not yet in service.
5.1
QOS_AWS_AUTO_SCALING_GROUP_STANDBY_INSTANCES
Standby Instances
Count
This metric is the number of instances that are in the Standby state in the Auto Scaling group. Amazon defines an instance as standby when the instance is executing, but is not actively in service.
5.1
QOS_AWS_AUTO_SCALING_GROUP_TERMINATING_INSTANCES
Terminating Instances
Count
This metric is the number of instances that are currently terminating in the Auto Scaling group. The metric does not include instances that are in service or pending.
5.1
QOS_AWS_AUTO_SCALING_GROUP_TOTAL_INSTANCES
Total Instances
Count
This metric is the total number of instances that are in service, pending, or terminating in the Auto Scaling group.
5.1
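Based on the descriptions of the group metrics, the total can be sketched as the sum of the in-service, pending, and terminating counts. The `GroupCounts` container is a hypothetical structure introduced for this example only.

```python
from dataclasses import dataclass


@dataclass
class GroupCounts:
    """Instance counts for one Auto Scaling group (illustrative container)."""
    in_service: int
    pending: int
    terminating: int

    def total(self):
        # Total Instances = in service + pending + terminating
        return self.in_service + self.pending + self.terminating


if __name__ == "__main__":
    counts = GroupCounts(in_service=4, pending=1, terminating=2)
    print("total instances:", counts.total())
```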
QoS data for the AWS Billing Details
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_BILLING_ESTIMATED_CHARGE
EstimatedCharges
USD
This metric is the estimated billing charges of the service since the last billing cycle, in US Dollars.
5.0
QoS data for the AWS DynamoDB Service
Index Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_DYNAMODB_INDEX_CONSUMED_READ_CAPACITY_UNITS
Consumed Read Capacity Units
Count
This metric is the number of read capacity units of index consumed over the specified time period.
5.0
QOS_AWS_DYNAMODB_INDEX_CONSUMED_WRITE_CAPACITY_UNITS
Consumed Write Capacity Units
Count
This metric is the number of write capacity units of index consumed over the specified time period.
5.0
QOS_AWS_DYNAMODB_INDEX_ONLINE_INDEX_CONSUMED_WRITE_CAPACITY
Online Index Consumed Write Capacity
Count
This metric is the number of write capacity units of index consumed when adding a new global secondary index to a table.
5.0
QOS_AWS_DYNAMODB_INDEX_ONLINE_INDEX_PERCENTAGE_PROGRESS
Online Index Percentage Progress
Count
This metric is the percentage of completion when a new global secondary index is being added to a table.
5.0
QOS_AWS_DYNAMODB_INDEX_ONLINE_INDEX_THROTTLE_EVENTS
Online Index Throttle Events
Count
This metric is the number of write throttle events that occur when adding a new global secondary index to a table.
5.0
QOS_AWS_DYNAMODB_INDEX_PROVISIONED_READ_CAPACITY_UNITS
Provisioned Read Capacity Units
Count
This metric is the number of provisioned read capacity units for a table or a global secondary index.
5.0
QOS_AWS_DYNAMODB_INDEX_PROVISIONED_WRITE_CAPACITY_UNITS
Provisioned Write Capacity Units
Count
This metric is the number of provisioned write capacity units for a table or a global secondary index.
5.0
QOS_AWS_DYNAMODB_INDEX_READ_THROTTLE_EVENTS
Read Throttle Events
Count
This metric is the number of read events that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_INDEX_WRITE_THROTTLE_EVENTS
Write Throttle Events
Count
This metric is the number of write events that exceeded the preset provisioned throughput limits in the specified time period.
5.0
Table Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_DYNAMODB_CONDITIONAL_CHECK_FAILED_REQUESTS
Conditional Check Failed Requests
Count
This metric is the number of failed attempts to perform conditional writes.
5.0
QOS_AWS_DYNAMODB_CONSUMED_READ_CAPACITY_UNITS
Consumed Read Capacity Units
Count
This metric is the number of read capacity units consumed over the specified time period.
5.0
QOS_AWS_DYNAMODB_CONSUMED_WRITE_CAPACITY_UNITS
Consumed Write Capacity Units
Count
This metric is the number of write capacity units consumed over the specified time period.
5.0
QOS_AWS_DYNAMODB_ONLINE_INDEX_CONSUMED_WRITE_CAPACITY
Online Index Consumed Write Capacity
Count
This metric is the number of write capacity units consumed when adding a new global secondary index to a table.
5.0
QOS_AWS_DYNAMODB_ONLINE_INDEX_PERCENTAGE_PROGRESS
Online Index Percentage Progress
Count
This metric is the percentage of completion when a new global secondary index is being added to a table.
5.0
QOS_AWS_DYNAMODB_ONLINE_INDEX_THROTTLE_EVENTS
Online Index Throttle Events
Count
This metric is the number of write throttle events that occur when adding a new global secondary index to a table.
5.0
QOS_AWS_DYNAMODB_PROVISIONED_READ_CAPACITY_UNITS
Provisioned Read Capacity Units
Count
This metric is the number of provisioned read capacity units for a table or a global secondary index.
5.0
QOS_AWS_DYNAMODB_PROVISIONED_WRITE_CAPACITY_UNITS
Provisioned Write Capacity Units
Count
This metric is the number of provisioned write capacity units for a table or a global secondary index.
5.0
QOS_AWS_DYNAMODB_READ_THROTTLE_EVENTS
Read Throttle Events
Count
This metric is the number of read events that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_PUT_ITEM_SUCCESSFUL_REQUEST_LATENCY
Put Item Successful Request Latency
ms
This metric is the elapsed time for successful Put Item requests during the specified time period.
5.0
QOS_AWS_DYNAMODB_DELETE_ITEM_SUCCESSFUL_REQUEST_LATENCY
Delete Item Successful Request Latency
ms
This metric is the elapsed time for successful Delete Item requests during the specified time period.
5.0
QOS_AWS_DYNAMODB_UPDATE_ITEM_SUCCESSFUL_REQUEST_LATENCY
Update Item Successful Request Latency
ms
This metric is the elapsed time for successful Update Item requests during the specified time period.
5.0
QOS_AWS_DYNAMODB_GET_ITEM_SUCCESSFUL_REQUEST_LATENCY
Get Item Successful Request Latency
ms
This metric is the elapsed time for successful Get Item requests during the specified time period.
5.0
QOS_AWS_DYNAMODB_BATCH_GET_ITEM_SUCCESSFUL_REQUEST_LATENCY
BatchGet Item Successful Request Latency
ms
This metric is the elapsed time for successful Batch Get Item requests during the specified time period.
5.0
QOS_AWS_DYNAMODB_SCAN_SUCCESSFUL_REQUEST_LATENCY
Scan Successful Request Latency
ms
This metric is the elapsed time for successful Scan requests during the specified time period.
5.0
QOS_AWS_DYNAMODB_QUERY_SUCCESSFUL_REQUEST_LATENCY
Query Successful Request Latency
ms
This metric is the elapsed time for successful Query requests during the specified time period.
5.0
QOS_AWS_DYNAMODB_BATCH_WRITE_ITEM_SUCCESSFUL_REQUEST_LATENCY
BatchWrite Item Successful Request Latency
ms
This metric is the elapsed time for successful Batch Write Item requests during the specified time period.
5.0
QOS_AWS_DYNAMODB_GET_RECORDS_SUCCESSFUL_REQUEST_LATENCY
Get Records Successful Request Latency
ms
This metric is the elapsed time for successful Get Records requests during the specified time period.
5.0
QOS_AWS_DYNAMODB_SYSTEM_ERRORS
System Errors
Count
This metric is the number of requests generating a 500 status code (likely indicating a server error) response in the specified time period.
5.0
QOS_AWS_DYNAMODB_PUT_ITEM_THROTTLED_REQUESTS
Put Item Throttled Requests
Count
This metric is the number of user requests for PutItem operation that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_DELETE_ITEM_THROTTLED_REQUESTS
Delete Item Throttled Requests
Count
This metric is the number of user requests for DeleteItem operation that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_UPDATE_ITEM_THROTTLED_REQUESTS
Update Item Throttled Requests
Count
This metric is the number of user requests for UpdateItem operation that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_GET_ITEM_THROTTLED_REQUESTS
Get Item Throttled Requests
Count
This metric is the number of user requests for GetItem operation that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_BATCH_GET_ITEM_THROTTLED_REQUESTS
Batch Get Item Throttled Requests
Count
This metric is the number of user requests for BatchGetItem operation that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_SCAN_THROTTLED_REQUESTS
Scan Throttled Requests
Count
This metric is the number of user requests for Scan operation that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_QUERY_THROTTLED_REQUESTS
Query Throttled Requests
Count
This metric is the number of user requests for Query operation that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_BATCH_WRITE_ITEM_THROTTLED_REQUESTS
BatchWrite Item Throttled Requests
Count
This metric is the number of user requests for BatchWriteItem operation that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_GET_RECORDS_THROTTLED_REQUESTS
Get Records Throttled Requests
Count
This metric is the number of user requests for GetRecords operation that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_WRITE_THROTTLE_EVENTS
Write Throttle Events
Count
This metric is the number of write events that exceeded the preset provisioned throughput limits in the specified time period.
5.0
QOS_AWS_DYNAMODB_SCAN_RETURNED_ITEM_COUNT
Scan Returned Item Count
Count
This metric is the number of items returned by a Scan operation.
5.0
QOS_AWS_DYNAMODB_QUERY_RETURNED_ITEM_COUNT
Query Returned Item Count
Count
This metric is the number of items returned by a Query operation.
5.0
QoS data for the AWS ECS service
ECS Cluster Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_ECS_CLUSTER_CPU_UTILIZATION
CPUUtilization
Percent
This metric is the percentage of CPU that is used in the cluster.
5.0
QOS_AWS_ECS_CLUSTER_CPU_RESERVATION
CPUReservation
Percent
This metric is the percentage of CPU that is reserved by running tasks in the cluster.
5.0
QOS_AWS_ECS_CLUSTER_MEMORY_UTILIZATION
MemoryUtilization
Percent
This metric is the percentage of memory that is used in the cluster.
5.0
QOS_AWS_ECS_CLUSTER_MEMORY_RESERVATION
MemoryReservation
Percent
This metric is the percentage of memory that is reserved by running tasks in the cluster.
5.0
ECS Service Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_ECS_SERVICE_CPU_UTILIZATION
CPUUtilization
Percent
This metric is the percentage of CPU that is used in the service.
5.0
QOS_AWS_ECS_SERVICE_MEMORY_UTILIZATION
MemoryUtilization
Percent
This metric is the percentage of memory that is used in the service.
5.0
QoS data for the AWS EBS service
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_VOLUME_READ_BYTES
VolumeReadBytes
Bytes
This metric is the number of bytes read in the time period specified in the Start Time field.
3.0
QOS_AWS_VOLUME_WRITE_BYTES
VolumeWriteBytes
Bytes
This metric is the number of bytes written in the time period specified in the Start Time field.
3.0
QOS_AWS_VOLUME_READ_OPS
VolumeReadOps
Count
This metric is the number of read operations in the time period specified in the Start Time field.
3.0
QOS_AWS_VOLUME_WRITE_OPS
VolumeWriteOps
Count
This metric is the number of write operations in the time period specified in the Start Time field.
3.0
QOS_AWS_VOLUME_TOTAL_READ_TIME
VolumeTotalReadTime
Seconds
This metric is the number of seconds spent by all read operations that completed in a specified period of time. If multiple requests are submitted at the same time, this total could be greater than the length of the period.
3.0
QOS_AWS_VOLUME_TOTAL_WRITE_TIME
VolumeTotalWriteTime
Seconds
This metric is the number of seconds spent by all write operations that completed in a specified period of time. If multiple requests are submitted at the same time, this total could be greater than the length of the period.
3.0
QOS_AWS_VOLUME_IDLE_TIME
VolumeIdleTime
Seconds
This metric is the number of seconds in the time period specified in the Start Time field when no read or write operations were submitted.
3.0
QOS_AWS_VOLUME_QUEUE_LENGTH
VolumeQueueLength
Count
This metric is the number of read and write operation requests waiting to be completed in the time period specified in the Start Time field.
3.0
QOS_AWS_VOLUME_THROUGHPUT_PERCENTAGE
VolumeThroughputPercentage
Percent
This metric is the percentage of I/O operations per second (IOPS) delivered out of the total IOPS provisioned for an Amazon EBS volume. This metric is used with Provisioned IOPS (SSD) volumes only. Provisioned IOPS (SSD) volumes deliver within 10 percent of the provisioned IOPS performance 99.9 percent of the time over a given year.
3.0
QOS_AWS_VOLUME_CONSUMED_READ_WRITE_OPS
VolumeConsumedReadWriteOps
Count
This metric is the total number of read and write operations (normalized to 16K capacity units) consumed in the time period specified in the Start Time field. This metric is used with Provisioned IOPS (SSD) volumes only.
3.0
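The arithmetic behind VolumeThroughputPercentage can be sketched as delivered IOPS divided by provisioned IOPS. The function names, the sample figures, and the reading of "within 10 percent" as "at least 90 percent of the provisioned IOPS" are assumptions made for this example.

```python
def throughput_percentage(delivered_iops, provisioned_iops):
    """Percentage of the provisioned IOPS that was actually delivered."""
    if provisioned_iops <= 0:
        raise ValueError("provisioned_iops must be positive")
    return 100.0 * delivered_iops / provisioned_iops


def within_expected_range(percentage, tolerance=10.0):
    """True if delivery is no more than `tolerance` percent below the provisioned IOPS."""
    return percentage >= 100.0 - tolerance


if __name__ == "__main__":
    # Illustrative numbers only: 2850 IOPS delivered on a 3000 IOPS volume.
    pct = throughput_percentage(2850, 3000)
    print(pct, within_expected_range(pct))
```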
QoS data for the AWS EC2 service
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_CPU_CREDIT_USAGE
CPUCreditUsage
Count
This metric is the number of CPU credits consumed by the instance. You can use the information to evaluate the performance of your EC2 instance.
5.34
QOS_AWS_CPU_CREDIT_BALANCE
CPUCreditBalance
Count
This metric is the number of CPU credits available for the instance to burst beyond its base CPU utilization. You can use the information to evaluate whether the performance of your EC2 instance is being affected by the available CPU credits.
5.34
QOS_AWS_CPU_UTILIZATION
CPUUtilization
Percent
This metric is the percentage of allocated EC2 compute units that are currently in use on the instance. You can use the information to identify the processing power required to execute an application on the selected instance.
2.0
QOS_AWS_DISK_WRITE_BYTES
DiskWriteBytes
Bytes
This metric is the number of bytes written to all ephemeral disks available to the instance. You can use the information to determine the volume of data that the application writes to the hard disk of the instance.
2.0
QOS_AWS_DISK_READ_BYTES
DiskReadBytes
Bytes
This metric is the number of bytes read from all ephemeral disks available to the instance. You can use the information to determine the volume of data that the application reads from the hard disk of the instance.
2.0
QOS_AWS_DISK_READ_OPS
DiskReadOps
Count
This metric is the number of completed read operations from all ephemeral disks available to the instance. You can use the information to identify the rate at which an application reads from a disk.
2.0
QOS_AWS_DISK_WRITE_OPS
DiskWriteOps
Count
This metric is the number of completed write operations to all ephemeral disks available to the instance. You can use the information to identify the rate at which an application writes to a disk.
2.0
QOS_AWS_NETWORK_IN
NetworkIn
Bytes
This metric is the number of bytes received on all network interfaces by the instance. You can use the information to identify the volume of incoming network traffic to an application on the instance.
2.0
QOS_AWS_NETWORK_OUT
NetworkOut
Bytes
This metric is the number of bytes sent on all network interfaces by the instance. You can use the information to identify the volume of outgoing network traffic from an application on the instance.
2.0
QOS_AWS_INSTANCE_STATE
Instance State
Count
This metric is the operational status of the EC2 instance. You can set up the threshold for status using a numeric value between 0 and 2. Each number is assigned a status value, as follows:
  • 0: Instance is executing
  • 1: User has stopped the instance
  • 2: Instance has crashed
    Note: The probe calculates the crashed state using the instance state reason metric on AWS CloudWatch.
4.1
QOS_AWS_INSTANCE_POWER_STATE
PowerState
State
This metric is the current power state of the EC2 instance. You can set up the threshold for status using a numeric value between 0 and 1. Each number is assigned a status value, as follows:
  • 0: Powered Off
  • 1: Powered On
5.0
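The three values documented for QOS_AWS_INSTANCE_STATE can be represented as an enumeration when post-processing samples. The class name, member names, and the `needs_attention` helper are hypothetical and exist only in this sketch.

```python
from enum import IntEnum


class InstanceState(IntEnum):
    """Values documented for QOS_AWS_INSTANCE_STATE."""
    EXECUTING = 0
    STOPPED_BY_USER = 1
    CRASHED = 2


def needs_attention(value):
    """Return True when the sample indicates a crashed instance.

    Raises ValueError for values outside the documented range 0 to 2.
    """
    return InstanceState(value) is InstanceState.CRASHED


if __name__ == "__main__":
    for sample in (0, 1, 2):
        print(sample, InstanceState(sample).name, needs_attention(sample))
```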
QoS data for the AWS ELB service
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_ELB_HEALTHY_HOST_COUNT
HealthyHostCount
Count
This metric is the number of healthy instances in each Availability Zone. Hosts are declared healthy if they meet the threshold for the number of consecutive health checks that are successful. Hosts that have failed more health checks than the value of the unhealthy threshold are considered unhealthy.
3.5
QOS_AWS_ELB_UNHEALTHY_HOST_COUNT
UnHealthyHostCount
Count
This metric is the number of unhealthy instances in each Availability Zone. Hosts that have failed more health checks than the value of the unhealthy threshold are considered unhealthy. Instances may become unhealthy due to connectivity issues.
3.5
QOS_AWS_ELB_REQUEST_COUNT
RequestCount
Count
This metric is the number of completed requests that were received and routed to the back-end instances.
3.5
QOS_AWS_ELB_LATENCY
Latency
Seconds
This metric is the time elapsed after the request leaves the load balancer until the response is received.
3.5
QOS_AWS_ELB_HTTP_CODE_ELB_4XX
HTTPCode_ELB_4XX
Count
This metric is the number of HTTP 4XX client error codes generated by the load balancer when the listener is configured to use HTTP or HTTPS protocols. Client errors are generated when a request is malformed or is incomplete.
3.5
QOS_AWS_ELB_HTTP_CODE_ELB_5XX
HTTPCode_ELB_5XX
Count
This metric is the number of HTTP 5XX server error codes generated by the load balancer when the listener is configured to use HTTP or HTTPS protocols. The metric is reported if there are no back-end instances that are healthy or registered to the load balancer.
3.5
QOS_AWS_ELB_HTTP_CODE_BACKEND_2XX
HTTPCode_Backend_2XX
Count
This metric is the number of HTTP response codes generated by back-end instances. The 2XX class status codes represent successful actions. This metric does not include any response codes generated by the load balancer.
3.5
QOS_AWS_ELB_HTTP_CODE_BACKEND_3XX
HTTPCode_Backend_3XX
Count
This metric is the number of HTTP response codes generated by back-end instances. The 3XX class status code indicates that the user agent requires action. This metric does not include any response codes generated by the load balancer.
3.5
QOS_AWS_ELB_HTTP_CODE_BACKEND_4XX
HTTPCode_Backend_4XX
Count
This metric is the number of HTTP response codes generated by back-end instances. The 4XX class status code represents client errors. This metric does not include any response codes generated by the load balancer.
3.5
QOS_AWS_ELB_HTTP_CODE_BACKEND_5XX
HTTPCode_Backend_5XX
Count
This metric is the number of HTTP response codes generated by back-end instances. The 5XX class status code represents back-end server errors. This metric does not include any response codes generated by the load balancer.
3.5
QOS_AWS_ELB_BACKEND_CONNECTION_ERRORS
BackendConnectionErrors
Count
This metric is the number of connections that were not successfully established between the load balancer and the registered instances. Because the load balancer retries when there are connection errors, this count can exceed the request rate.
3.5
QOS_AWS_ELB_SURGE_QUEUE_LENGTH
SurgeQueueLength
Count
This metric is the number of requests that are pending submission to a registered instance.
3.5
QOS_AWS_ELB_SPILL_OVER_COUNT
SpilloverCount
Count
This metric is the number of requests that were rejected due to the queue being full.
3.5
QoS data for the AWS ElastiCache service
Host Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_ELASTICACHE_CPU_UTILIZATION
CPUUtilization
Percent
This metric is the percentage of CPU utilization.
3.0
QOS_AWS_ELASTICACHE_FREEABLE_MEMORY
FreeableMemory
Bytes
This metric is the amount of free memory available on the host.
3.0
QOS_AWS_ELASTICACHE_NETWORK_IN
NetworkBytesIn
Bytes
This metric is the number of bytes the host has read from the network.
5.0
QOS_AWS_ELASTICACHE_NETWORK_OUT
NetworkBytesOut
Bytes
This metric is the number of bytes the host has written to the network.
5.0
QOS_AWS_ELASTICACHE_SWAP_USAGE
SwapUsage
Bytes
This metric is the amount of swap used on the host.
5.0
Memcached Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_ELASTICACHE_MEMCACHED_CURRENT_ITEMS
CurrItems
Count
This metric is the number of items currently stored in the cache.
3.0
QOS_AWS_ELASTICACHE_MEMCACHED_EVICTIONS
Evictions
Count
This metric is the number of non-expired items the cache evicted to allow space for new writes.
3.0
QOS_AWS_ELASTICACHE_MEMCACHED_RECLAIMED
Reclaimed
Count
This metric is the number of expired items the cache evicted to allow space for new writes.
3.0
QOS_AWS_ELASTICACHE_MEMCACHED_GET_HITS
GetHits
Count
This metric is the number of get requests the cache has received where the key requested was found.
3.0
QOS_AWS_ELASTICACHE_MEMCACHED_GET_MISSES
GetMisses
Count
This metric is the number of get requests the cache has received where the key requested was not found.
3.0
QOS_AWS_ELASTICACHE_MEMCACHED_BYTES_USED_FOR_CACHE_ITEMS
BytesUsedForCacheItems
Bytes
This metric is the number of bytes used to store cache items.
3.0
QOS_AWS_ELASTICACHE_MEMCACHED_CURRENT_CONNECTIONS
CurrConnections
Count
This metric is the number of current connections connected to the cache.
5.0
QOS_AWS_ELASTICACHE_MEMCACHED_UNUSED_MEMORY
UnusedMemory
Bytes
This metric is the amount of unused memory the cache can use to store items. This is derived from the memcached statistics limit_maxbytes and bytes by subtracting bytes from limit_maxbytes.
3.0
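The derivation of UnusedMemory from the memcached statistics can be sketched as a subtraction. The second function, a cache hit ratio computed from GetHits and GetMisses, is a derived value that this article does not define and is included only as an illustration; both function names are assumptions.

```python
def unused_memory(limit_maxbytes, bytes_used):
    """UnusedMemory = limit_maxbytes - bytes, both taken from the memcached statistics."""
    return limit_maxbytes - bytes_used


def hit_ratio(get_hits, get_misses):
    """Fraction of get requests that found their key (illustrative, not a probe metric)."""
    total = get_hits + get_misses
    if total == 0:
        return 0.0  # no get requests in the period (assumed convention)
    return get_hits / total


if __name__ == "__main__":
    mib = 1024 * 1024
    print("unused bytes:", unused_memory(64 * mib, 16 * mib))
    print("hit ratio:", hit_ratio(90, 10))
```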
Redis Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_ELASTICACHE_REDIS_CURRENT_CONNECTIONS
CurrConnections
Count
This metric is the number of client connections, excluding connections from read replicas.
3.0
QOS_AWS_ELASTICACHE_REDIS_BYTES_USED_FOR_CACHE
BytesUsedForCache
Bytes
This metric is the number of bytes allocated by Redis.
3.0
QOS_AWS_ELASTICACHE_REDIS_GET_HITS
CacheHits
Count
This metric is the number of successful key lookups.
5.0
QOS_AWS_ELASTICACHE_REDIS_GET_MISSES
CacheMisses
Count
This metric is the number of unsuccessful key lookups.
5.0
QOS_AWS_ELASTICACHE_REDIS_EVICTIONS
Evictions
Count
This metric is the number of keys that have been evicted due to the maxmemory limit.
5.0
QOS_AWS_ELASTICACHE_REDIS_HYPERLOGLOG_BASEDCMDS
HyperLogLogBasedCmds
Count
This metric is the total number of HyperLogLog based commands.
5.0
QOS_AWS_ELASTICACHE_REDIS_NEWCONNECTIONS
NewConnections
Count
This metric is the total number of connections that have been accepted by the server during this period.
5.0
QOS_AWS_ELASTICACHE_REDIS_RECLAIMED
Reclaimed
Count
This metric is the total number of key expiration events.
5.0
QOS_AWS_ELASTICACHE_REDIS_REPLICATIONBYTES
ReplicationBytes
Bytes
This metric is the number of bytes that the primary is sending to all of its replicas.
5.0
QOS_AWS_ELASTICACHE_REDIS_REPLICATIONLAG
ReplicationLag
Seconds
This metric is the time lag, in seconds, between applying changes to the replica and the primary cache cluster.
5.0
QOS_AWS_ELASTICACHE_REDIS_CURRENT_ITEMS
CurrItems
Count
This metric is the number of items in the cache.
5.0
QoS data for the AWS Lambda service
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_LAMBDA_DEAD_LETTER_ERRORS
Dead Letter Errors
Count
This metric is the number of times when AWS Lambda is unable to write the failed event payload to the configured Dead Letter Queues.
5.25
QOS_AWS_LAMBDA_ERRORS
Errors
Count
This metric is the number of invocations that failed due to errors in the function (response code 4XX).
5.25
QOS_AWS_LAMBDA_EXECUTION_DURATION
Execution Duration
ms
This metric is the elapsed duration for the function code from initial execution as a result of an invocation to the end.
5.25
QOS_AWS_LAMBDA_INVOCATIONS
Invocations
Count
This metric is the number of times a function is invoked in response to an event or invocation API call.
5.25
QOS_AWS_LAMBDA_ITERATOR_AGE
Iterator Age
ms
This metric is the age of the last record for each batch of records processed by AWS Lambda.
5.25
QOS_AWS_LAMBDA_THROTTLES
Throttles
Count
This metric is the number of function invocation attempts that are throttled because the invocation rate exceeds the configured limit of concurrent requests (error code 429).
5.25
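As an illustration of how the Errors and Invocations metrics can be read together, the following minimal sketch computes an error rate for one sampling period. The function name and the sample values are hypothetical and are not part of the probe.

```python
def lambda_error_rate(errors: float, invocations: float) -> float:
    """Return the fraction of invocations that failed in a sampling period.

    Both arguments are assumed to be the Errors and Invocations counts
    collected for the same function over the same period.
    """
    if invocations <= 0:
        # No invocations in the period, so there is nothing to rate.
        return 0.0
    return errors / invocations


# Hypothetical sample: 5 failed invocations out of 200.
rate = lambda_error_rate(errors=5, invocations=200)
print(f"{rate:.1%}")  # 2.5%
```

A ratio is often easier to put a threshold on than the raw Errors count, because it stays comparable when the invocation volume changes.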
QoS data for the AWS RDS service
Host Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_DATABASE_CONNECTIONS
DatabaseConnections
Count
This metric is the number of database connections in use.
3.0
QOS_AWS_REPLICA_LAG
ReplicaLag
Seconds
This metric is the amount of time that a Read Replica DB instance lags behind the Source DB instance. Replica lag occurs due to slow execution of data manipulation queries and is seen in MySQL databases only.
3.0
CPU Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_RDS_CPU_UTILIZATION
CPUUtilization
Percent
This metric is the percentage of CPU utilization.
3.0
QOS_AWS_CPU_CREDIT_USAGE
CPUCreditUsage
Count
This metric is the number of CPU credits consumed by the instance. You can use the information to evaluate the performance of your RDS T2 instance.
5.34
QOS_AWS_CPU_CREDIT_BALANCE
CPUCreditBalance
Count
This metric is the number of CPU credits available for the instance to burst beyond its base CPU utilization. You can use the information to evaluate whether the performance of your RDS T2 instance is being affected by the available CPU credits.
5.34
Disk Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_RDS_BIN_LOG_DISK_USAGE
BinLogDiskUsage
Bytes
This metric is the amount of disk space occupied by binary logs on the master. Applies to MySQL read replicas.
3.0
QOS_AWS_RDS_DISK_QUEUE_DEPTH
DiskQueueDepth
Count
This metric is the number of outstanding I/O requests (reads and writes) waiting to access the disk.
3.0
QOS_AWS_RDS_FREE_STORAGE_SPACE
FreeStorageSpace
Bytes
This metric is the amount of available storage space.
3.0
QOS_AWS_RDS_READ_IOPS
ReadIOPS
Count/Second
This metric is the average number of disk read I/O operations per second.
3.0
QOS_AWS_RDS_WRITE_IOPS
WriteIOPS
Count/Second
This metric is the average number of disk write I/O operations per second.
3.0
QOS_AWS_RDS_READ_LATENCY
ReadLatency
Seconds
This metric is the average amount of time taken per disk read I/O operation.
3.0
QOS_AWS_RDS_WRITE_LATENCY
WriteLatency
Seconds
This metric is the average amount of time taken per disk write I/O operation.
3.0
QOS_AWS_RDS_READ_THROUGHPUT
ReadThroughput
Bytes/Second
This metric is the average number of bytes read from disk per second.
3.0
QOS_AWS_RDS_WRITE_THROUGHPUT
WriteThroughput
Bytes/Second
This metric is the average number of bytes written to disk per second.
3.0
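Because the throughput metrics are reported in Bytes/Second and the IOPS metrics in Count/Second, dividing one by the other gives the average size of a disk operation. The sketch below shows that arithmetic; the function name and sample values are hypothetical.

```python
from typing import Optional


def average_io_size(throughput_bytes_per_sec: float, iops: float) -> Optional[float]:
    """Return the average bytes per disk operation, or None when idle.

    Pair ReadThroughput with ReadIOPS, or WriteThroughput with WriteIOPS,
    sampled over the same period.
    """
    if iops <= 0:
        # No operations were recorded, so an average is undefined.
        return None
    return throughput_bytes_per_sec / iops


# Hypothetical sample: 8 MiB/s of reads at 512 read operations per second.
print(average_io_size(8_388_608, 512))  # 16384.0
```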
Memory Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_RDS_SWAP_USAGE
SwapUsage
Bytes
This metric is the amount of swap space used on the DB Instance.
3.0
QOS_AWS_RDS_FREEABLE_MEMORY
FreeableMemory
Bytes
This metric is the amount of available random access memory.
3.0
Network Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_RDS_NETWORK_RECEIVE_THROUGHPUT
NetworkReceiveThroughput
Bytes
This metric is the incoming (Receive) network traffic on the DB instance, including both customer database traffic and Amazon RDS traffic used for monitoring and replication.
3.0
QOS_AWS_RDS_NETWORK_TRANSMIT_THROUGHPUT
NetworkTransmitThroughput
Bytes
This metric is the outgoing (Transmit) network traffic on the DB instance, including both customer database traffic and Amazon RDS traffic used for monitoring and replication.
3.0
Database Status Metrics
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_DB_INSTANCE_STATUS
DBInstanceStatus
State
This metric is the status of a DB instance and indicates the health of the instance. You can use this information to monitor the current state of your RDS DB instance. You can set up the threshold for status using a numeric value between 0 and 23. Each number is assigned a status value, as follows:
  • 0: Available
  • 1: Backing-up
  • 2: Configuring-enhanced-monitoring
  • 3: Creating
  • 4: Deleting
  • 5: Failed
  • 6: Inaccessible-encryption-credentials
  • 7: Incompatible-credentials
  • 8: Incompatible-network
  • 9: Incompatible-option-group
  • 10: Incompatible-parameters
  • 11: Incompatible-restore
  • 12: Maintenance
  • 13: Modifying
  • 14: Rebooting
  • 15: Renaming
  • 16: Resetting-master-credentials
  • 17: Restore-error
  • 18: Starting
  • 19: Stopping
  • 20: Stopped
  • 21: Storage-full
  • 22: Storage-optimization
  • 23: Upgrading
5.34
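When working with collected QOS_AWS_DB_INSTANCE_STATUS values outside the probe, it can help to translate the numeric value back into its status name. The following is a minimal sketch of such a lookup that mirrors the list above; the names DB_INSTANCE_STATUSES and db_status_name are hypothetical and are not provided by the probe.

```python
# Status names in the order of the numeric values 0 through 23 listed above.
DB_INSTANCE_STATUSES = [
    "Available",
    "Backing-up",
    "Configuring-enhanced-monitoring",
    "Creating",
    "Deleting",
    "Failed",
    "Inaccessible-encryption-credentials",
    "Incompatible-credentials",
    "Incompatible-network",
    "Incompatible-option-group",
    "Incompatible-parameters",
    "Incompatible-restore",
    "Maintenance",
    "Modifying",
    "Rebooting",
    "Renaming",
    "Resetting-master-credentials",
    "Restore-error",
    "Starting",
    "Stopping",
    "Stopped",
    "Storage-full",
    "Storage-optimization",
    "Upgrading",
]


def db_status_name(value: int) -> str:
    """Map a numeric status value to its name, or "Unknown" if out of range."""
    if 0 <= value < len(DB_INSTANCE_STATUSES):
        return DB_INSTANCE_STATUSES[value]
    return "Unknown"


print(db_status_name(21))  # Storage-full
```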
QoS data for the AWS Route 53 service
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_ROUTE53_CONNECTION_TIME
ConnectionTime
ms
This metric is the time that it takes Amazon Route 53 health checkers to establish a TCP connection with the endpoint.
5.0
QOS_AWS_ROUTE53_HEALTH_CHECK_PERCENTAGE
HealthCheckPercentageHealthy
Percent
This metric is the percentage of Amazon Route 53 health checkers that consider the selected endpoint to be healthy.
5.0
QOS_AWS_ROUTE53_HEALTH_CHECK_STATUS
HealthCheckStatus
State
This metric is the health status of the service. You can set up the threshold for status using a numeric value between 0 and 1. Each number is assigned a status value, as follows:
  • 0: Unhealthy
  • 1: Healthy
5.0
QOS_AWS_ROUTE53_SSL_HANDSHAKE_TIME
SSLHandshakeTime
ms
This metric is the time that it takes Amazon Route 53 health checkers to complete the SSL handshake.
5.0
QOS_AWS_ROUTE53_TIME_TO_FIRST_BYTE
TimeToFirstByte
ms
This metric is the time that it takes Amazon Route 53 health checkers to receive the first byte of the response to an HTTP or HTTPS request.
5.0
QoS data for the AWS SNS service
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_SNS_NUMBER_OF_MESSAGES_PUBLISHED
NumberOfMessagesPublished
Count
This metric is the number of messages published.
3.5
QOS_AWS_SNS_PUBLISHED_SIZE
PublishSize
Bytes
This metric is the size of messages published.
3.5
QOS_AWS_SNS_NUMBER_OF_NOTIFICATION_DELIVERED
NumberOfNotificationsDelivered
Count
This metric is the number of messages successfully delivered.
3.5
QOS_AWS_SNS_NUMBER_OF_NOTIFICATION_FAILED
NumberOfNotificationsFailed
Count
This metric is the number of messages that SNS failed to deliver.
3.5
QoS data for the AWS SQS service
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_SQS_NUMBER_OF_MESSAGES_SENT
NumberOfMessagesSent
Count
This metric is the number of messages added to a queue.
3.5
QOS_AWS_SQS_SENT_MESSAGE_SIZE
SentMessageSize
Bytes
This metric is the size of messages added to a queue.
3.5
QOS_AWS_SQS_NUMBER_OF_MESSAGES_RECEIVED
NumberOfMessagesReceived
Count
This metric is the number of messages returned by calls to the ReceiveMessage API action.
3.5
QOS_AWS_SQS_NUMBER_OF_EMPTY_RECEIVES
NumberOfEmptyReceives
Count
This metric is the number of ReceiveMessage API calls that did not return a message.
3.5
QOS_AWS_SQS_NUMBER_OF_MESSAGES_DELETED
NumberOfMessagesDeleted
Count
This metric is the number of messages deleted from the queue.
3.5
QOS_AWS_SQS_APPROXIMATE_NUMBER_OF_MESSAGES_DELAYED
ApproximateNumberOfMessagesDelayed
Count
This metric is the number of messages in the queue that are delayed and not available for reading immediately. This can happen when the queue is configured as a delay queue or when a message has been sent with a delay parameter.
3.5
QOS_AWS_SQS_APPROXIMATE_NUMBER_OF_MESSAGES_VISIBLE
ApproximateNumberOfMessagesVisible
Count
This metric is the number of messages available for retrieval from the queue.
3.5
QOS_AWS_SQS_APPROXIMATE_NUMBER_OF_MESSAGES_NOT_VISIBLE
ApproximateNumberOfMessagesNotVisible
Count
This metric is the number of messages that are in flight. Messages are considered in flight if they have been sent to a client but have not yet been deleted or have not yet reached the end of their visibility window.
3.5
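The three approximate counters describe different states of messages in the same queue: available for retrieval, in flight, and delayed. As a rough sketch, they can be added to estimate the total number of messages a queue currently holds. The class and field names below are hypothetical, and the result is only as precise as the approximate metrics themselves.

```python
from dataclasses import dataclass


@dataclass
class QueueSample:
    """One sample of the three approximate SQS message counters."""

    visible: int      # ApproximateNumberOfMessagesVisible
    not_visible: int  # ApproximateNumberOfMessagesNotVisible (in flight)
    delayed: int      # ApproximateNumberOfMessagesDelayed

    def total_messages(self) -> int:
        """Estimate the total number of messages held by the queue."""
        return self.visible + self.not_visible + self.delayed


# Hypothetical sample values.
sample = QueueSample(visible=120, not_visible=15, delayed=5)
print(sample.total_messages())  # 140
```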
QoS data for the AWS S3 service
QoS Name
Metric Name
Units
Description
Version
QOS_AWS_FILE_WRITE_TIME
FileWriteTime
Seconds
This metric is the time taken to write the file to the bucket.
2.0
QOS_AWS_FILE_READ_TIME
FileReadTime
Seconds
This metric is the time taken to read the file from the bucket.
2.0
QOS_AWS_BUCKET_SIZE
Bucket Size
MB
This metric is the amount of data that is stored in a bucket.
5.0
QOS_AWS_NUMBER_OF_OBJECTS
Number Of Objects
Count
This metric is the total number of objects that are stored in a bucket.
5.0
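The bucket size and object count can be combined to estimate the average object size in a bucket. The following sketch assumes that both values were sampled for the same bucket at about the same time; the function name and sample values are hypothetical.

```python
from typing import Optional


def average_object_size_mb(bucket_size_mb: float, number_of_objects: int) -> Optional[float]:
    """Return the average object size in MB, or None for an empty bucket."""
    if number_of_objects <= 0:
        # An empty bucket has no meaningful average object size.
        return None
    return bucket_size_mb / number_of_objects


# Hypothetical sample: 2048 MB stored across 512 objects.
print(average_object_size_mb(2048, 512))  # 4.0
```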