AWS Cloud Operations Blog

Tag: HAQM CloudWatch

Automate creation of HAQM CloudWatch alarms and dashboards with AWS Systems Manager and Ansible

Automate creation of HAQM CloudWatch alarms and dashboards with AWS Systems Manager and Ansible

Monitoring HAQM EC2 instances is critical to proactively identify any underlying issues or to troubleshoot the performance of the instances. HAQM CloudWatch provides a reliable, scalable, and flexible monitoring solution. Customers running EC2 instances in a self-managed environment typically use HAQM CloudWatch metrics to monitor the performance of their instances and set up alarms for […]

How to set up HAQM CloudWatch alarms to monitor IO metrics of AWS EBS volumes performance using metric math

How to set up HAQM CloudWatch alarms to monitor IO metrics of HAQM EBS volumes performance using metric math

To prevent application or database performance hiccups from latency of a disk, it is very critical to monitor disk I/O and usage for performance issues. Disk I/O is the amount of read or write or input/output operations that occur during a period, in other words it measures the data transfer speed between a disk and […]

Monitoring best practices for AWS Outposts

Monitoring best practices for AWS Outposts

AWS Outposts allows you to run AWS infrastructure and services on-premises for a consistent fully managed hybrid experience. Outposts supports workloads and devices requiring low latency access to on-premises systems, local data processing, data residency, and application migration with local system inter dependencies. Outposts offers customers HAQM CloudWatch metrics and AWS Health events to effectively […]

How Hapag-Lloyd established observability for serverless multi-account workloads

This post is co-authored by Grzegorz Kaczor from Hapag-Lloyd AG and Michael Graumann and Daniel Moser from AWS. Introduction Establishing observability over the state, performance, health, and security posture of applications is key to successfully operating multi-account workloads in the cloud. As the number and size of workloads increases, finding and correlating all available information […]

Delete Empty CloudWatch Log Streams

Customers that use HAQM CloudWatch to monitor their applications and resources on AWS can accumulate a large number of log streams that are used only briefly, or are no longer required. While there is no charge for maintaining an empty log stream, having potentially thousands of them can be overwhelming, especially while using the CloudWatch […]

Build Cloud Operations skills using the new AWS Observability Training

Full-stack observability at AWS includes AWS-native, Application Performance Monitoring (APM), and open-source solutions, giving you the ability to understand what is happening across your technology stack at any time. AWS Observability lets you collect, correlate, aggregate, and analyze telemetry in your network, infrastructure, and applications in the cloud, hybrid, or on-premises environments so you can gain […]

Moving to event-driven architectures with serverless event aggregators

Event-driven architectures are gaining attention because they help organizations achieve benefits through the decoupling of services, increasing scalability, adding flexibility, and increasing developers’ agility. As companies grow over time, they need to find ways to integrate (couple) multiple processes and applications (which are usually owned by different teams.) Coupled applications work in certain scenarios where […]

Using HAQM CloudWatch metrics to monitor time to expiration for Reserved Instances | HAQM Web Services

This post shows you how to monitor the days remaining for HAQM EC2 Reserved Instances. The solution uses a custom HAQM CloudWatch metric published via an AWS Lambda function. It creates a CloudWatch alarm and an HAQM Simple Notification Service (HAQM SNS) topic for notification when the alarm exceeds the user-defined threshold. CloudWatch allows you […]

Monitoring HAQM RDS and HAQM Aurora using HAQM Managed Grafana

Organizations running critical applications on AWS using fully managed database services such as HAQM Relational Database Service (HAQM RDS) and HAQM Aurora rely on robust monitoring to ensure that their databases are performant, and cause no service disruptions to their customers. HAQM Managed Grafana is a fully managed and secure data visualization service that you […]

Monitoring the status of Windows services with HAQM CloudWatch

When you have an application that relies on a specific Windows service being up and running, knowing the status of this service can be a useful part of your observability solution. This service status data can be displayed on dashboards, used to create alarms, or used to trigger automated resolutions. This post presents a solution […]