AWS Cloud Operations Blog

Category: Monitoring and observability

Update your Amazon CloudWatch dashboards automatically using Amazon EventBridge and AWS Lambda

Amazon CloudWatch lets customers collect monitoring and operational data in the form of logs, metrics, and alarms. This allows for easy visualization and notifications regarding their workload health. Amazon CloudWatch dashboards are customizable home pages in the CloudWatch console that you can use to monitor your resources in a single view, even those resources that […]

How and when to enable session cookies with Amazon CloudWatch RUM

Amazon CloudWatch RUM is a real user monitoring service that closes the gap between the end-user experience in a web application, and the serving of that content from your AWS or on-premises environment. By measuring client-side application performance, such as page load time and JavaScript errors, you have access to new and powerful tools for […]

Monitoring AWS Lambda errors using Amazon CloudWatch

When we troubleshoot failed invocations from our Lambda functions, we often must identify the invocations that failed (from among all of the invocations), identify the root cause, and reduce mean time to resolution (MTTR). In this post, we will demonstrate how to utilize Amazon CloudWatch to identify failed AWS Lambda invocations. Likewise, we will show how […]

Visualize Amazon EC2 based VPN metrics with Amazon CloudWatch Logs

Organizations have many options for connecting to on-premises networks or third parties, including AWS Site-to-Site VPN. However, some organizations still need to use an Amazon Elastic Compute Cloud (Amazon EC2) instance running VPN software, such as strongSwan. Gaining insight into Amazon EC2-based VPN metrics can be challenging when compared to AWS native VPN services that […]

Create metrics and alarms for specific web pages with Amazon CloudWatch RUM

Amazon CloudWatch RUM makes it easy for AWS customers to access real-world performance metrics from web applications, thereby giving insights into the end-user experience. These user experiences are quantified into discrete metrics that you can then create alarms for. But what if you must have different load time alarms for certain pages? Or you’re testing […]

Identify operational issues quickly by using Grafana and Amazon CloudWatch Metrics Insights (Preview)

Amazon CloudWatch has recently launched Metrics Insights (Preview) – a fast, flexible, SQL-based query engine that enables you to identify trends and patterns across millions of operational metrics in real-time. With Metrics Insights, you can easily query and analyze your metrics to gain better visibility into the health and performance of your infrastructure and large scale […]

Monitoring Service Level Objectives (“SLOs”) Made Easier with Nobl9 and Amazon CloudWatch Metrics Insights

The updated version (June 2022) that follows is based on working backward from a customer need to understand Service Level Objectives (“SLOs”) and the benefits from monitoring SLOs. This post was originally written in Nov 2021 by Natalia Sikora-Zimna, Product Owner at Nobl9. A service can be provided by infrastructure, a platform, software, or people. […]

How to validate authentication using Amazon CloudWatch Synthetics – Part 2

In the second post of this two-part series, I will demonstrate how to utilize the Amazon CloudWatch Synthetics canary that uses the multiple HTTP endpoints blueprint in order to monitor an application requiring an authentication certificate. The first post Multi-step API monitoring using Amazon CloudWatch Synthetics provided steps to create an Amazon CloudWatch Synthetics script for executing a […]

Share your Amazon CloudWatch Dashboards with anyone using AWS Single Sign-On

Amazon CloudWatch enables customers to collect monitoring and operational data in the form of logs, metrics, alarms, and events, thereby allowing easy workload visualization and notifications. Traditionally, operational health data access was only viewable for technical support staff, thereby making operational health opaque to a wider business audience. However, actionable and valuable business insights can […]

Monitor Private VPC Endpoint Health in Hybrid DNS Environments Using CloudWatch Synthetics

We start by paying homage to the Amazon CloudWatch Synthetics canary naming convention, which nods to the original use of canaries to detect carbon monoxide in coal mines. The bird’s small size, high metabolism, and intensified breathing led to their early demise when exposed to the poisonous gas, thereby allowing miners to take corrective action […]