AWS Cloud Operations Blog
Tag: HAQM CloudWatch
Introducing HAQM CloudWatch Alarm Recommendations
HAQM CloudWatch is a foundational AWS service that provides you with actionable insights into your cloud resources and applications. With HAQM CloudWatch Metrics, you can gain better visibility into your infrastructure and large-scale application performance. You can set up alarms using HAQM CloudWatch Alarms for metrics emitted by AWS services or your applications. Identifying which metrics […]
How to centralize CloudWatch Alarms with HAQM EventBridge and AWS CloudFormation
HAQM CloudWatch lets customers collect monitoring and operational data in the form of logs, metrics, and events, providing an easy way to monitor and receive notifications regarding their workload health and often integrate directly with other systems, such as JIRA Service Desk and ServiceNow. The CloudWatch alarms feature lets you monitor CloudWatch metrics and receive […]
Automate the creation of AWS Support cases using HAQM CloudWatch alarms and HAQM Bedrock
For production applications, the Mean-Time-To-Recovery (MTTR) is critical. In line with this, AWS offers Business, Enterprise On-Ramp and Enterprise support plans where AWS customers can benefit from shorter response time for cases related to production and business critical workloads. However, without having an automated way to notify AWS support, creating a case is a manual […]
Observe your Azure and AWS workloads simultaneously with HAQM CloudWatch
Overview Effective operation of cloud applications and services demands a strong focus on monitoring and observability. It’s critical for your teams to define, capture, and analyze metrics, ensuring operational visibility and extracting actionable insights from logs. In many companies, technical teams share integrated systems to monitor the services or infrastructure they manage. Shared observability systems […]
Leverage generative AI to create custom dashboard widgets in HAQM CloudWatch using HAQM CodeWhisperer
Observability describes how well you can understand what is happening in a system, often by instrumenting it to collect metrics, logs, and traces. To achieve operational excellence and meet business objectives, you need to understand how your systems are performing. In order to accomplish this, many customers use HAQM CloudWatch to get real-time monitoring, alerts […]
Announcing AWS CloudTrail Lake one-year extendable retention pricing option
In 2022 HAQM Web Services (AWS) released AWS CloudTrail Lake, a managed audit and security lake that allows you to aggregate, immutably store, visualize, and query your activity logs for auditing, security investigation, and operational troubleshooting. Working backwards from our customers we have added capabilities to CloudTrail Lake such as the ability to copy CloudTrail events into […]
Monitoring GPU workloads on HAQM EKS using AWS managed open-source services
As machine learning (ML) workloads continue to grow in popularity, many customers are looking to run them on Kubernetes with graphics processing unit (GPU) support. HAQM Elastic Compute Cloud (HAQM EC2) instances powered by NVIDIA GPUs deliver the scalable performance needed for fast ML training and cost-effective ML inference. Monitoring GPU utilization gives valuable information for researchers working […]
Observe dynamic sites with HAQM CloudWatch Synthetics and AWS Systems Manager Parameter Store
Overview Maintaining and improving end user experience is key and as your business grows, the number of endpoints you need to observe can grow quickly. It can become more challenging and time consuming to build multiple canaries to observe them. This solution is designed to show how you can use a consistent and automated approach […]
Observability using native HAQM CloudWatch and AWS X-Ray for serverless modern applications
Introduction In this blog post, we will share how you can use AWS-native observability tools to measure the current state of your modern serverless applications and how to get started with the minimal effort. We will review tools like HAQM CloudWatch and AWS X-Ray and explore how these services can help you instrument your application […]
Building a central HAQM CloudWatch Dashboard to monitor Lambda@Edge logs and metrics
Introduction Lambda@Edge is a powerful feature of HAQM CloudFront that allows you to execute serverless code closer to your application users, resulting in improved performance and reduced latency. By distributing Lambda@Edge functions to edge locations worldwide, AWS ensures that the code executes closer to end users, providing faster response times. Moreover, the serverless nature of […]