AWS Cloud Operations Blog
Category: Monitoring and observability
Setting up an Amazon CloudWatch Billing Alarm to Proactively Monitor Estimated Charges
I’m pleased to announce the start of a multi-part series for CloudWatch Billing in which I will explore the techniques for proactively managing your AWS costs. This series kicks off with a walkthrough of setting up CloudWatch Billing Alarms from the AWS console. This walkthrough demonstrates how to enable Billing Alerts, create an Amazon CloudWatch […]
Visual monitoring of applications with Amazon CloudWatch Synthetics
Monitoring application endpoints is a reliable way to measure availability. This best practice can be extended by adding synthetic monitoring to your observability strategy. Synthetic monitoring lets you continually verify your customers’ experience by following the same routes and actions as your customers. You can create scripts or canaries that monitor things such as availability, […]
Amazon Managed Service for Prometheus is now Generally Available
At re:Invent 2020, we launched Amazon Managed Service for Prometheus, a fully managed Prometheus-compatible service in Preview on AWS. It is a secure and scalable service customers can utilize to collect infrastructure and application metrics from workloads hosted on various environments, such as Amazon Elastic Kubernetes Service (Amazon EKS), Amazon Elastic Container Service (Amazon ECS), […]
Using AWS X-Ray and AWS Application Cost Profiler to track tenant cost of shared AWS Infrastructure
In our last blog post, we introduced AWS Application Cost Profiler (ACP), a new service that lets customers running multi-tenant applications receive granular cost breakdowns of shared AWS resources across their tenants. AWS Application Cost Profiler provides customers, especially SaaS ISVs, with a standard mechanism to correlate and report their infrastructure […]
Improve your application availability with AWS observability solutions
Distributed systems are complex due to their high number of interconnected components and susceptibility to failures caused by constant updates. Legacy monolithic applications can be distributed across instances and geographic locations or microservices. These rely on thousands of resources to operate and can be updated frequently, scaled elastically, or invoked on demand. In turn, these […]
Implementing a cross-account and cross-Region AWS Config status dashboard
AWS Config helps central IT administrators monitor the compliance of multiple AWS accounts and multiple regions in large enterprises. AWS Config utilizes a configuration recorder to detect changes in your resource configurations and capture these as configuration items. A separate configuration recorder exists for every region in each AWS account. However, AWS Config recorders can […]
Simplify your canary by batching multiple URLs in Amazon CloudWatch Synthetics
Learn with Shree how to simplify your canary by batching multiple URLs in Amazon CloudWatch Synthetics.
Collect, aggregate, and analyze Rancher Kubernetes Cluster logs with Amazon CloudWatch
Rancher is a popular open-source container management tool, used by many organizations, that provides an intuitive user interface for managing and deploying Kubernetes clusters on Amazon Elastic Kubernetes Service (Amazon EKS) or Amazon Elastic Compute Cloud (Amazon EC2). When Rancher deploys Kubernetes onto nodes in Amazon EC2, it uses Rancher Kubernetes Engine (RKE), which is Rancher’s […]
SNMP monitoring using Amazon CloudWatch and Elastic Logstash
Customers want a single pane of glass for their systems operations where they can visualize the health and performance of applications running in several AWS Regions and in their on-premises environment. Simple Network Management Protocol (SNMP) is an internet standard protocol for collecting and organizing information about managed devices on IP networks and for modifying […]
Amazon Managed Grafana is now Generally Available
At re:Invent 2020, we introduced Amazon Managed Grafana and made it available in preview. Since then, we’ve been working on numerous enhancements that were made available during preview. Now we’re excited to launch Amazon Managed Grafana in General Availability (GA), and with this post we’ll lay out exactly what this means. Figure 1: List of […]