AWS Cloud Operations Blog

How BMW Group uses automation to achieve end-to-end compliance at scale on AWS

This post is co-written with Dr. Jens Kohl, Daniel Engelhardt, and Sascha Kallin from BMW Group. The BMW Group – headquartered in Munich, Germany – is a vehicle manufacturer with 149,000 employees worldwide and manufactures in over 30 production and assembly facilities across 15 countries. Today, the BMW Group (BMW) is the world’s leading manufacturer […]

Real User Monitoring with HAQM CloudWatch RUM and HAQM Managed Grafana

Real User Monitoring with HAQM CloudWatch RUM and HAQM Managed Grafana

In today’s fast-paced digital world, users expect fast and reliable web experiences. Slow-loading pages, errors, and other performance issues can lead to lower engagement and conversion rates, ultimately hurting a business’s bottom line. That’s where Real User Monitoring (RUM) comes in. Real User Monitoring (RUM) is a crucial aspect of modern web application development, allowing developers and […]

Analyzing HAQM CloudWatch Internet Monitor measurement logs using HAQM Athena and HAQM QuickSight

Analyzing HAQM CloudWatch Internet Monitor measurement logs using HAQM Athena and HAQM QuickSight

Overview In this blog post, we describe how to use HAQM Athena with HAQM CloudWatch Internet Monitor’s measurement information stored on HAQM S3. You can use Athena to analyze event logs, and then build a dashboard to visualize the measurement logs and gain insights by using HAQM QuickSight. Internet Monitor publishes measurements to CloudWatch Logs […]

Identifying resilience drift using AWS Resilience Hub

Most people think of disaster recovery as a mechanism to protect their applications against big events. However, in the fast-paced world of development where new code and infrastructure changes are occurring several times a month, it is important to put mechanisms in place to proactively understand impacts to the resilience posture of your applications. In […]

Selecting your first workloads to migrate your organization to HAQM Web Services (AWS).

Introduction Selecting your first workloads to migrate your organization to AWS is a key stage in delivering a successful migration. In this blog post we provide guidance on how to select early migration candidates. We cover how selecting these candidates can help to kick-off a successful migration, reduce risk, and build skills inside your organization. […]

VTEX scales to 150 million metrics using HAQM Managed Service for Prometheus

VTEX scales to 150 million metrics using HAQM Managed Service for Prometheus

VTEX is a multi-tenant platform with a distributed engineering operation. Observing hundreds of services in real time in an efficient manner is a technical challenge for the business. In this blog, we will show how VTEX created a resilient open source-based architecture aligned with a sharding strategy, using HAQM Managed Service for Prometheus (AMP) to […]

Automating HAQM EC2 Instances Monitoring with Prometheus EC2 Service Discovery and AWS Distro for OpenTelemetry

Traditionally, scraping application Prometheus metrics required manual updates to a configuration file, posing challenges in dynamic AWS environments where HAQM EC2 instances are frequently created or terminated. This not only proves time consuming but also introduces the risk of configuration errors, lacking the agility necessary in dynamic environments. In this blog post, we will demonstrate […]

Monitor your AWS resources on your mobile device with AWS Console Mobile Application

AWS customers are increasingly relying on AWS User Notifications to monitor and get real-time notifications about the AWS resources that are most important to them. The AWS Console Mobile Application can be configured as a notification delivery channel, where users can monitor AWS resources, get detailed resource notifications, diagnose issues, and take remedial actions, from […]

Delivering Business Value with Cloud Platform Teams

Imagine a world where software developers build applications quickly on the cloud, focusing on innovative features, unburdened by complex infrastructure and intricate configurations. Welcome to the era of the cloud platform team. Cloud platform teams build internal tools, automation, and self-service infrastructure to free developers from commodity tasks and enable them to innovate faster. However, […]

Streamline Platform Engineering using AWS CodeStar Connections with AWS Service Catalog

Introduction AWS Service Catalog and AWS CloudFormation now support Git-sync capabilities to allow Platform Engineers to streamline their DevOps processes by keeping their Infrastructure as Code (IaC) templates in their source control libraries like GitHub and BitBucket. These enhancements help Platform Engineers to more effectively create, version, and manage their Well-Architected patterns with application teams […]