AWS Cloud Operations Blog

Category: HAQM EC2

Use AWS Systems Manager Explorer to optimize your compute resources across your AWS Organizations

As a solutions architect with AWS, I work with customers to right-size their HAQM Elastic Compute Cloud (EC2) instances to achieve a balance between performance and cost. Optimization is an iterative task that involves several cycles of making changes, analyzing results, and repeating until you reach a satisfactory state. You need to understand the details […]

Automating HAQM CloudWatch Alarms with AWS Systems Manager

HAQM CloudWatch is a monitoring and observability service built for DevOps engineers, developers, Site Reliability Engineers (SRE), and IT managers. CloudWatch provides you with data and actionable insights to monitor your applications, respond to system-wide performance changes, optimize resource utilization, and get a unified view of operational health. Are you looking for an automated way […]

Deploying packages sequentially using AWS Systems Manager

AWS Systems Manager helps to control the sequence of package deployment in managed instances. Managed instances can be HAQM Elastic Compute Cloud (HAQM EC2) instances, virtual machines (VM) including VMs in other cloud environments, and on-premises servers. Customers are trying to automate the process of managing their state of hybrid infrastructure. They need to run […]

Improve monitoring efficiency using HAQM CloudWatch Composite Alarms

OVERVIEW HAQM CloudWatch alarms help customers improve infrastructure monitoring efficiency by reducing the time to detect, triage, and diagnose issues that impact workload performance. CloudWatch alarms can be used extensively as a means to alert customers when application and infrastructure metrics exceed static or dynamically set thresholds. On March 4, 2020 AWS released CloudWatch composite alarms to extend existing […]

Comcast adds CloudWatch metrics aggregation when monitoring Kinesis Video Streams

This post walks through how Comcast Corporation set up monitoring for HAQM Kinesis Video Streams. To support their business requirements and performance SLAs, they added additional aggregated metrics vended by the Kinesis Video Stream service into HAQM CloudWatch. Comcast Corporation is a global media and technology company that offers a home security solution called Xfinity […]

How to set up automatic failover for AWS OpsWorks for Chef Automate

Creating a resilient configuration management system comes with a variety of challenges. The goal of this post is to solve an important component of this: failing over to a standby AWS OpsWorks for Chef Automate server when a primary server is unavailable. With the procedure described in this post, if the main OpsWorks for Chef […]

Leveraging AWS CloudFormation to create an immutable infrastructure at Nubank

Bruno Halley Schaefer, software engineer, Nubank Hugo Carvalho, senior solutions architect, AWS Marcelo Nunes, senior technical account manager, AWS Enterprise Support Team   Nubank, a Brazilian company that is one of the world’s largest independent digital banks, is innovatively transforming Latin America’s financial landscape by providing transparent, simple, and efficient services. The company fights complexity […]

Use Atlassian Opsgenie with AWS Systems Manager to run the EC2Rescue tool

On-call engineers are responsible for responding to alerts, troubleshooting high priority incidents, and taking action to remediate issues. Automation tools like AWS Systems Manager and Atlassian Opsgenie can help these engineers by reducing repetitive work and allowing them to focus on the most important tasks. In this blog post, Merve Bolat, Associate Product Manager at […]

Centralized multi-account and multi-Region patching with AWS Systems Manager Automation

Update 01/2023: AWS Systems Manager announces Patch Policies, enabling cross account and cross Region patching. Patch Policies provide a user experience in a single console to easily define and enforce patch compliance across accounts and Regions with a few clicks. For more information, see Centrally deploy patching operations across your AWS Organization using Systems Manager […]