AWS Cloud Operations Blog
Continuously optimize your operational excellence posture through AWS Trusted Advisor
AWS Trusted Advisor continuously evaluates your AWS environment using best practice checks in the categories of cost optimization, performance, resilience, security, service limits, and operational excellence and recommends actions to remediate any deviations from AWS best practices in the AWS Well-Architected Framework. AWS Well-Architected Framework is a collection of architectural best practices and guidance to […]
Disaster Recovery (DR) Failover to the Disconnected Edge
Introduction Many enterprises rely on AWS to host the entirety of their infrastructure due to the inherent advantages of cloud computing. However, some enterprises operate mission critical workloads from remote areas at an increased risk to lose external network connectivity. For instance, a research facility located in a remote desert, an oil rig in international […]
How to drive the discussions around carbon footprint reduction to support modernization and migration to the Cloud?
A Gartner, Inc. survey revealed that 87% of business leaders expect to increase their organization’s investment in sustainability over the next two years. This blog aims to equip Information Technology (IT) teams with the necessary resources to start the conversation with business leaders and prepare a compelling business case that highlights the opportunity for carbon […]
Manage your AWS multi-account environment with Account Factory for Terraform (AFT)
Independent software vendors (ISVs) are AWS Partners who build products or services using AWS. Their workloads are typically diverse and require a flexible and customizable multi-account setup. Following are some examples: Backoffice workloads, which tend be deployed once and are then regularly updated, typically relying on commercial off-the-shelf software. Presales workloads, which are short lived […]
Leverage generative AI to create custom dashboard widgets in HAQM CloudWatch using HAQM CodeWhisperer
Observability describes how well you can understand what is happening in a system, often by instrumenting it to collect metrics, logs, and traces. To achieve operational excellence and meet business objectives, you need to understand how your systems are performing. In order to accomplish this, many customers use HAQM CloudWatch to get real-time monitoring, alerts […]
Improving Mergers & Acquisitions Due Diligence with AWS Audit Manager
The purpose of this narrative is to provide guidance for Mergers & Acquisitions (M&A) Due Diligence stakeholders on how to leverage AWS Audit Manager to support compliance and risk assessments during technical due diligence. The target audience of this guidance includes practitioners that support diligence, integration, corporate development (CorpDev), technology/IT, auditing, and advisory activities during […]
Monitoring and Visualizing HAQM EKS signals with Kiali and AWS managed open-source services
Microservices architecture enables scalability and agility for modern applications. However, distributed systems can introduce complexity when troubleshooting issues across services on different machines. To gain observability into microservices environments, operators need tools to monitor, analyze, and debug the interconnected services. Istio service mesh connects, secures, and observes microservices communications. It provides a way to manage […]
Achieve domain consistency in event-driven architectures
Application modernization is an important and growing migration strategy for many businesses. Most applications begin as a monolith, focusing on a specific business use case. As businesses grow, so does the complexity and number of business use-cases that their monoliths must support. This causes monolith application components to be tightly coupled and less cohesive, making […]
Accelerate Modernization outcomes with Automation
Introduction As organizations move towards modernizing their workloads in the cloud, there are key capabilities that need to be in place to enable the success of the modernization journey. The capabilities include organization structure, modernization strategy, automation, team readiness, and stakeholder sponsorship. Out of these, automation plays an outsized role in realizing the benefits of […]
Using Puppet to automate AWS Elastic Disaster Recovery for HAQM EC2 instances at scale
Customers improve their disaster recovery posture with automation. Automation reduces the operational overhead of managing source servers and automatically implementing your disaster recovery strategy. AWS Elastic Disaster Recovery replicates your servers, such as HAQM EC2 instances. In the case of a disaster, you can use AWS Elastic Disaster Recovery to recover your application servers. It […]