AWS Cloud Operations Blog
Category: Intermediate (200)
Introducing HAQM EKS Observability Accelerator
Some of the details in this blog post are now outdated. For the latest information on the AWS Observability Accelerator please see Announcing AWS Observability Accelerator to configure comprehensive observability for HAQM EKS. Also explore the GitHub repository where you can find more details on how to get started. Observability is critical for any application […]
Introducing vended metrics for HAQM Managed Service for Prometheus
Today, I’m happy to announce that HAQM Managed Service for Prometheus now vends usage metrics to HAQM CloudWatch. These metrics can be used to help you gain better visibility into your HAQM Managed Service for Prometheus workspace. Let’s dive in to see how you could use these new Prometheus usage metrics in CloudWatch. I‘ve set […]
Using AWS AppConfig Feature Flags
AWS has a native feature flagging solution, AWS AppConfig Feature Flags. Feature flags are a powerful tool that allow engineers to safely push out new features to customers, but doing so in a measured and usually gradual way. In this blog post, you will learn about what feature flags are, what are the benefits to […]
Enforce best practices in AWS Systems Manager documents leveraging CFN Guard
Many of us use AWS Systems Manager (SSM) documents to help automate various tasks. As we author documents and move them toward deployment, we’ll likely enforce certain standards and best practices. The AWS CloudFormation team released a general-purpose tool called AWS CloudFormation Guard that we can use to help enforce these best practices. In this […]
Gaining more control over Multi-Regional AWS CloudFormation deployments
Routinely deploying resources to multiple regions is increasingly normal for situations like Disaster Recovery (DR), regulatory and compliance, and end-user latency requirements. Keeping multiple environments in sync is challenging and drives Infrastructure as Code (IaC) adoption through services like AWS CloudFormation. This post demonstrates a generic design pattern for orchestrating multi-Regional deployments when you need […]
Integrate HAQM CloudWatch metrics with ServiceNow using HAQM Managed Grafana
ServiceNow ITSM is a cloud-based platform designed to improve IT services, increase user satisfaction, and boost IT flexibility and agility. With ServiceNow IT Service Management, you can consolidate your legacy on-premise systems and IT tools into our single data model to transform the service experience, automate workflows, gain real-time visibility, and improve IT productivity. HAQM […]
Manage AWS resources in your Slack channels with AWS Chatbot
**This post was written while the feature to manage AWS resources in Slack channels was in public preview. This feature is now generally available. The information contained within this post is still relevant and helpful.** DevOps and engineering teams are increasingly moving their operations, system management, and CI/CD workflows to chat applications to streamline activities […]
Quantify custom application metrics with HAQM CloudWatch Logs and metric filters
Customers have valuable metrics emitted to their logs. Examples include web server response times, slow queries, purchases by partners, custom application metrics, and cache hits or misses. This data has unrealized potential value for increasing observability. Consumed by HAQM CloudWatch Logs and extracted using metric filters, customers can translate this data into actual CloudWatch metrics, […]
Avoid zero-day vulnerabilities with same-day security patching using AWS Systems Manager
This post was co-authored by Jordan Koch at Veradigm. Applying operating systems patches is one of the easiest ways to secure a system from ever-changing cybersecurity threats. However, for many organizations it is one of the most difficult and time-consuming tasks. Many organizations deploy operating system patches through their various environments, first applying to Development, […]
How CloudFix uses AWS Systems Manager Change Manager to deliver cost savings
For years, the CloudFix team has managed and maintained 120+ AWS hosted SaaS products across hundreds of AWS accounts. Although this model follows established AWS best practices, the team’s scope introduced operational challenges. Their team needed a way to identify cost-saving opportunities across their applications without making architectural compromises or introducing service disruption. The team […]