AWS Cloud Operations Blog

Category: Intermediate (200)

Centralized view of support cases opened from multiple AWS accounts using AWS Systems Manager

AWS Systems Manager Explorer is a customizable operations dashboard that reports information about your AWS resources. Explorer displays an aggregated view of operations data (OpsData) for your AWS accounts and AWS Regions. OpsData also includes information from supporting AWS services, such as AWS Trusted Advisor, AWS Compute Optimizer, and AWS Support Center cases, among other […]

Service Notice – Upcoming changes required for AWS Config

On July 5, 2022, the AWS managed policy AWSConfigRole will be deprecated. This policy is being replaced by a more scoped-down policy, AWS_ConfigRole. The AWSConfigRole managed policy will continue working for all currently attached users, groups, and roles. However, after July 5, 2022, the AWSConfigRole managed policy can’t be attached to any new users, groups, […]

How to automate AWS Support API with HAQM EventBridge

The practice of Operational Integration defines how one organization’s people, processes, and tools integrate with the people, processes, and tools of another organization. When certain activities occur within one organization, it may trigger an automated or manual response in another. For example, it’s common for AWS customers to operationally integrate with AWS Support or AWS Managed Services whenever […]

How to enable VPC Flow Logs automatically using AWS Config rules

This post discusses an automated process for enabling HAQM Virtual Private Cloud (HAQM VPC) Flow Logs using AWS Config rule remediation. Customers use HAQM VPC Flow logs to capture information about the IP traffic going to and from network interfaces in an HAQM VPC. You can deploy this solution with the help of AWS Control […]

How to Monitor Databricks with HAQM CloudWatch

This post was written by Lei Pan and Sajith Appukuttan from Databricks. In this post, we look closely at monitoring and alerting systems – both critical components of any production-level environment. We’ll start with a review of the key reasons why engineers should build a monitoring/alerting system for their environment, the benefits, as well as […]

Streamline Automation with Outbound Webhooks for AWS Systems Manager Runbooks

Automation runbooks let you define a set of actions that automate various operations in your AWS environment. Runbooks allow our customers to simply configure automation workflows that they can execute based on either events or a scheduled cadence. These workflows commonly require integration with third-party systems, such as Slack, Jira, and ServiceNow. As of January […]

Resizing volumes and instances using ServiceNow and AWS

The AWS Service Management Connector for ServiceNow enables ServiceNow end users to provision, manage, and operate AWS resources natively through ServiceNow. This lets our customers connect a technical operation with a business workflow, perhaps requiring approvals from management or other teams. The key in all of this is empowering and enabling end-users, thereby removing manual […]

Viewing custom metrics from statsd with HAQM Managed Service for Prometheus and HAQM Managed Grafana

Monitoring applications based on custom metrics is important for a resilient system. One of the mechanisms to generate custom metrics from applications is statsd – a NodeJs process to collect custom application performance metrics periodically. However, statsd doesn’t provide long-term storage, rich querying, visualization, or an alerting solution. HAQM Managed Service for Prometheus and HAQM […]

Viewing collectd statistics with HAQM Managed Service for Prometheus and HAQM Managed Service for Grafana

Monitoring systems are essential for a resilient solution. A popular tool to monitor Linux-based physical or virtual machines is collectd – a daemon to collect system and application performance metrics periodically. However, collectd doesn’t provide long-term storage for metrics, rich querying, visualization, or an alerting solution. The HAQM Managed Service for Prometheus is a serverless […]

AWS offerings available to support your cloud migration

With over a million active customers, AWS has experience helping organizations of all sizes migrate workloads to the cloud. A common theme is that selecting the right tools is crucial for reducing risk and enabling a reliable migration. In this blog post, I will select some of the most popular AWS migration tools and outline […]