AWS Cloud Operations Blog

Tag: Monitoring & Observability

Monitor AWS Transit Gateway Flow Logs centrally using Amazon Managed Grafana

As organizations continue to expand their cloud infrastructure by connecting multiple Amazon Virtual Private Clouds (Amazon VPC) across accounts and regions, the complexity of managing their network environment increases. AWS Transit Gateway has emerged as a powerful solution to simplify this complexity by providing a centralized hub for secure communication between Amazon VPCs, on-premises systems, and […]

Know Before You Go – AWS re:Invent 2024 Monitoring and Observability

Planning to join us in Las Vegas from Dec 2 to Dec 6 at AWS re:Invent 2024 and looking to learn more about monitoring and observability? If you are, this blog post highlights Cloud Operations sessions that focus on monitoring and observability at re:Invent 2024! Monitoring and observability allow you to understand the health of your applications and […]

Sign in to AWS Console Mobile Application with an AWS Access Portal or third-party IdP URL

AWS customers rely on the AWS Console Mobile Application to monitor and manage their AWS resources, and to receive notifications that keep them informed while away from their desktop devices. Customers who use single sign-on (SSO) can face a unique set of challenges when signing in to the AWS Console Mobile Application. While SSO can offer enhanced security and […]

Managing access to AWS accounts from Microsoft Teams and Slack at scale using AWS Organizations and AWS Chatbot

Customers use chat collaboration applications like Microsoft Teams and Slack to collaborate and manage their AWS applications. AWS Chatbot is a ChatOps service that enables customers to monitor, troubleshoot issues, and manage AWS applications from chat channels. AWS Chatbot provides autonomy and customizability to DevOps teams operating their AWS environments on the go from chat […]

Improve Amazon Bedrock Observability with Amazon CloudWatch AppSignals

Given the pace of innovation in generative AI applications, there is increasing demand for more granular observability into applications using Large Language Models (LLMs). Specifically, customers want visibility into: Prompt metrics like token usage, costs, and model IDs for individual transactions and operations, apart from service-level aggregations. Output quality factors including potential toxicity, harm, truncation […]

Troubleshooting AWS Glue ETL Jobs using Amazon CloudWatch Logs Insights enhanced queries

In the realm of data integration and ETL (Extract, Transform, Load) processes, organizations often face challenges in ensuring the efficiency and performance of their ETL jobs. Monitoring the efficiency of ETL jobs becomes crucial in maintaining seamless data workflows. This is where Amazon CloudWatch Logs Insights comes into play, offering powerful log analytics to unearth […]

Introducing Amazon CloudWatch Alarm Recommendations

Amazon CloudWatch is a foundational AWS service that provides you with actionable insights into your cloud resources and applications. With Amazon CloudWatch Metrics, you can gain better visibility into your infrastructure and large-scale application performance. You can set up alarms using Amazon CloudWatch Alarms for metrics emitted by AWS services or your applications. Identifying which metrics […]

What’s new in AWS Observability at re:Invent 2023

Let’s recap the week at AWS re:Invent 2023 with a round-up of the AWS Observability launches across Amazon CloudWatch, Amazon Managed Grafana, and Amazon Managed Service for Prometheus. From automatic instrumentation and operation of applications in CloudWatch, to agentless scraping of Prometheus metrics in Managed Service for Prometheus, read on to learn about the features […]

Observability using native Amazon CloudWatch and AWS X-Ray for serverless modern applications

In this blog post, we will share how you can use AWS-native observability tools to measure the current state of your modern serverless applications and how to get started with minimal effort. We will review tools like Amazon CloudWatch and AWS X-Ray and explore how these services can help you instrument your application […]

Automate insights for your EC2 fleets across AWS accounts and regions

Gaining insights into and managing a large Amazon Elastic Compute Cloud (Amazon EC2) fleet that is spread across multiple accounts and regions can be a challenging task. It’s crucial to have a quick and efficient method to identify which instances are managed by AWS Systems Manager (SSM) and gather detailed information about the instances that are […]