AWS Cloud Operations Blog
Category: HAQM CloudWatch
Respond to CloudWatch Alarms with HAQM Bedrock Insights
Overview When operating complex, distributed systems in the cloud, quickly identifying the root cause of issues and resolving incidents can be a daunting task. Troubleshooting often involves sifting through metrics, logs, and traces from multiple AWS services, making it challenging to gain a comprehensive understanding of the problem. So how can you streamline this process […]
Monitor Java apps running on Tomcat server with HAQM CloudWatch Application Signals
Traditionally, Java web applications are packaged into Web Application Resource (WAR) files, which can be deployed on any Servlet/JSP container like Tomcat server. These applications often operate within distributed environments, involving multiple interconnected components such as databases, external APIs, and caching layers. Monitoring the performance and health of Java web applications can be challenging due […]
Troubleshooting AWS Glue ETL Jobs using HAQM CloudWatch Logs Insights enhanced queries
Introduction In the realm of data integration and ETL (Extract, Transform, Load) processes, organizations often face challenges in ensuring efficiency and performance of the ETL jobs. Monitoring the efficiency of ETL jobs becomes crucial in maintaining seamless data workflows. This is where HAQM CloudWatch Logs Insights comes into play, offering powerful log analytics to unearth […]
Testing and debugging HAQM CloudWatch Synthetics canary locally
Introduction HAQM CloudWatch Synthetics canaries are scripts that monitor your endpoints and APIs by simulating the actions of a user. These canaries run on a schedule, check the availability and latency of your applications, and alert you when there are issues. Canary scripts are written in Node.js and Python, and they run inside an AWS […]
Monitor Python apps with HAQM CloudWatch Application Signals (Preview)
AWS announced HAQM CloudWatch Application Signals during re:Invent 2023. It is a new feature to monitor and understand the health of Java applications. Today we are excited to announce that Application Signals now supports Python applications. Enabling Application Signals allows you to use AWS Distro for OpenTelemetry (ADOT) to instrument Python applications without code changes. […]
Using the unified CloudWatch Agent to send traces to AWS X-Ray
Today, applications are more distributed than ever before and they no longer run in isolation. This is especially the case when utilizing HAQM Elastic Container Service (HAQM ECS) or HAQM Elastic Kubernetes Service (HAQM EKS). A distributed workload or system is one that encompasses multiple small independent components, all working together to complete a task or job. […]
Unlocking Insights: Turning Application Logs into Actionable Metrics
Modern software development teams understand the importance of observability as a critical aspect of building reliable and resilient applications. By implementing observability practices, software teams can proactively identify issues, uncover performance bottlenecks, and enhance system reliability. However, it is a fairly recent trend and still lacks industry-wide adoption. As organizations standardize on containers, they often […]
Analyze AWS Microservices architecture to identify and address performance issues
HAQM Payment Services (APS) is a payment service provider in the Middle East and North Africa. With its secure and seamless payment experience, it empowers businesses to build their online presence. HAQM Payment Services is based on a broad and complex microservice based architecture that are dependent on multiple AWS services, including HAQM Elastic Compute […]
Real User Monitoring with HAQM CloudWatch RUM and HAQM Managed Grafana
In today’s fast-paced digital world, users expect fast and reliable web experiences. Slow-loading pages, errors, and other performance issues can lead to lower engagement and conversion rates, ultimately hurting a business’s bottom line. That’s where Real User Monitoring (RUM) comes in. Real User Monitoring (RUM) is a crucial aspect of modern web application development, allowing developers and […]
Analyzing HAQM CloudWatch Internet Monitor measurement logs using HAQM Athena and HAQM QuickSight
Overview In this blog post, we describe how to use HAQM Athena with HAQM CloudWatch Internet Monitor’s measurement information stored on HAQM S3. You can use Athena to analyze event logs, and then build a dashboard to visualize the measurement logs and gain insights by using HAQM QuickSight. Internet Monitor publishes measurements to CloudWatch Logs […]