AWS Cloud Operations Blog

Tag: HAQM Elastic Container Service

Gain operational insights for NVIDIA GPU workloads using HAQM CloudWatch Container Insights

As machine learning models grow more advanced, they require extensive computing power to train efficiently. Many organizations are turning to GPU-accelerated Kubernetes clusters for both model training and online inference. However, properly monitoring GPU usage is critical for machine learning engineers and cluster administrators to understand model performance and to optimize infrastructure utilization. Without visibility […]

Accelerate End-to-End Application Modernization with AWS App2Container and AWS Migration Hub Refactor Spaces

Accelerate End-to-End Application Modernization with AWS App2Container and AWS Migration Hub Refactor Spaces

This blog post was written with contributions from Gaurav Parashar who is prior AWS Customers often have challenges accelerating the modernization of their applications. The complexity of refactoring a monolith application often provides hurdles in depth of expertise, time and effort. In this blog, we will explore two mechanisms that can help you accelerate your […]

AppConfig Featured Image

Application configuration deployment to container workloads using AWS AppConfig

UPDATE (15 Dec 22): AWS AppConfig released an Agent for containers (EKS, ECS, Docker, Kubernetes) in December 2022, which makes calling AppConfig much simpler from containerized applications. We recommend using the AppConfig Agent for containers instead of the method below. Read the Agent documentation.   AWS AppConfig is a capability of AWS Systems Manager that you […]

Introducing HAQM CloudWatch Container Insights for HAQM ECS

HAQM Elastic Container Service (HAQM ECS) lets you monitor resources using HAQM CloudWatch, a service that provides metrics for CPU and memory reservation and cluster and services utilization. In the past, you had to enable custom monitoring of services and tasks. Now, you can monitor, troubleshoot, and set alarms for all your HAQM ECS resources using […]