Networking & Content Delivery
Category: HAQM Machine Learning
Networking best practices for generative AI on AWS
Introduction As generative artificial intelligence (generative AI) continues to evolve, the demand for more powerful and efficient computing resources grows, along with the need to manage exponentially increasing amounts of data. Datasets used for training generative AI models are typically measured in terabytes (TB), orders of magnitude bigger than traditional machine learning (ML) datasets whose […]
Private network for data movement in generative AI
Private network for data movement in generative AI In this post, we cover the architecture patterns for building secure, private network connectivity for data movement in generative artificial intelligence (generative AI) using HAQM Web Services (AWS) and AWS Partner Network (APN) services. Data privacy and security are top of mind for customers exploring generative AI […]
Monitoring load balancers using HAQM CloudWatch anomaly detection alarms
Load balancers are a critical component in the architecture of distributed software services. AWS Elastic Load Balancing (ELB) provides highly performant automatic distribution for any scale of incoming traffic across many compute targets (HAQM Elastic Compute Cloud (HAQM EC2), HAQM Elastic Container Service (HAQM ECS), AWS Lambda, etc.), while enabling developers to adopt security best practices […]