AWS HPC Blog
Category: Artificial Intelligence
Scale Reinforcement Learning with AWS Batch Multi-Node Parallel Jobs
Autonomous robots are increasingly used across industries, from warehouses to space exploration. Developing these robots requires complex simulation and reinforcement learning (RL), but setting up training environments can be challenging and time-consuming. AWS Batch multi-node parallel (MNP) infrastructure, combined with NVIDIA Isaac Lab, offers a solution by providing scalable, cost-effective robot training capabilities for sophisticated behaviors and complex tasks.
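To give a flavor of what an MNP setup involves, here is a minimal sketch of the request payload for a container-based AWS Batch multi-node parallel job definition, built in Python. The job definition name, container image URI, and resource sizes are illustrative placeholders, not values from the post; the actual `register_job_definition` call (via boto3) is shown commented out.

```python
def build_mnp_job_definition(num_nodes: int = 2) -> dict:
    """Build a request payload for a container-based AWS Batch
    multi-node parallel (MNP) job definition.

    All concrete values below (name, image, sizes) are placeholders.
    """
    return {
        "jobDefinitionName": "rl-training-mnp",  # placeholder name
        "type": "multinode",                     # marks this as an MNP job
        "nodeProperties": {
            "numNodes": num_nodes,
            # The main node's exit code determines the job's exit status.
            "mainNode": 0,
            "nodeRangeProperties": [
                {
                    # "0:" means this container spec applies to all nodes.
                    "targetNodes": "0:",
                    "container": {
                        # Placeholder image URI for an RL training container.
                        "image": "123456789012.dkr.ecr.us-east-1.amazonaws.com/isaac-lab:latest",
                        "resourceRequirements": [
                            {"type": "VCPU", "value": "8"},
                            {"type": "MEMORY", "value": "32768"},
                            {"type": "GPU", "value": "1"},
                        ],
                    },
                }
            ],
        },
    }


if __name__ == "__main__":
    payload = build_mnp_job_definition(num_nodes=4)
    # To register it for real (requires AWS credentials):
    # import boto3
    # boto3.client("batch").register_job_definition(**payload)
    print(payload["type"], payload["nodeProperties"]["numNodes"])
```

AWS Batch launches all `numNodes` nodes together before the job starts, which is what makes MNP jobs suitable for tightly coupled distributed training.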
Enhancing Equity Strategy Backtesting with Synthetic Data: An Agent-Based Model Approach
Developing robust investment strategies requires thorough testing, but relying solely on historical data can introduce biases and limit your insights. Learn how synthetic data from agent-based models can provide an unbiased testbed to systematically evaluate your strategies and prepare for future market scenarios. Part 1 of 2 covers the theoretical foundations of the approach.
Scaling your LLM inference workloads: multi-node deployment with TensorRT-LLM and Triton on Amazon EKS
LLMs keep growing. Learn how NVIDIA Triton Inference Server, TensorRT-LLM, and Amazon EKS enable multi-node deployment of models like the 405B-parameter Llama 3.1. Let’s go large.
Deploying Generative AI Applications with NVIDIA NIM Microservices on Amazon Elastic Kubernetes Service (Amazon EKS) – Part 2
Learn how to deploy AI models at scale on AWS using NVIDIA NIM and Amazon EKS! This step-by-step guide shows you how to create a GPU cluster for inference in the second post of a two-part series.
Whisper audio transcription powered by AWS Batch and AWS Inferentia
Transcribe audio files at scale and at very low cost using Whisper with AWS Batch and AWS Inferentia. Check out this post to deploy a cost-effective solution in minutes!
Deploying generative AI applications with NVIDIA NIMs on Amazon EKS
Learn how to deploy AI models at scale on AWS using NVIDIA NIM and Amazon EKS! This step-by-step guide shows you how to create a GPU cluster for inference. Don’t miss part 1 of this two-part blog series!
Gang scheduling pods on Amazon EKS using AWS Batch multi-node processing jobs
AWS Batch multi-node parallel jobs can now run on Amazon EKS, providing gang scheduling of pods across nodes for large-scale distributed computing like ML model training.
Large scale training with NVIDIA NeMo Megatron on AWS ParallelCluster using P5 instances
Launching distributed GPT training? See how AWS ParallelCluster sets up a fast shared filesystem, SSH keys, host files, and more between nodes. Our guide has the details for creating a Slurm-managed cluster to train NeMo Megatron at scale.
Enhancing ML workflows with AWS ParallelCluster and Amazon EC2 Capacity Blocks for ML
No more guessing whether GPU capacity will be available when you launch ML jobs! EC2 Capacity Blocks for ML let you reserve GPUs in advance so you can start tasks on time. Learn how to integrate Capacity Blocks into AWS ParallelCluster to optimize your workflow in our latest technical blog post.
How computer vision is enabling a circular economy
In this post, we show how Reezocar uses computer vision to change the way they detect damage and price used vehicles for resale in secondary markets. This reduces landfill and helps achieve the goals of the circular economy.