AWS Machine Learning Blog

Category: Intermediate (200)

Multi-account support for HAQM SageMaker HyperPod task governance

In this post, we discuss how an enterprise with multiple accounts can access a shared HAQM SageMaker HyperPod cluster for running their heterogenous workloads. We use SageMaker HyperPod task governance to enable this feature.

Contextual retrieval in Anthropic using HAQM Bedrock Knowledge Bases

Contextual retrieval enhances traditional RAG by adding chunk-specific explanatory context to each chunk before generating embeddings. This approach enriches the vector representation with relevant contextual information, enabling more accurate retrieval of semantically related content when responding to user queries. In this post, we demonstrate how to use contextual retrieval with Anthropic and HAQM Bedrock Knowledge Bases.

Detailed MCP Bedrock architecture with intelligent query processing workflow and AWS service connections

Unlocking the power of Model Context Protocol (MCP) on AWS

We’ve witnessed remarkable advances in model capabilities as generative AI companies have invested in developing their offerings. Language models such as Anthropic’s Claude Opus 4 & Sonnet 4 and HAQM Nova on HAQM Bedrock can reason, write, and generate responses with increasing sophistication. But even as these models grow more powerful, they can only work […]

Streamline personalization development: How automated ML workflows accelerate HAQM Personalize implementation

This blog post presents an MLOps solution that uses AWS Cloud Development Kit (AWS CDK) and services like AWS Step Functions, HAQM EventBridge and HAQM Personalize to automate provisioning resources for data preparation, model training, deployment, and monitoring for HAQM Personalize.

Generative AI platform maturity stages

Architect a mature generative AI foundation on AWS

In this post, we give an overview of a well-established generative AI foundation, dive into its components, and present an end-to-end perspective. We look at different operating models and explore how such a foundation can operate within those boundaries. Lastly, we present a maturity model that helps enterprises assess their evolution path.

Using HAQM OpenSearch ML connector APIs

OpenSearch offers a wide range of third-party machine learning (ML) connectors to support this augmentation. This post highlights two of these third-party ML connectors. The first connector we demonstrate is the HAQM Comprehend connector. In this post, we show you how to use this connector to invoke the LangDetect API to detect the languages of ingested documents. The second connector we demonstrate is the HAQM Bedrock connector to invoke the HAQM Titan Text Embeddings v2 model so that you can create embeddings from ingested documents and perform semantic search.

Workflow for HAQM Bedrock Copy and Model Share.

Bridging the gap between development and production: Seamless model lifecycle management with HAQM Bedrock

HAQM Bedrock Model Copy and Model Share features provide a powerful option for managing the lifecycle of an AI application from development to production. In this comprehensive blog post, we’ll dive deep into the Model Share and Model Copy features, exploring their functionalities, benefits, and practical applications in a typical development-to-production scenario.

Real-world applications of HAQM Nova Canvas for interior design and product photography

In this post, we explore how HAQM Nova Canvas can solve real-world business challenges through advanced image generation techniques. We focus on two specific use cases that demonstrate the power and flexibility of this technology: interior design and product photography.

Architecture diagram

Building a multimodal RAG based application using HAQM Bedrock Data Automation and HAQM Bedrock Knowledge Bases

In this post, we walk through building a full-stack application that processes multimodal content using HAQM Bedrock Data Automation, stores the extracted information in an HAQM Bedrock knowledge base, and enables natural language querying through a RAG-based Q&A interface.