AWS Machine Learning Blog
Category: Intermediate (200)
Multi-account support for HAQM SageMaker HyperPod task governance
In this post, we discuss how an enterprise with multiple accounts can access a shared HAQM SageMaker HyperPod cluster for running their heterogenous workloads. We use SageMaker HyperPod task governance to enable this feature.
Contextual retrieval in Anthropic using HAQM Bedrock Knowledge Bases
Contextual retrieval enhances traditional RAG by adding chunk-specific explanatory context to each chunk before generating embeddings. This approach enriches the vector representation with relevant contextual information, enabling more accurate retrieval of semantically related content when responding to user queries. In this post, we demonstrate how to use contextual retrieval with Anthropic and HAQM Bedrock Knowledge Bases.
Impel enhances automotive dealership customer experience with fine-tuned LLMs on HAQM SageMaker
In this post, we share how Impel enhances the automotive dealership customer experience with fine-tuned LLMs on SageMaker.
Unlocking the power of Model Context Protocol (MCP) on AWS
We’ve witnessed remarkable advances in model capabilities as generative AI companies have invested in developing their offerings. Language models such as Anthropic’s Claude Opus 4 & Sonnet 4 and HAQM Nova on HAQM Bedrock can reason, write, and generate responses with increasing sophistication. But even as these models grow more powerful, they can only work […]
Streamline personalization development: How automated ML workflows accelerate HAQM Personalize implementation
This blog post presents an MLOps solution that uses AWS Cloud Development Kit (AWS CDK) and services like AWS Step Functions, HAQM EventBridge and HAQM Personalize to automate provisioning resources for data preparation, model training, deployment, and monitoring for HAQM Personalize.
Architect a mature generative AI foundation on AWS
In this post, we give an overview of a well-established generative AI foundation, dive into its components, and present an end-to-end perspective. We look at different operating models and explore how such a foundation can operate within those boundaries. Lastly, we present a maturity model that helps enterprises assess their evolution path.
Using HAQM OpenSearch ML connector APIs
OpenSearch offers a wide range of third-party machine learning (ML) connectors to support this augmentation. This post highlights two of these third-party ML connectors. The first connector we demonstrate is the HAQM Comprehend connector. In this post, we show you how to use this connector to invoke the LangDetect API to detect the languages of ingested documents. The second connector we demonstrate is the HAQM Bedrock connector to invoke the HAQM Titan Text Embeddings v2 model so that you can create embeddings from ingested documents and perform semantic search.
Bridging the gap between development and production: Seamless model lifecycle management with HAQM Bedrock
HAQM Bedrock Model Copy and Model Share features provide a powerful option for managing the lifecycle of an AI application from development to production. In this comprehensive blog post, we’ll dive deep into the Model Share and Model Copy features, exploring their functionalities, benefits, and practical applications in a typical development-to-production scenario.
Real-world applications of HAQM Nova Canvas for interior design and product photography
In this post, we explore how HAQM Nova Canvas can solve real-world business challenges through advanced image generation techniques. We focus on two specific use cases that demonstrate the power and flexibility of this technology: interior design and product photography.
Building a multimodal RAG based application using HAQM Bedrock Data Automation and HAQM Bedrock Knowledge Bases
In this post, we walk through building a full-stack application that processes multimodal content using HAQM Bedrock Data Automation, stores the extracted information in an HAQM Bedrock knowledge base, and enables natural language querying through a RAG-based Q&A interface.