AWS Machine Learning Blog
Category: HAQM Bedrock
Build an enterprise synthetic data strategy using HAQM Bedrock
In this post, we explore how to use HAQM Bedrock for synthetic data generation, considering these challenges alongside the potential benefits to develop effective strategies for various applications across multiple industries, including AI and machine learning (ML).
Multi-tenancy in RAG applications in a single HAQM Bedrock knowledge base with metadata filtering
This post demonstrates how HAQM Bedrock Knowledge Bases can help you scale your data management effectively while maintaining proper access controls on different management levels.
Effectively use prompt caching on HAQM Bedrock
Prompt caching, now generally available on HAQM Bedrock with Anthropic’s Claude 3.5 Haiku and Claude 3.7 Sonnet, along with Nova Micro, Nova Lite, and Nova Pro models, lowers response latency by up to 85% and reduces costs up to 90% by caching frequently used prompts across multiple API calls. This post provides a detailed overview of the prompt caching feature on HAQM Bedrock and offers guidance on how to effectively use this feature to achieve improved latency and cost savings.
Prompting for the best price-performance
In this blog, we discuss how to optimize prompting in HAQM Nova for the best price-performance.
Evaluate models or RAG systems using HAQM Bedrock Evaluations – Now generally available
Today, we’re excited to announce the general availability of these evaluation features in HAQM Bedrock Evaluations, along with significant enhancements that make them fully environment-agnostic. In this post, we explore these new features in detail, showing you how to evaluate both RAG systems and models with practical examples. We demonstrate how to use the comparison capabilities to benchmark different implementations and make data-driven decisions about your AI deployments.
How Lumi streamlines loan approvals with HAQM SageMaker AI
Lumi is a leading Australian fintech lender empowering small businesses with fast, flexible, and transparent funding solutions. They use real-time data and machine learning (ML) to offer customized loans that fuel sustainable growth and solve the challenges of accessing capital. This post explores how Lumi uses HAQM SageMaker AI to meet this goal, enhance their transaction processing and classification capabilities, and ultimately grow their business by providing faster processing of loan applications, more accurate credit decisions, and improved customer experience.
Shaping the future: OMRON’s data-driven journey with AWS
OMRON Corporation is a leading technology provider in industrial automation, healthcare, and electronic components. In their Shaping the Future 2030 (SF2030) strategic plan, OMRON aims to address diverse social issues, drive sustainable business growth, transform business models and capabilities, and accelerate digital transformation. At the heart of this transformation is the OMRON Data & Analytics Platform (ODAP), an innovative initiative designed to revolutionize how the company harnesses its data assets. This post explores how OMRON Europe is using HAQM Web Services (AWS) to build its advanced ODAP and its progress toward harnessing the power of generative AI.
Using Large Language Models on HAQM Bedrock for multi-step task execution
This post explores the application of LLMs in executing complex analytical queries through an API, with specific focus on HAQM Bedrock. To demonstrate this process, we present a use case where the system identifies the patient with the least number of vaccines by retrieving, grouping, and sorting data, and ultimately presenting the final result.
Introducing AWS MCP Servers for code assistants (Part 1)
We’re excited to announce the open source release of AWS MCP Servers for code assistants — a suite of specialized Model Context Protocol (MCP) servers that bring HAQM Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers. In this post, we walk through how these specialized MCP servers can dramatically reduce your development time while incorporating security controls, cost optimizations, and AWS Well-Architected best practices into your code.
Harness the power of MCP servers with HAQM Bedrock Agents
Today, MCP is providing agents standard access to an expanding list of accessible tools that you can use to accomplish a variety of tasks. In this post, we show you how to build an HAQM Bedrock agent that uses MCP to access data sources to quickly build generative AI applications.