AWS Machine Learning Blog
Category: Intermediate (200)
Tailoring foundation models for your business needs: A comprehensive guide to RAG, fine-tuning, and hybrid approaches
In this post, we show you how to implement and evaluate three powerful techniques for tailoring FMs to your business needs: RAG, fine-tuning, and a hybrid approach combining both methods. We provid ready-to-use code to help you experiment with these approaches and make informed decisions based on your specific use case and dataset.
Optimize query responses with user feedback using HAQM Bedrock embedding and few-shot prompting
This post demonstrates how HAQM Bedrock, combined with a user feedback dataset and few-shot prompting, can refine responses for higher user satisfaction. By using HAQM Titan Text Embeddings v2, we demonstrate a statistically significant improvement in response quality, making it a valuable tool for applications seeking accurate and personalized responses.
Boosting team productivity with HAQM Q Business Microsoft 365 integrations for Microsoft 365 Outlook and Word
HAQM Q Business integration with Microsoft 365 applications offers powerful AI assistance directly within the tools that your team already uses daily. In this post, we explore how these integrations for Outlook and Word can transform your workflow.
Integrate HAQM Bedrock Agents with Slack
In this post, we present a solution to incorporate HAQM Bedrock Agents in your Slack workspace. We guide you through configuring a Slack workspace, deploying integration components in HAQM Web Services, and using this solution.
Customize DeepSeek-R1 671b model using HAQM SageMaker HyperPod recipes – Part 2
In this post, we use the recipes to fine-tune the original DeepSeek-R1 671b parameter model. We demonstrate this through the step-by-step implementation of these recipes using both SageMaker training jobs and SageMaker HyperPod.
Build an intelligent community agent to revolutionize IT support with HAQM Q Business
In this post, we demonstrate how your organization can reduce the end-to-end burden of resolving regular challenges experienced by your IT support teams—from understanding errors and reviewing diagnoses, remediation steps, and relevant documentation, to opening external support tickets using common third-party services such as Jira.
Build a gen AI–powered financial assistant with HAQM Bedrock multi-agent collaboration
This post explores a financial assistant system that specializes in three key tasks: portfolio creation, company research, and communication. This post aims to illustrate the use of multiple specialized agents within the HAQM Bedrock multi-agent collaboration capability, with particular emphasis on their application in financial analysis.
Get faster and actionable AWS Trusted Advisor insights to make data-driven decisions using HAQM Q Business
In this post, we show how to create an application using HAQM Q Business with Jira integration that used a dataset containing a Trusted Advisor detailed report. This solution demonstrates how to use new generative AI services like HAQM Q Business to get data insights faster and make them actionable.
Combine keyword and semantic search for text and images using HAQM Bedrock and HAQM OpenSearch Service
In this post, we walk you through how to build a hybrid search solution using OpenSearch Service powered by multimodal embeddings from the HAQM Titan Multimodal Embeddings G1 model through HAQM Bedrock. This solution demonstrates how you can enable users to submit both text and images as queries to retrieve relevant results from a sample retail image dataset.
Accuracy evaluation framework for HAQM Q Business – Part 2
In the first post of this series, we introduced a comprehensive evaluation framework for HAQM Q Business, a fully managed Retrieval Augmented Generation (RAG) solution that uses your company’s proprietary data without the complexity of managing large language models (LLMs). The first post focused on selecting appropriate use cases, preparing data, and implementing metrics to […]