AWS Machine Learning Blog

Category: Advanced (300)

Use everyday language to search and retrieve data with Mixtral 8x7B on HAQM SageMaker JumpStart

With the widespread adoption of generative artificial intelligence (AI) solutions, organizations are trying to use these technologies to make their teams more productive. One exciting use case is enabling natural language interactions with relational databases. Rather than writing complex SQL queries, you can describe in plain language what data you want to retrieve or manipulate. […]

Improving Content Moderation with HAQM Rekognition Bulk Analysis and Custom Moderation

HAQM Rekognition makes it easy to add image and video analysis to your applications. It’s based on the same proven, highly scalable, deep learning technology developed by HAQM’s computer vision scientists to analyze billions of images and videos daily. It requires no machine learning (ML) expertise to use and we’re continually adding new computer vision […]

Analytics model

Understanding and predicting urban heat islands at Gramener using HAQM SageMaker geospatial capabilities

This is a guest post co-authored by Shravan Kumar and Avirat S from Gramener. Gramener, a Straive company, contributes to sustainable development by focusing on agriculture, forestry, water management, and renewable energy. By providing authorities with the tools and insights they need to make informed decisions about environmental and social impact, Gramener is playing a […]

Build a news recommender application with HAQM Personalize

With a multitude of articles, videos, audio recordings, and other media created daily across news media companies, readers of all types—individual consumers, corporate subscribers, and more—often find it difficult to find news content that is most relevant to them. Delivering personalized news and experiences to readers can help solve this problem, and create more engaging […]

Build a contextual text and image search engine for product recommendations using HAQM Bedrock and HAQM OpenSearch Serverless

In this post, we show how to build a contextual text and image search engine for product recommendations using the HAQM Titan Multimodal Embeddings model, available in HAQM Bedrock, with HAQM OpenSearch Serverless.

Gradient makes LLM benchmarking cost-effective and effortless with AWS Inferentia

This is a guest post co-written with Michael Feil at Gradient. Evaluating the performance of large language models (LLMs) is an important step of the pre-training and fine-tuning process before deployment. The faster and more frequent you’re able to validate performance, the higher the chances you’ll be able to improve the performance of the model. […]

Scale LLMs with PyTorch 2.0 FSDP on HAQM EKS – Part 2

This is a guest post co-written with Meta’s PyTorch team and is a continuation of Part 1 of this series, where we demonstrate the performance and ease of running PyTorch 2.0 on AWS. Machine learning (ML) research has proven that large language models (LLMs) trained with significantly large datasets result in better model quality. In […]

Provide live agent assistance for your chatbot users with HAQM Lex and Talkdesk cloud contact center

HAQM Lex provides advanced conversational artificial intelligence (AI) capabilities to enable self-service support for your organization’s contact center. With HAQM Lex, you can implement an omnichannel strategy where customers engage via phone, websites, and messaging platforms. The bots can answer FAQs, provide self-service experiences, or triage customer requests before transferring to a human agent. HAQM Lex integrates […]

Enhance performance of generative language models with self-consistency prompting on HAQM Bedrock

With the batch inference API, you can use HAQM Bedrock to run inference with foundation models in batches and get responses more efficiently. This post shows how to implement self-consistency prompting via batch inference on HAQM Bedrock to enhance model performance on arithmetic and multiple-choice reasoning tasks.

Federated learning on AWS using FedML, HAQM EKS, and HAQM SageMaker

This post is co-written with Chaoyang He, Al Nevarez and Salman Avestimehr from FedML. Many organizations are implementing machine learning (ML) to enhance their business decision-making through automation and the use of large distributed datasets. With increased access to data, ML has the potential to provide unparalleled business insights and opportunities. However, the sharing of […]