AWS Machine Learning Blog

Category: HAQM Titan

Visualisation of text and image embeddings

Implement serverless semantic search of image and live video with HAQM Titan Multimodal Embeddings

In today’s data-driven world, industries across various sectors are accumulating massive amounts of video data through cameras installed in their warehouses, clinics, roads, metro stations, stores, factories, or even private facilities. This video data holds immense potential for analysis and monitoring of incidents that may occur in these locations. From fire hazards to broken equipment, […]

Personalized image search weighted score

Enhance image search experiences with HAQM Personalize, HAQM OpenSearch Service, and HAQM Titan Multimodal Embeddings in HAQM Bedrock

A variety of different techniques have been used for returning images relevant to search queries. Historically, the idea of creating a joint embedding space to facilitate image captioning or text-to-image search has been of interest to machine learning (ML) practitioners and businesses for quite a while. Contrastive Language–Image Pre-training (CLIP) and Bootstrapping Language-Image Pre-training (BLIP) […]

Cost-effective document classification using the HAQM Titan Multimodal Embeddings Model

Organizations across industries want to categorize and extract insights from high volumes of documents of different formats. Manually processing these documents to classify and extract information remains expensive, error prone, and difficult to scale. Advances in generative artificial intelligence (AI) have given rise to intelligent document processing (IDP) solutions that can automate the document classification, […]

Build a contextual text and image search engine for product recommendations using HAQM Bedrock and HAQM OpenSearch Serverless

In this post, we show how to build a contextual text and image search engine for product recommendations using the HAQM Titan Multimodal Embeddings model, available in HAQM Bedrock, with HAQM OpenSearch Serverless.

Automate the process to change image backgrounds using HAQM Bedrock and AWS Step Functions

Many customers, including those in creative advertising, media and entertainment, ecommerce, and fashion, often need to change the background in a large number of images. Typically, this involves manually editing each image with photo software. This can take a lot of effort, especially for large batches of images. However, HAQM Bedrock and AWS Step Functions […]

Use HAQM Titan models for image generation, editing, and searching

HAQM Bedrock provides a broad range of high-performing foundation models from HAQM and other leading AI companies, including Anthropic, AI21, Meta, Cohere, and Stability AI, and covers a wide range of use cases, including text and image generation, searching, chat, reasoning and acting agents, and more. The new HAQM Titan Image Generator model allows content […]

Talk to your slide deck using multimodal foundation models hosted on HAQM Bedrock and HAQM SageMaker – Part 1

With the advent of generative AI, today’s foundation models (FMs), such as the large language models (LLMs) Claude 2 and Llama 2, can perform a range of generative tasks such as question answering, summarization, and content creation on text data. However, real-world data exists in multiple modalities, such as text, images, video, and audio. Take […]