AWS Machine Learning Blog

Category: HAQM Polly

Video auto-dubbing using HAQM Translate, HAQM Bedrock, and HAQM Polly

This post is co-written with MagellanTV and Mission Cloud.  Video dubbing, or content localization, is the process of replacing the original spoken language in a video with another language while synchronizing audio and video. Video dubbing has emerged as a key tool in breaking down linguistic barriers, enhancing viewer engagement, and expanding market reach. However, […]

Highlight text as it’s being spoken using HAQM Polly

HAQM Polly is a service that turns text into lifelike speech. It enables the development of a whole class of applications that can convert text into speech in multiple languages. This service can be used by chatbots, audio books, and other text-to-speech applications in conjunction with other AWS AI or machine learning (ML) services. For […]

Create powerful self-service experiences with HAQM Lex on Talkdesk CX Cloud contact center

This blog post is co-written with Bruno Mateus, Jonathan Diedrich and Crispim Tribuna at Talkdesk. Contact centers are using artificial intelligence (AI) and natural language processing (NLP) technologies to build a personalized customer experience and deliver effective self-service support through conversational bots. This is the first of a two-part series dedicated to the integration of […]

Read webpages and highlight content using HAQM Polly

Read webpages and highlight content using HAQM Polly

In this post, we demonstrate how to use HAQM Polly—a leading cloud service that converts text into lifelike speech—to read the content of a webpage and highlight the content as it’s being read. Adding audio playback to a webpage improves the accessibility and visitor experience of the page. Audio-enhanced content is more impactful and memorable, […]

Localize content into multiple languages using AWS machine learning services

Over the last few years, online education platforms have seen an increase in adoption of and an uptick in demand for video-based learnings because it offers an effective medium to engage learners. To expand to international markets and address a culturally and linguistically diverse population, businesses are also looking at diversifying their learning offerings by […]

Generate synchronized closed captions and audio using the HAQM Polly subtitle generator

HAQM Polly, an AI generated text-to-speech service, enables you to automate and scale your interactive voice solutions, helping to improve productivity and reduce costs. As our customers continue to use HAQM Polly for its rich set of features and ease of use, we have observed a demand for the ability to simultaneously generate synchronized audio […]

Break through language barriers with HAQM Transcribe, HAQM Translate, and HAQM Polly

April 2024: This post was reviewed and updated for accuracy. Imagine a surgeon taking video calls with patients across the globe without the need of a human translator. What if a fledgling startup could easily expand their product across borders and into new geographical markets by offering fluid, accurate, multilingual customer support and sales, all […]

Create audio for content in multiple languages with the same TTS voice persona in HAQM Polly

HAQM Polly is a leading cloud-based service that converts text into lifelike speech. Following the adoption of Neural Text-to-Speech (NTTS), we have continuously expanded our portfolio of available voices in order to provide a wide selection of distinct speakers in supported languages. Today, we are pleased to announce four new additions: Pedro speaking US Spanish, […]