Posted On: Mar 17, 2022

The HAQM Chime SDK lets developers add real-time audio, video, screen-sharing, and messaging capabilities to their web or mobile applications. HAQM Polly is a service that turns text into lifelike speech. Starting today, the HAQM Chime SDK supports native integration with HAQM Polly, making it easy for builders to create applications that turn text and numerical data into lifelike speech and automatically play the output to a caller.

This integration streamlines the development of voice self-service prompts because it eliminates the costly and time-consuming dependency on professional voice recordings. For example, you could build a voice order status application that prompts a caller to enter their order number, retrieves the order information from an external database, and then speaks the order ship date back to the caller. You can use HAQM Polly for calls to or from the public telephone network and for calls with on-premises telephone equipment using the Session Initiation Protocol (SIP).
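As a rough illustration, the order status flow described above could be sketched as a Python handler for an AWS Lambda function attached to a SIP media application. The action names (`SpeakAndGetDigits`, `Speak`), the event fields (`InvocationEventType`, `CallDetails`, `ActionData`, `ReceivedDigits`), and the response shape are assumptions based on the PSTN audio action format and should be checked against the HAQM Chime SDK documentation. The `ORDERS` dictionary is a hypothetical stand-in for the external database.

```python
from datetime import date

# Hypothetical stand-in for the external order database.
ORDERS = {"12345": date(2022, 3, 21)}


def prompt_for_order_number(call_id):
    """Build an action that speaks a prompt and collects DTMF digits (assumed shape)."""
    return {
        "Type": "SpeakAndGetDigits",
        "Parameters": {
            "CallId": call_id,
            "MinNumberOfDigits": 5,
            "MaxNumberOfDigits": 5,
            "SpeechParameters": {
                "Text": "Please enter your five digit order number."
            },
            "FailureSpeechParameters": {
                "Text": "Sorry, I did not get that. Please try again."
            },
        },
    }


def speak(call_id, text):
    """Build an action that speaks plain text to the caller (assumed shape)."""
    return {"Type": "Speak", "Parameters": {"CallId": call_id, "Text": text}}


def handler(event, context=None):
    """Return the next actions for the call based on the invocation event."""
    call_id = event["CallDetails"]["Participants"][0]["CallId"]
    event_type = event["InvocationEventType"]
    action_data = event.get("ActionData", {})

    if event_type == "NEW_INBOUND_CALL":
        # First invocation: ask the caller for the order number.
        actions = [prompt_for_order_number(call_id)]
    elif (event_type == "ACTION_SUCCESSFUL"
          and action_data.get("Type") == "SpeakAndGetDigits"):
        # Digits were collected: look up the order and speak the ship date.
        ship_date = ORDERS.get(action_data["ReceivedDigits"])
        if ship_date is None:
            text = "Sorry, we could not find that order."
        else:
            text = f"Your order ships on {ship_date:%B} {ship_date.day}."
        actions = [speak(call_id, text)]
    else:
        # Any other event: return no further actions.
        actions = []

    return {"SchemaVersion": "1.0", "Actions": actions}
```

Keeping the handler a pure function of the incoming event makes it easy to exercise locally with hand-built events before wiring it to a real phone number.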

This integration supports all available languages and voices in HAQM Polly. You can supply plain text, or use Speech Synthesis Markup Language (SSML) to enrich the generated speech by adding pauses, emphasizing certain words, or changing the speaking style. In addition, to help fine-tune the caller experience, developers can choose between the standard and neural text-to-speech (NTTS) engines, with the neural engine offering improved speech quality.
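To make the SSML and engine options concrete, here is a minimal Python sketch that builds a `Speak` action whose text is SSML with a pause and an emphasized phrase. The parameter names (`Text`, `TextType`, `Engine`, `LanguageCode`, `VoiceId`) are assumptions based on the PSTN audio action format and should be verified against the HAQM Chime SDK documentation. The helper name `build_ssml_speak_action` is hypothetical.

```python
from xml.sax.saxutils import escape

# Engine names as described in the announcement: standard or neural.
VALID_ENGINES = ("standard", "neural")


def build_ssml_speak_action(call_id, ship_date_text,
                            engine="standard", voice_id="Joanna"):
    """Build a Speak action (assumed shape) that carries SSML instead of plain text."""
    if engine not in VALID_ENGINES:
        raise ValueError(f"engine must be one of {VALID_ENGINES}")

    # Escape caller-facing text so characters like & or < do not break the SSML.
    ssml = (
        "<speak>Your order ships on "
        '<break time="300ms"/>'
        f'<emphasis level="strong">{escape(ship_date_text)}</emphasis>.'
        "</speak>"
    )
    return {
        "Type": "Speak",
        "Parameters": {
            "CallId": call_id,
            "Text": ssml,
            "TextType": "ssml",
            "Engine": engine,
            "LanguageCode": "en-US",
            "VoiceId": voice_id,
        },
    }
```

The sketch defaults to the standard engine because HAQM Polly neural voices support only a subset of SSML tags, and `<emphasis>` is not among them. When switching to `engine="neural"`, check each tag against the HAQM Polly Developer Guide.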

HAQM Polly for HAQM Chime SDK is available in the US East (N. Virginia) and US West (Oregon) AWS Regions. There are no additional HAQM Chime SDK charges to use HAQM Polly. Regular HAQM Polly pricing applies.

To get started with the HAQM Chime SDK integration with HAQM Polly, please refer to the HAQM Chime SDK documentation and the HAQM Polly Developer Guide.