HAQM Nova Sonic

State-of-the-art speech-to-speech model for conversational AI

What is HAQM Nova Sonic?

HAQM Nova Sonic is a state-of-the-art speech-to-speech model that delivers real-time, human-like voice conversations with industry-leading price performance and low latency. Available in HAQM Bedrock via the bidirectional streaming API, the model understands streaming speech in various speaking styles and generates expressive speech responses that dynamically adapt to the prosody of input speech.

HAQM Nova Sonic supports expressive voices, both masculine-sounding and feminine-sounding, in several English accents, including American and British. The model can be used across a wide range of applications, including customer support call automation, outbound marketing, voice-enabled personal assistants and agents, and interactive education and language learning.

Key capabilities

HAQM Nova Sonic delivers industry-leading speed and price performance.

HAQM Nova Sonic enables knowledge grounding with enterprise data using Retrieval-Augmented Generation (RAG).

HAQM Nova Sonic supports function calling, enabling seamless interaction with external services and efficient agentic task automation.

HAQM Nova Sonic is accessed through the bidirectional streaming API in HAQM Bedrock. This API enables two-way streaming of content, which is critical for low-latency, interactive communication between a human user and the AI model.

HAQM Nova Sonic provides built-in protections, including content moderation and watermarking.
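
To make the two-way streaming and function calling ideas above more concrete, here is a minimal Python sketch. It does not call HAQM Bedrock: the model is replaced by a local stand-in coroutine, and every event name (audio_in, end_of_turn, tool_request and so on), the get_order_status tool, and the queue-based transport are hypothetical choices made for illustration only. What it tries to show is the shape of the interaction: the client keeps sending audio chunks while responses flow back on a separate channel, and a tool request from the model is answered by the client in the same session.

```python
import asyncio


def get_order_status(order_id: str) -> dict:
    """Hypothetical external service the model may ask the client to call."""
    return {"order_id": order_id, "status": "shipped"}


# Hypothetical tool registry: maps a tool name to a local Python function.
TOOLS = {"get_order_status": get_order_status}


async def stand_in_model(to_model: asyncio.Queue, from_model: asyncio.Queue) -> None:
    """Local stand-in for the model side of the stream (not the real service)."""
    while True:
        event = await to_model.get()
        kind = event["type"]
        if kind == "audio_in":
            # Reply per chunk, without waiting for the whole utterance.
            await from_model.put({"type": "audio_out", "size": len(event["data"])})
        elif kind == "end_of_turn":
            # Ask the client to run a tool on the model's behalf.
            await from_model.put({
                "type": "tool_request",
                "name": "get_order_status",
                "args": {"order_id": "A1"},
            })
        elif kind == "tool_result":
            status = event["result"]["status"]
            await from_model.put({"type": "text", "text": f"Order status: {status}"})
            await from_model.put({"type": "done"})
            return


async def send_audio(to_model: asyncio.Queue, chunks: list) -> None:
    """Client upstream: push audio chunks, then mark the end of the turn."""
    for chunk in chunks:
        await to_model.put({"type": "audio_in", "data": chunk})
        await asyncio.sleep(0)  # yield so the other tasks can run in between
    await to_model.put({"type": "end_of_turn"})


async def receive_events(to_model: asyncio.Queue, from_model: asyncio.Queue) -> list:
    """Client downstream: log events and answer tool requests."""
    log = []
    while True:
        event = await from_model.get()
        log.append(event)
        if event["type"] == "tool_request":
            result = TOOLS[event["name"]](**event["args"])
            await to_model.put({"type": "tool_result", "result": result})
        elif event["type"] == "done":
            return log


async def run_session(chunks: list) -> list:
    to_model: asyncio.Queue = asyncio.Queue()
    from_model: asyncio.Queue = asyncio.Queue()
    # Sending and receiving run concurrently, which is the point of two-way streaming.
    _, _, log = await asyncio.gather(
        stand_in_model(to_model, from_model),
        send_audio(to_model, chunks),
        receive_events(to_model, from_model),
    )
    return log


def demo() -> list:
    # Two silent 320-byte chunks stand in for microphone audio.
    return asyncio.run(run_session([b"\x00" * 320, b"\x00" * 320]))


if __name__ == "__main__":
    for event in demo():
        print(event)
```

In a real integration the two queues would be replaced by the input and output sides of the bidirectional stream, and the event formats would follow the service documentation rather than the names assumed here.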

See HAQM Nova Sonic in action

Language learning for non-native speakers

Voice-enabled business assistant

Customer service call automation

Discover real-world use cases

Getting Started with HAQM Nova Sonic

This video provides a step-by-step tutorial on how to use HAQM Nova Sonic in HAQM Bedrock to build your own voice-enabled bot.