The Speech Settings allow you to customize how AI-generated speech sounds, ensuring it is natural, clear, and suited to the context or the audience. You can adjust latency, voice style, speed, and similarity to match your desired output.

Speech Settings

Optimize Streaming Latency

Adjust this setting to optimize the delay between generating and playing the speech.
Lower latency is useful for real-time interactions.

Stability

Controls the voice style of the agent, from natural to steady.
A natural voice feels more human, while a steady voice is more monotone but consistent.

Speed

Adjust the speech speed of the agent.

Ex : For older listeners or slower-paced audiences, reduce the speed to make speech easier to understand.

Similarity

Determines how closely the AI voice matches the original voice.
Recommended: High for accurate representation.

🎉 By customizing these Speech Settings, you can ensure that AI-generated voice interactions are clear and easy to follow.