SSML
Speech Synthesis Markup Language (SSML) provides a standard way to mark up text for the generation of synthesized speech. It supports 27 languages and more than 40 voices, and allows developers to control pronunciation, pitch, and volume.
For more information on SSML, see Getting Started with SSML.
Speak elements with SSML can be nested inside Gather XML element tags.
Examples
Basic SSML Example
XML Response
<Response>
<Speak voice="Polly.Amy">
<prosody rate="medium">
Hello and welcome to the Vobiz text-to-speech engine.
<break/>
<break/>
We're now testing the
<say-as interpret-as="spell-out">SSML</say-as>
feature.
</prosody>
</Speak>
</Response>SSML with Gather
XML Response
<Response>
<Gather numDigits="1" playBeep="true">
<Speak voice="Polly.Salli">
<prosody rate="fast">
Please press 1 to proceed.
<break/>
<break/>
We're now testing the
<say-as interpret-as="spell-out">SSML</say-as>
feature.
</prosody>
</Speak>
</Gather>
</Response>