SSML

Speech Synthesis Markup Language (SSML) is a standard way to mark up text for speech synthesis. The Vobiz text-to-speech engine supports 27 languages and more than 40 voices, and SSML lets developers control pronunciation, pitch, and volume.

For more information on SSML, see Getting Started with SSML.

Speak elements that contain SSML can be nested inside Gather elements.

Examples

Basic SSML Example

XML Response

<Response>
  <Speak voice="Polly.Amy">
    <prosody rate="medium">
      Hello and welcome to the Vobiz text-to-speech engine.
      <break/>
      <break/>
      We're now testing the
      <say-as interpret-as="spell-out">SSML</say-as>
      feature.
    </prosody>
  </Speak>
</Response>

SSML with Gather

XML Response

<Response>
  <Gather numDigits="1" playBeep="true">
    <Speak voice="Polly.Salli">
      <prosody rate="fast">
        Please press 1 to proceed.
        <break/>
        <break/>
        We're now testing the
        <say-as interpret-as="spell-out">SSML</say-as>
        feature.
      </prosody>
    </Speak>
  </Gather>
</Response>
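
Pitch and Volume Example

The examples above only adjust speaking rate. As a minimal sketch of the pitch and volume control mentioned in the introduction, the response below uses the standard SSML `pitch` and `volume` attributes of `prosody`, plus a timed `break`. These attribute values come from the W3C SSML specification; whether a given Vobiz voice honors each of them is an assumption here, so test with the voice you plan to use.

```xml
<Response>
  <Speak voice="Polly.Amy">
    <!-- Lower pitch, louder volume (standard SSML keyword values) -->
    <prosody pitch="low" volume="loud">
      This sentence is spoken at a lower pitch and a louder volume.
    </prosody>
    <!-- Explicit half-second pause instead of stacked default breaks -->
    <break time="500ms"/>
    <!-- Higher pitch, softer volume -->
    <prosody pitch="high" volume="soft">
      This sentence is spoken at a higher pitch and a softer volume.
    </prosody>
  </Speak>
</Response>
```

If a voice ignores one of these attributes, the text should still be spoken, just without that adjustment.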
