mirror of
https://github.com/jambonz/next-static-site.git
synced 2025-12-19 04:47:44 +00:00
1.8 KiB
1.8 KiB
say
The say command is used to send synthesized speech to the remote party. The text provided may be either plain text or may use SSML tags. jambonz supports a large number of speech vendors out of the box (see list below), and you may add others via the custom speech API.
{
"verb": "say",
"text": "hi there!",
"synthesizer" : {
"vendor": "google",
"language": "en-US"
}
}
You can use the following options in the say action:
| option | description | required |
|---|---|---|
| text | text to speak; may contain SSML tags | yes |
| synthesizer.vendor | speech vendor to use (see list below, along with any others you add via the custom speech API) | no |
| synthesizer.language | language code to use. | no |
| synthesizer.gender | (Google only) MALE, FEMALE, or NEUTRAL. | no |
| synthesizer.voice | voice to use. Note that the voice list differs whether you are using aws or Google. Defaults to application setting, if provided. | no |
| loop | the number of times a text is to be repeated; 0 means repeat forever. Defaults to 1. | no |
| earlyMedia | if true and the call has not yet been answered, play the audio without answering call. Defaults to false | no |
Text-to-speech vendors
jambonz natively supports the following text-to-speech services:
- aws
- azure
- deepgram
- elevenlabs
- ibm
- nuance
- nvidia
- wellsaid
- whisper
Note: Microsoft supports on-prem and private link options for deploying the speech service in addition to the hosted Microsoft service.