Commit Graph

63 Commits

Author SHA1 Message Date
Hoan Luu Huu
c945a2de5a verbio stt/tts mods (#65)
* verbio tts mod

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* mod_verbio_transcribe

* wip

* wip

* wip

* wip

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* verbio stt

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-05-29 08:05:54 -04:00
Hoan Luu Huu
466dec7a6f mod_custom_tts (#53)
* mod_custom_tts

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* add custom_vendor to mod_dub

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-05-28 19:27:17 -04:00
Hoan Luu Huu
097ac337cc mod_vad_detect (#69)
* mod_vad_detect

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* mod_vad_detect

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-05-28 10:32:39 -04:00
Hoan Luu Huu
45ecf151c3 Delete tts tmp file if playback_start event has not been sent (#68)
* Delete tts tmp file if playback_start event has not been sent

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* Delete tts tmp file if playback_start event has not been sent

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-05-26 09:29:05 -04:00
Hoan Luu Huu
d1d0f5dd77 jb_transcribe correctly unlock mutex before cleanup (#66)
* jb_transcribe correctly unlock mutex before cleanup

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>
2024-05-23 07:48:02 -04:00
Dave Horton
e4a27ae133 minor logging
Signed-off-by: Dave Horton <daveh@beachdognet.com>
1.2.15
2024-05-14 06:59:11 -04:00
Hoan Luu Huu
19f20bf0e7 Fix/azure white noise (#63)
* azure return odd bytes audio, that azure tts mode generate white noise

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-05-14 06:55:44 -04:00
Hoan Luu Huu
b019a634bd Fix playht timeout while downloading audio (#61)
Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>
2024-05-10 12:18:32 -04:00
Dave Horton
f0b304b8a1 session mutex not properly unlocked causing stuck channels (#60)
Signed-off-by: Dave Horton <daveh@beachdognet.com>
1.2.14
2024-05-02 10:35:54 -04:00
Hoan Luu Huu
be3714465b support aws sessionToken (#58)
* support aws sessionToken

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* fix code style

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
1.2.12
2024-05-02 08:28:01 -04:00
Hoan Luu Huu
b495dba126 reset response when cached speech interface feeded new text (#59)
Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-05-02 07:24:57 -04:00
Hoan Luu Huu
33750b5420 mod_deepgram_transcribe keep alive connection (#56)
* mod_deepgram_transcribe keep alive connection

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* read use single connection configuration from env var

* remove deepgram media bug buffered data

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* fix review comments

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
1.2.11
2024-04-30 08:00:59 -04:00
Dave Horton
f8604656e3 minor logging 2024-04-29 18:09:23 -04:00
Hoan Luu Huu
916332d91d elevenlabs to use http 1.1 (#57)
* elevenlabs to use http 1.1

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* update the rest of mod tts

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* add env variable for disabling http2

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
1.2.10
2024-04-29 07:51:09 -04:00
Hoan Luu Huu
33fee93ec7 free maloc variable on mod_audio_fork (#54) 1.2.9 2024-04-23 09:15:11 -04:00
Hoan Luu Huu
41aebafd1c fix eleevnlabs callsession stuck (#55)
* fix eleevnlabs callsession stuck

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* fix read sample rate from session is not needed

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-04-23 09:14:43 -04:00
Andrew Golledge
bd69d476e7 Add Error Code Check to Google V1 Implementation (#51)
* Put the check for `grpc` error code 0 in the Google Speech-To-Text v1 as well.

* Distinguish between two types of error object in `grpc_read_thread`

* Improve naming of JSON field

* Correct error in JSON field name.

* Add sign-off to previous commit

Signed-off-by: Andrew Golledge <andreas.golledge@gmail.com>

---------

Signed-off-by: Andrew Golledge <andreas.golledge@gmail.com>
2024-04-22 19:20:15 -04:00
Dave Horton
fea51d5ecf initialize variables to avoid possible junk values
Signed-off-by: Dave Horton <daveh@beachdognet.com>
2024-04-22 16:06:32 -04:00
Dave Horton
83a2d1d730 modify cache folder name and various fixes from testing tts streaming (#50) 1.2.8 2024-04-18 11:28:17 -04:00
Hoan Luu Huu
3f642467eb add properties to tts span for mod_*_tts (#45)
* add properties to tts span for mod_*_tts

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* support deepgram tts span

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* support tts span for playht

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* support rimelabs tts span

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
1.2.7
2024-04-14 09:13:54 -04:00
Hoan Luu Huu
622cb14957 mod_*_tts is not able to parse http response code if the protocol is hot HTTP/2 (#44)
* fix mod_tts wrongly parsing http error code

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>
1.2.6
2024-04-13 08:51:03 -04:00
Dave Horton
1e1a51a3d1 overlapping close from previous connection after starting new one causes transcripts to stop (#43) 2024-04-12 19:02:11 -04:00
Hoan Luu Huu
4ce95e6d27 support new parameters for google v2 (#31)
* support new parameters for google v2

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* add enable_voice_activity_events

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* changes to start and end timeout

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
Co-authored-by: Dave Horton <daveh@beachdognet.com>
2024-04-12 07:24:46 -04:00
Hoan Luu Huu
f0d15c57a2 support mod_rimelabs_tts (#41)
* support mod_rimelabs_tts

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* add readme and license

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-04-12 07:23:21 -04:00
Hoan Luu Huu
2bbc18903c mod audio fork cannot parse ws:<IP>/call path (#40)
* mod audio fork cannot parse ws:<IP>/call path

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

* add comments for the regex

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

* add comments for the regex

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>
1.2.5
2024-04-08 20:03:01 -04:00
Hoan Luu Huu
4aabb36a64 add playht tts to mod_dub (#39)
Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
1.2.4
2024-04-08 10:54:49 -04:00
Hoan Luu Huu
ee488e4621 support mod_playht_tts (#38)
* support mod_playht_tts

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
1.2.3
2024-04-08 10:22:16 -04:00
Dave Horton
751d890d9b change default base dir for streaming tts cache files to /tmp/ 1.2.2 2024-04-07 12:46:53 -04:00
Dave Horton
e431c5d159 changes to mod_azure_tts for event handling, resolve session locking issue in mod_whisper (#37) 1.2.1 2024-04-07 11:39:09 -04:00
Dave Horton
a96fb2b4b2 Fixes/mod azure tts (#36)
* fixes for unlocking mutex and minimizing time under lock

* call .get() on future returned from speechSynthesizer->SpeakTextAsync

* mod_azure_tts: various fixes, including dangling session lock and reducing latency

* mod_deepgram_transcribe: add support for transcribing filler words
1.2.0
2024-04-06 12:26:56 -04:00
Hoan Luu Huu
94361f1d20 mod_dub support sayOnTrack Deepgram (#35)
* mod_dub support sayOnTrack Deepgram

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* mod_dub sayOnTrack support azure

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* support whisper

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
1.1.3
2024-04-05 07:16:31 -04:00
Hoan Luu Huu
d05cfb8ef0 remove deepgram tts linear16 wav 44 bytes header (#34)
* remove deepgram tts linear16 wav 44 bytes header

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-04-05 07:15:02 -04:00
Hoan Luu Huu
8e7f4a3fab Feat/mod deepgram tts (#33)
* deepgram tts mode

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* fix crash audio sound

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

* run with opus

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* dg-request-id check

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-04-04 15:54:22 -04:00
Dave Horton
fa4930d53e Fix/docker build (#32)
* explicit flags for boost

* remove pragma statements

* remove some logging in mod_azure_tts

* pragma back in

* compile error fixes

* mod_azure_tts: set Playback-File-Type to tts_stream
2024-04-03 13:49:05 -04:00
Hoan Luu Huu
63e66042af fix cannot build mod_audio_fork (#29)
Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
1.1.2
2024-03-31 21:08:01 -04:00
Hoan Luu Huu
eedcc03112 support streaming audio for mod_audio_fork (#28)
* support streaming audio for mod_audio_fork

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>

* fix review comments

* fix review comments

---------

Signed-off-by: Hoan HL <quan.luuhoang8@gmail.com>
2024-03-31 09:05:50 -04:00
Dave Horton
07dddd0094 fix deadlock bug 2024-03-29 19:55:16 -04:00
Andrew Golledge
14297ce3a6 Handle grpc_read_thread exit by always firing an event (#27)
* Ensure an event is triggered when the grpc_read_thread exits in all cases

* Compare correct status codes for v1 and v2

* Add sign-off to previous commit

Signed-off-by: Andrew Golledge <andreas.golledge@gmail.com>

---------

Signed-off-by: Andrew Golledge <andreas.golledge@gmail.com>
2024-03-26 09:18:56 -04:00
Dave Horton
48aeedec0a bug: fix crashing race condition in mod_dub 1.1.1 2024-03-25 18:14:54 -04:00
Hoan Luu Huu
be6758c3a8 support azure tts stream (#13)
* support azure stream

* delete trash file

* wip

* wip

* wip

* wip

* wip

* wip

* wip

* fire variable_tts_time_to_first_byte_ms
2024-03-24 09:18:38 -04:00
Dave Horton
74bfc3152f fix build errors 1.1.0 2024-03-23 20:23:13 -04:00
Dave Horton
9cdc5fdfca Feat/mod dub v2 (#22)
* support tts elevenlabs to mod_dub

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

* wip

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

* wip

* wip

* fix aws race condition when 2 start transcribes are sent at the same instant

* wip

* wip

* wip

* allow queue play on track

* wip

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

* revert change for aws transcribe

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

* fix type

* wip

* wip

* rename parameters

* rename paramters

* wip

* wip

* wip

* wip

* wip

* bug: there exists scenarios where callback is not defined

* wip

* wip

* revert unintended changes to mod_google_transcribe

* fix bugs w/ streaming tts simplified arg parsing to use freeswitch conventions

---------

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>
Co-authored-by: Quan HL <quan.luuhoang8@gmail.com>
2024-03-23 15:37:27 -04:00
Andrew Golledge
4e57f73c7e Add Support for Google Cloud Speech-To-Text V2 library in mod_google_transcribe (#23)
* Introduce Google Speech-To-Text V2 library

* Add sign-off to previous commit

Signed-off-by: Andrew Golledge <andreas.golledge@gmail.com>

---------

Signed-off-by: Andrew Golledge <andreas.golledge@gmail.com>
2024-03-23 12:02:48 -04:00
Dave Horton
4925d31f95 add -L/usr/local/lib for aws modules 1.0.11 2024-03-13 15:24:56 -04:00
Dave Horton
15c41d8884 further fix for aws transcribe race condition 1.0.10 2024-03-12 12:23:58 -04:00
Dave Horton
b606255206 add mod_dub (#16)
* add mod_dub

Signed-off-by: Dave Horton <daveh@beachdognet.com>

* remove some locks

---------

Signed-off-by: Dave Horton <daveh@beachdognet.com>
2024-03-12 09:56:49 -04:00
Hoan Luu Huu
d6ae0a6a39 fix elevenlabs and whisper for multiple codec (#14)
* fix elevenlabs and whisper for multiple codec

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

* fix review comments

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>

---------

Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>
2024-03-12 09:52:35 -04:00
Dave Horton
06ab877f68 fix aws race condition when 2 start transcribes are sent at the same instant 1.0.9 2024-03-11 13:57:15 -04:00
Hoan Luu Huu
92dd3fc854 enable azure language identification mode (#15)
Signed-off-by: Quan HL <quan.luuhoang8@gmail.com>
2024-03-08 10:12:33 -05:00
Hoan Luu Huu
b4f3a41913 whisper tts stream module (#8)
* whisper tts stream module

* add convert mp3 to linear

* add lmpg123 link

* wip

* whisper cache

* wip

* wip

* clean up

* add free mem and openai metric

* whisper: cache files as mp3 for simplicity

---------

Co-authored-by: Dave Horton <daveh@beachdognet.com>
1.0.8
2024-02-22 14:17:13 -05:00