Update vocodehq-public #631

ajar98 · 2024-07-12T21:54:30Z

This PR updates the vocodehq-public branch by merging changes from the main branch.

* [Bug-628] correct coding errors in the google synthesiser * create_speech --> create_speech_uncached --------- Co-authored-by: Ajay Raj <[email protected]>

* make terminate async * creates audio pipeline abstraction * fix streaming conversation api * make terminate() invocations in tests async * removes the vector_db.tear_down() call in streaming conversation

vocode-petern

Some questions for my own learning/understanding but since all of these changes have already been approved previously, it should be fine to merge

vocode-petern · 2024-07-12T22:06:57Z

vocode/streaming/synthesizer/google_synthesizer.py

@@ -56,7 +56,7 @@ def synthesize(self, message: str) -> Any:
        )

    # TODO: make this nonblocking, see speech.TextToSpeechAsyncClient
-    async def create_speech(
+    async def create_speech_uncached(


Was this just broken before? Looks like it should have been create_speech_uncached all along

yep, it was just broken :/

vocode-petern · 2024-07-12T22:07:09Z

vocode/streaming/synthesizer/google_synthesizer.py

@@ -75,7 +75,7 @@ async def create_speech(
        in_memory_wav.setnchannels(1)
        in_memory_wav.setsampwidth(2)
        in_memory_wav.setframerate(output_sample_rate)
-        in_memory_wav.writeframes(response.audio_content)
+        in_memory_wav.writeframes(response.audio_content[44:])


What's the significance of 44? I'm guessing there's some start sequence that's always prepended to the audio content? Is the audio content guaranteed to have a length of at least 44?

44 is the size of the wave header, it shouldn't be a magic number really, but is okay for now

jstahlbaum-fibernetics and others added 2 commits July 12, 2024 14:20

[Bug #628] correct coding errors in the google synthesiser (#629)

ad1adc8

* [Bug-628] correct coding errors in the google synthesiser * create_speech --> create_speech_uncached --------- Co-authored-by: Ajay Raj <[email protected]>

[DOW-119] creates AudioPipeline abstraction (#625)

4c196e1

* make terminate async * creates audio pipeline abstraction * fix streaming conversation api * make terminate() invocations in tests async * removes the vector_db.tear_down() call in streaming conversation

ajar98 requested a review from vocode-petern July 12, 2024 21:57

vocode-petern approved these changes Jul 12, 2024

View reviewed changes

ajar98 merged commit 2eb0115 into vocodehq-public Jul 12, 2024
8 checks passed

ajar98 mentioned this pull request Jul 12, 2024

Update vocodehq-public #632

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update vocodehq-public #631

Update vocodehq-public #631

ajar98 commented Jul 12, 2024

vocode-petern left a comment

vocode-petern Jul 12, 2024

ajar98 Jul 12, 2024

vocode-petern Jul 12, 2024

ajar98 Jul 12, 2024

Update vocodehq-public #631

Update vocodehq-public #631

Conversation

ajar98 commented Jul 12, 2024

vocode-petern left a comment

Choose a reason for hiding this comment

vocode-petern Jul 12, 2024

Choose a reason for hiding this comment

ajar98 Jul 12, 2024

Choose a reason for hiding this comment

vocode-petern Jul 12, 2024

Choose a reason for hiding this comment

ajar98 Jul 12, 2024

Choose a reason for hiding this comment