-
Notifications
You must be signed in to change notification settings - Fork 493
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update vocodehq-public #631
Update vocodehq-public #631
Conversation
* [Bug-628] correct coding errors in the google synthesiser * create_speech --> create_speech_uncached --------- Co-authored-by: Ajay Raj <[email protected]>
* make terminate async * creates audio pipeline abstraction * fix streaming conversation api * make terminate() invocations in tests async * removes the vector_db.tear_down() call in streaming conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some questions for my own learning/understanding but since all of these changes have already been approved previously, it should be fine to merge
@@ -56,7 +56,7 @@ def synthesize(self, message: str) -> Any: | |||
) | |||
|
|||
# TODO: make this nonblocking, see speech.TextToSpeechAsyncClient | |||
async def create_speech( | |||
async def create_speech_uncached( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Was this just broken before? Looks like it should have been create_speech_uncached all along
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yep, it was just broken :/
@@ -75,7 +75,7 @@ async def create_speech( | |||
in_memory_wav.setnchannels(1) | |||
in_memory_wav.setsampwidth(2) | |||
in_memory_wav.setframerate(output_sample_rate) | |||
in_memory_wav.writeframes(response.audio_content) | |||
in_memory_wav.writeframes(response.audio_content[44:]) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What's the significance of 44? I'm guessing there's some start sequence that's always prepended to the audio content? Is the audio content guaranteed to have a length of at least 44?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
44 is the size of the wave header, it shouldn't be a magic number really, but is okay for now
This PR updates the vocodehq-public branch by merging changes from the main branch.