
upgrade to latest cartesia 1.0.3 #587

Merged: 6 commits into vocodedev:main on Jul 3, 2024

Conversation

@rjheeta (Contributor) commented Jun 27, 2024

Upgrade to latest Cartesia

@ajar98 (Contributor) left a comment

happy to get this in without the streaming update as well! though this will make the synthesizer much faster :)

pyproject.toml Outdated
@@ -43,7 +43,7 @@ groq = { version = "^0.9.0", optional = true }
 # Synthesizers
 google-cloud-texttospeech = { version = "^2.16.3", optional = true }
 pvkoala = { version = "^2.0.1", optional = true }
-cartesia = { version = "^0.1.1", optional = true }
+cartesia = "^1.0.3"
@ajar98 (Contributor) commented:

can this go back to being optional?



async def create_speech_uncached(
@ajar98 (Contributor) commented:

if interested, would love to support you making this properly streaming!!

would be something like:

async def chunk_generator(sse):
    async for chunk in sse:
        yield SynthesisResult.ChunkResult(data=chunk)

return SynthesisResult(
    chunk_generator(generator),
    lambda seconds: self.get_message_cutoff_from_voice_speed(message, seconds, 150),
)

you would have to also make the synthesizer support mulaw properly (which I think they fixed recently!)
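The pattern in the suggestion above can be exercised standalone. `ChunkResult` and `fake_sse` below are stand-ins for the real vocode `SynthesisResult.ChunkResult` class and the Cartesia SSE generator, which are not reproduced here; the point is the shape of the async-generator wrapper:

```python
import asyncio
from dataclasses import dataclass


# Stand-in for vocode's SynthesisResult.ChunkResult (the real class has more fields).
@dataclass
class ChunkResult:
    chunk: bytes


async def fake_sse():
    # Stand-in for the Cartesia SSE stream yielding raw audio bytes.
    for piece in (b"\x00\x01", b"\x02\x03", b"\x04"):
        yield piece


async def chunk_generator(sse):
    # Wrap each raw chunk as it arrives, so playback can begin before
    # synthesis finishes -- the point of the streaming upgrade.
    async for chunk in sse:
        yield ChunkResult(chunk=chunk)


async def main():
    return [c.chunk async for c in chunk_generator(fake_sse())]


chunks = asyncio.run(main())
print(chunks)  # [b'\x00\x01', b'\x02\x03', b'\x04']
```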

@ajar98 ajar98 merged commit 949711f into vocodedev:main Jul 3, 2024
4 checks passed
ajar98 added a commit that referenced this pull request Jul 5, 2024
* [DOW-118] set up code linting and tests (#589)

* adds github workflow

* run black

* run isort

* adds precommit

* adds vscode settings

* adds pre-commit guidelines (#590)

* creates docker image, updates telephony app deps (#601)

* [DOW-105] refactor interruptions into the output device (#586)

* [DOW-105] refactor interruptions into the output device (#562)

* initial refactor works

* remove notion of UtteranceAudioChunk and put all of the state in the callback

* move per_chunk_allowance_seconds into output device

* onboard onto vonage

* rename to abstract output device and onboard other output devices

* initial work to onboard twilio output device

* twilio conversation works

* some cleanup with better comments

* unset poetry.lock

* move abstract play method into ratelimitoutputdevice + dispatch to thread in fileoutputdevice

* rename back to AsyncWorker

* comments

* work through a bit of mypy

* asyncio.gather is g2g:

* create interrupt lock

* remove todo

* remove last todo

* remove log for interrupts

* fmt

* fix mypy

* fix mypy

* isort

* creates first test and adds scaffolding

* adds two other send_speech_to_output tests

* make send_speech_to_output more efficient

* adds tests for rate limit interruptions output device

* makes some variables private and also makes the chunk id coming back from the mark match the incoming audio chunk

* adds twilio output device tests

* make typing better for output devices

* fix mypy

* resolve PR comments

* resolve PR comments

* [DOW-101] LiveKit integration (#591)

* checkpoint

* livekit v0

* in progress changes

* integrate with worker

* fix import

* update deps and remove unneeded files

* integrate it properly into app

* fix interrupts

* make transcript publish work

* a confounding fix

* isort

* constants, some cleanup

---------

Co-authored-by: Kian <[email protected]>

* upgrade to latest cartesia 1.0.3 (#587)

* upgrade to latest cartesia 1.0.3

* fixed linting conflict

* finish streaming

* make cartesia optional

---------

Co-authored-by: Ajay Raj <[email protected]>

* poetry version prerelease (#602)

* feat: Add ability to configure OpenAI base URL in ChatGPTAgentConfig (#577)

* feat: Add ability to configure OpenAI base URL in ChatGPTAgentConfig

- Added `base_url` parameter to `ChatGPTAgentConfig` to allow customization of the OpenAI API base URL.
- Updated `instantiate_openai_client` function to use the `base_url` parameter from the configuration.
- Modified `ChatGPTAgent` to utilize the updated `instantiate_openai_client` function.
- Added tests to verify the new `base_url` functionality in `tests/streaming/agent/test_base_agent.py`.

This enhancement allows users to specify a custom OpenAI API base URL, providing greater flexibility in agent configuration.
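The change described above can be sketched in isolation. The config class below is a simplified stand-in for the real `ChatGPTAgentConfig`, and `instantiate_openai_client_kwargs` is a hypothetical helper standing in for the actual `instantiate_openai_client` function; only the base_url passthrough behavior is illustrated:

```python
from dataclasses import dataclass
from typing import Optional


# Simplified stand-in for ChatGPTAgentConfig (the real class has many more fields).
@dataclass
class ChatGPTAgentConfig:
    api_key: str
    base_url: Optional[str] = None


def instantiate_openai_client_kwargs(config: ChatGPTAgentConfig) -> dict:
    # Pass base_url through to the client only when it is set;
    # otherwise the client falls back to the default OpenAI endpoint.
    kwargs = {"api_key": config.api_key}
    if config.base_url is not None:
        kwargs["base_url"] = config.base_url
    return kwargs


# Point the agent at an OpenAI-compatible server, e.g. a local llama endpoint.
cfg = ChatGPTAgentConfig(api_key="sk-local", base_url="http://localhost:8000/v1")
print(instantiate_openai_client_kwargs(cfg))
```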

* adding capability to use the openai compatible endpoint with token estimation for llama

* lint fix

* changing openai base_url parameter for overall less code changes

* missed logging update

* Update vocode/streaming/agent/chat_gpt_agent.py

* Update tests/streaming/agent/test_base_agent.py

* fix test

---------

Co-authored-by: Ajay Raj <[email protected]>

* Support passthrough of AsyncHTTPTransport (#603)

Support passthrough of AsyncHTTPTransport object
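The passthrough shape this commit describes can be sketched with stand-ins. In the real code the transport would be an `httpx.AsyncHTTPTransport` and the client an `httpx.AsyncClient`; the minimal classes below are illustrations of the pattern only:

```python
from typing import Optional


class AsyncHTTPTransport:
    # Stand-in for httpx.AsyncHTTPTransport; just carries one setting.
    def __init__(self, retries: int = 0):
        self.retries = retries


class AsyncClient:
    # Accept a caller-supplied transport and pass it through untouched,
    # falling back to a default transport when none is given.
    def __init__(self, transport: Optional[AsyncHTTPTransport] = None):
        self.transport = transport or AsyncHTTPTransport()


# A caller can now inject a preconfigured transport (e.g. with retries).
client = AsyncClient(transport=AsyncHTTPTransport(retries=3))
print(client.transport.retries)  # 3
```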

* add script used to make PR

* adds test target for vocodehq-public

* Remove catch-all exception logger for asyncio tasks (#605)

* remove error log from exception for asyncio tasks

* remove log error on chatgpt query

---------

Co-authored-by: Kian <[email protected]>
Co-authored-by: rjheeta <[email protected]>
Co-authored-by: Clay Elmore <[email protected]>
Co-authored-by: vocode-petern <[email protected]>
Co-authored-by: Adnaan Sachidanandan <[email protected]>