
🌐 Llama 3.1 Fully-Local Voice Assistant - Language Teacher (Chinese and Spanish with Ollama and Edge TTS)

carsonmulligan/berto_ollama_whisper


🚖 fully local taxi-driver language assistant

πŸ“ overview

this project provides a fully local language assistant called berto. berto listens to your audio, transcribes it using whisper, and then interacts with you in spanish through an ai model served locally via ollama. it also uses text-to-speech to speak its responses aloud. 🗣️

🌟 features

  • local transcription of spoken language using whisper. 🎤
  • interaction with an ai model (llama 2 uncensored) served locally via ollama. 🦙
  • conversation options, including following up on questions related to science, history, and politics. ❓
  • text-to-speech responses using edge tts. 🔊
  • audio playback of berto's responses. 🎧

🎥 demo

demo gif

watch the demo video here

🚀 setup

1. install dependencies 📦

first, install all the required dependencies from the requirements.txt file:

pip install -r requirements.txt
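
the exact package list lives in the repo's requirements.txt; a hypothetical set covering the features above (whisper transcription, calls to the local ollama server, and edge tts) might look like this:

```
openai-whisper   # local speech-to-text
requests         # http calls to the local ollama server
edge-tts         # text-to-speech synthesis
```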

2. download and install ollama 🛠️

this project requires ollama to serve the ai model (llama2-uncensored). you can download ollama from the Ollama website and install it on your local machine.

once installed, you need to download the model:

ollama run llama2-uncensored

make sure ollama is running on localhost:11434 to handle the requests.
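
for reference, ollama exposes an http api on that port; here is a minimal sketch (not taken from the repo) of how a python script could send a prompt to llama2-uncensored through ollama's /api/generate endpoint (the prompt text is just an example):

```python
import requests

# send one prompt to the locally served model and print its reply
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama2-uncensored",
        "prompt": "Hola, ¿quién eres?",
        "stream": False,  # ask for a single json object instead of a stream
    },
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["response"])  # the model's reply text
```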

3. run the script 🎬

to run the assistant, execute the following script:

python bertosito_chat.py

this starts a conversation with berto, who transcribes your spoken audio and responds using the ai model served by ollama.

💡 how it works

  • recording audio: berto listens to your voice and transcribes it using whisper. 🎤
  • generating responses: it sends your transcribed text to the ai model and generates a response. 💬
  • text-to-speech: berto converts the generated response to speech and plays it back. 🔊
  • conversation options: the assistant presents multiple conversation options, including asking questions and following up on prior responses. 🤔 (a sketch of this loop follows below.)
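
putting the steps together, a minimal single-turn sketch might look like the following. it assumes an already-recorded input.wav and uses openai-whisper, the ollama http api, and the edge-tts package; the voice name and file paths are illustrative, and the real bertosito_chat.py may be structured differently:

```python
import asyncio

import edge_tts
import requests
import whisper


def transcribe(path: str) -> str:
    # 1. local speech-to-text with whisper (spanish)
    model = whisper.load_model("base")
    return model.transcribe(path, language="es")["text"]


def ask_berto(prompt: str) -> str:
    # 2. send the transcript to the model served by ollama
    resp = requests.post(
        "http://localhost:11434/api/generate",
        json={"model": "llama2-uncensored", "prompt": prompt, "stream": False},
        timeout=120,
    )
    return resp.json()["response"]


async def speak(text: str, out_path: str = "response.mp3") -> None:
    # 3. synthesize the reply with edge tts (spanish voice chosen as an example)
    await edge_tts.Communicate(text, voice="es-ES-AlvaroNeural").save(out_path)


if __name__ == "__main__":
    question = transcribe("input.wav")   # your recorded question
    answer = ask_berto(question)         # berto's spanish reply
    asyncio.run(speak(answer))           # writes response.mp3
    print(answer)                        # 4. play response.mp3 with any audio player
```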

📂 file structure

  • bertosito_chat.py: the main script to run berto. 🖥️
  • response.mp3 and response.wav: audio files generated during interaction. 🎶
  • requirements.txt: file containing all the necessary dependencies. 📜
  • demo.mp4: demo video showcasing the app. 🎥

πŸ“ notes

  • ensure that ollama is running locally and the required model is downloaded before starting the script. 🛠️
  • the assistant only responds in spanish and expects interactions in the same language. 🇪🇸
  • make sure to run ollama with the exact model name llama2-uncensored (a quick check is sketched below). 🦙
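
to double-check those last two notes before launching the script, a quick sketch using ollama's /api/tags endpoint (which lists the models downloaded locally) could look like this:

```python
import requests

# ask the local ollama server which models it has downloaded
tags = requests.get("http://localhost:11434/api/tags", timeout=5).json()
names = [model["name"] for model in tags.get("models", [])]

if any(name.startswith("llama2-uncensored") for name in names):
    print("ollama is running and llama2-uncensored is available")
else:
    print("model missing; run: ollama run llama2-uncensored")
```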

enjoy interacting with berto! 🎉
