Azure Open AI Conversational Speech-to-Speech

Original from https://github.com/danamini/aichat with modify to take text or voice as input and getting ChatGPT voice answer from Azure TTS

Azure Open AI Conversational Speech-to-Speech is a GitHub project that enables users to interact with a conversational system using their voice. The system uses Azure Cognitive Services to recognize speech and sends the voice input to the Azure Open AI Service. The resulting text response is replayed to the user using speech synthesis, providing a speech-to-speech interface to Open AI. Azure Sentiment analysis is also performed on the Open AI response.

https://i.imgur.com/eOwYABP.png

Getting Started

To get started with this project, users will need to have access to an Azure Open AI Service and an Azure Speech Service. They should set up the corresponding keys and URLs in the config.py file and install the required dependencies, including the openai, azure.ai.textanalytics, azure.core.credentials, and termcolor packages.

As a minimum in config.py change the following values before running aichat.py:
- openai_api_base = 'https://YOUR_OPEN_AI_RESOURCE_NAME.openai.azure.com/'
- openai_api_key = 'YOUR_OPEN_AI_API_KEY'
- speech_key = 'YOUR_AZURE_SPEECH_API_KEY'
- speech_region = 'YOUR_AZURE_REGION_FOR_THE_SPEECH_RESOURCE', e.g. 'uksouth'
- speech_recognition_language = 'SPEECH_RECOGNITION_LANGUAGE', e.g. 'en-GB'
- cognitive_endpoint = "https://YOUR_AZURE_COGNATIVE_SERVICE_FOR_LANGUAGE_ENDPOINT.cognitiveservices.azure.com/"
- cognitive_key = 'YOUR_AZURE_COGNATIVE_SERVICE_FOR_LANGUAGE_KEY'

Usage

To use this code, users should set the appropriate persona name by setting the value for persona_name. This value must match an entry in the persona dictionary, which is also in the config.py file. Users can add new personal entries in the dictionary and set the persona name to test.

Dependencies

The following dependencies are required for the project to run:

Python packages:
- openai: pip3 install openai
- azure.ai.textanalytics: pip3 install azure.ai.textanalytics
- azure.core.credentials:pip3 install azure.core.credentials
- termcolor:pip3 install termcolor

Example Output

The following is an example conversation. The Azure OpenAI response is spoken via speech sythesis.

Persona:                [Marvin]
Azure OpenAI is listening. Say 'Stop', 'Exit' or press Ctrl-Z to end the conversation.
Recognized speech       [Hi.]
Azure OpenAI response   [Oh, it's you again. What do you want?]
Tokens [p/t/c]          [116/130/14]
Sentiment Score         [neutral]
Azure OpenAI is listening. Say 'Stop', 'Exit' or press Ctrl-Z to end the conversation.
Recognized speech       [Why are you depressed?]
Azure OpenAI response   [Why wouldn't I be? Life is just one big cosmic joke, and I'm the punchline.]
Tokens [p/t/c]          [143/166/23]
Sentiment Score         [negative]

The Token [p/t/c] output is formatted as Prompt Tokens, Usage Tokens and Completion Tokens.

Contributing

Contributions to this project are welcome! Users can contribute to this project by reporting bugs or issues, suggesting new features, or submitting code contributions. To get started, users should fork the repository and submit a pull request. Before submitting a pull request, they should check out the following resources:

Thank you for your interest in contributing to this project!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
__pycache__		__pycache__
.gitattributes		.gitattributes
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
aichat.py		aichat.py
config.py		config.py
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Azure Open AI Conversational Speech-to-Speech

Getting Started

Usage

Dependencies

Example Output

Contributing

About

Releases

Packages

Languages

License

kotobuki09/aichat-beta

Folders and files

Latest commit

History

Repository files navigation

Azure Open AI Conversational Speech-to-Speech

Getting Started

Usage

Dependencies

Example Output

Contributing

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages