Speech to Text

Speech to Text is a simple web application that allows users to transcribe audio files into text using the Facebook Wav2Vec2 model. This project is built using Flask and leverages the Hugging Face Transformers library.

Features

Transcribe audio files in formats like .wav, .mp3, and .flac.
Display the transcribed text on the web interface.

Getting Started

Prerequisites

Make sure you have Python and pip installed on your system.

Installation

Clone the repository:

git clone https://github.com/yourusername/speech-to-text.git
cd speech-to-text

Create a virtual environment (optional but recommended):
```
python -m venv venv
```
Activate the virtual environment:
- On Windows:
```
venv\Scripts\activate
```
- On macOS/Linux:
```
source venv/bin/activate
```
Install the required packages:
```
pip install -r requirements.txt
```

Running the Application

Run the Flask application:
```
python app.py
```
Open your web browser and navigate to http://127.0.0.1:5000/.
Upload an audio file and click the "Transcribe" button to see the transcribed text.

Usage

Choose an audio file (supported formats: .wav, .mp3, .flac).
Click the "Transcribe" button to get the transcribed text.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Application		Application
Code/Training		Code/Training
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech to Text

Features

Getting Started

Prerequisites

Installation

Running the Application

Usage

About

Releases

Packages

Contributors 2

Languages

XavierPereira2003/Speech-To-Text

Folders and files

Latest commit

History

Repository files navigation

Speech to Text

Features

Getting Started

Prerequisites

Installation

Running the Application

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages