Lyrical Recognition

Attempt at keyword recognition and transcription of audio using PocketSphinx and REPET.

Setup

Install required packages using pip

$ pip install numpy
$ pip install matplotlib
$ pip install librosa
$ pip install soundfile
$ pip install mutagen
$ pip install mp3_tagger
$ pip install azapi
$ pip install SpeechRecognition
$ pip install pydub
$ pip install pocketsphinx

Now install ffmpeg from here: https://www.ffmpeg.org/download.html
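
Before running the script, it can help to confirm that everything above is actually installed. The following is a minimal sketch using only the standard library; it is not part of this repository. The helper names (missing_modules, ffmpeg_available) are hypothetical, and the import names in the list are assumed to match the packages installed above.

```python
import importlib.util
import shutil

# Import names assumed for the packages listed in Setup
# (speech_recognition is the import name of the speech recognition package).
REQUIRED_MODULES = [
    "numpy", "matplotlib", "librosa", "soundfile", "mutagen",
    "mp3_tagger", "azapi", "speech_recognition", "pydub", "pocketsphinx",
]


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found by the current interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def ffmpeg_available():
    """Return True if an `ffmpeg` executable is found on PATH."""
    return shutil.which("ffmpeg") is not None
```

Calling missing_modules() returns a list of anything still to be installed, and ffmpeg_available() reports whether ffmpeg can be found on PATH.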

cd into the folder and run:

$ python vocal_seperation.py

Usage

Run vocal_seperation.py with src set to an ".mp3" file. The script produces a spectrogram of the foreground and background audio. The foreground audio is then passed to PocketSphinx for speech-to-text; in its current state, this step does not produce very accurate results. In addition, an API call retrieves the actual lyrics of the song and saves them to a ".txt" file. The process is basic and unreliable for now, but will be improved in the future.
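
The separation and transcription steps described above can be sketched roughly as follows. This is not the code in vocal_seperation.py. It is an illustration that assumes a REPET-SIM style separation built from librosa's nearest-neighbour filter and soft masks, followed by PocketSphinx through the speech_recognition package. The function names, the 2-second similarity window, and the mask margins are illustrative assumptions, and decoding ".mp3" input may depend on ffmpeg being available.

```python
from pathlib import Path


def separate_vocals(src, out_dir="."):
    """Split `src` into foreground and background audio and return both paths."""
    # Imported lazily so the sketch can be loaded without the audio stack installed.
    import librosa
    import numpy as np
    import soundfile as sf

    y, sr = librosa.load(src)
    s_full, phase = librosa.magphase(librosa.stft(y))

    # Replace each frame by the median of its most similar frames (cosine
    # similarity), skipping neighbours closer than about 2 seconds. The
    # repeating part that survives is treated as the background estimate.
    s_filter = librosa.decompose.nn_filter(
        s_full,
        aggregate=np.median,
        metric="cosine",
        width=int(librosa.time_to_frames(2, sr=sr)),
    )
    s_filter = np.minimum(s_full, s_filter)

    # Soft masks for each layer; margins and power are illustrative, not tuned.
    mask_bg = librosa.util.softmask(s_filter, 2 * (s_full - s_filter), power=2)
    mask_fg = librosa.util.softmask(s_full - s_filter, 10 * s_filter, power=2)

    paths = {}
    for name, mask in (("foreground", mask_fg), ("background", mask_bg)):
        # Reapply the original phase and invert back to a waveform.
        audio = librosa.istft(mask * s_full * phase, length=len(y))
        paths[name] = Path(out_dir) / f"{name}.wav"
        sf.write(str(paths[name]), audio, sr)
    return paths


def transcribe(wav_path):
    """Run PocketSphinx on a WAV file; return "" if nothing is recognized."""
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.AudioFile(str(wav_path)) as source:
        audio = recognizer.record(source)
    try:
        return recognizer.recognize_sphinx(audio)
    except sr.UnknownValueError:
        return ""
```

With these definitions, transcribe(separate_vocals("song.mp3")["foreground"]) would attempt a transcription of the separated foreground track, where "song.mp3" is a placeholder file name.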

Acknowledgements

This project is based on the "REPET-SIM" method of Rafii and Pardo (2012) and the REPET algorithm by Brian McFee.
