SONG'S INFORMATION RETRIEVAL BASED ON LYRICS

Introduction

Information Retrieval is the process through which a computer system can respond to a user's query for text-based information on a specific topic. IR was one of the first and remains one of the most important problems in the domain of natural laguague processing (NLP) - stanford cs276

This Search Engine gives the result information about the song based on the relevance of the query about the lyrics provided by the user.

Motivation

The system supports users to search for songs based on a query from the lyrics.

We build an appilcation with similar idea with Shazam, MusixMatch

Data

The database we use for this retrieval model is from Song Lyrics Dataset on Kaggle.

This dataset contains lyric of songs by various artists. Thanks to The Author for creating this dataset, and for inspiring us to make this project.

Requirements

numpy
pandas
re
pickle
json
nltk
rank_bm25

Install all packages with the line: pip install -r requirements.txt

After installing the NLTK package, please do install NLTK Data for specific functions to work. Following this command in your terminal:

python
import nltk
nltk.download('popular')

Usage

We deployed our application to Streamlit framework for demo purposes of our project.

To run it, firstly, install the environment according to the requirements section above.

Then, run with the line: streamlit run music-retrieval.py

Or without using Streamlit framework, you can run with jupyter notebook file: jupyter notebook Information_Retrieval.ipynb

Remember to load the data file music_data.csv to be able to perform the next operations.

You can use your own custom music database by creating a file with the same structure as our data file music_data.csv.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
data		data
dictionary		dictionary
Information_Retrieval.ipynb		Information_Retrieval.ipynb
README.md		README.md
corpus.txt		corpus.txt
datasets.txt		datasets.txt
lemma.py		lemma.py
music-retrieval.py		music-retrieval.py
music_data.csv		music_data.csv
preprocess.py		preprocess.py
requirements.txt		requirements.txt
sample.jpeg		sample.jpeg
target.txt		target.txt
test.csv		test.csv
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SONG'S INFORMATION RETRIEVAL BASED ON LYRICS

Introduction

Motivation

Data

Requirements

Usage

About

Releases

Packages

Languages

htuannn/lyrics_retrieval

Folders and files

Latest commit

History

Repository files navigation

SONG'S INFORMATION RETRIEVAL BASED ON LYRICS

Introduction

Motivation

Data

Requirements

Usage

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages