Skip to content

🎵 A song recommendation engine built with Spark and Hadoop using the Million Song dataset.

Notifications You must be signed in to change notification settings

roshan-d21/song-recommendation-engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Song Recommendation Engine

A song recommendation engine built using Spark and Hadoop.

The model is optimised using the Alternating Least Sqaures (ALS) algorithm.

Dependencies

  • Hadoop - 3.2.x
  • Spark - 3.0.x

Usage

Extract the dataset

unzip dataset.zip

Copy the files to Hadoop

hdfs dfs -copyFromLocal 10000.txt /user/<name>
hdfs dfs -copyFromLocal song_data.csv /user/<name>

Run the Spark job

$SPARK_HOME/bin/spark-submit rec_engine.py

Contributors:

About

🎵 A song recommendation engine built with Spark and Hadoop using the Million Song dataset.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages