Song Recommendation Engine

A song recommendation engine built using Spark and Hadoop.

The model is optimised using the Alternating Least Sqaures (ALS) algorithm.

Dependencies

Extract the dataset

unzip dataset.zip

Copy the files to Hadoop

hdfs dfs -copyFromLocal 10000.txt /user/<name>
hdfs dfs -copyFromLocal song_data.csv /user/<name>

Run the Spark job

$SPARK_HOME/bin/spark-submit rec_engine.py

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
data subset		data subset
.gitignore		.gitignore
README.md		README.md
dataset.zip		dataset.zip
extract_metadata.py		extract_metadata.py
rec_engine.py		rec_engine.py