Arc data pipeline for recommending movies based on the MovieLens dataset

adamschmidt/movielens-arc


Arc Movie Recommender

Implementation of a movie recommendation pipeline using MovieLens data. Inspiration came from this article on codementor and the Spark documentation on collaborative filtering.

This repo is built on the arc data pipeline framework.

Setup

Download and unpack the MovieLens data files by running make download.

Running

Start the infrastructure by running docker-compose up. This will start the Arc Jupyter Notebook service. You can then open Jupyter at http://localhost:8888.

With the Docker infrastructure running, the first step is to run src/MovieModel.ipynb to set up, train, and store the ALS machine learning model for the pipeline to use.

Then, open src/MovieRecommendations.ipynb to execute the ML model as part of a data pipeline.
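For readers unfamiliar with ALS (alternating least squares), the idea behind the model can be sketched in a few lines. This is not the code in src/MovieModel.ipynb, which runs on Arc and Spark. It is a toy, pure-Python illustration restricted to a single latent factor, and the function names, the sample ratings, and the parameter values are all hypothetical.

```python
# Toy rank-1 ALS for collaborative filtering (illustrative only).
# Assumption: one latent factor per user and per item. Spark's ALS uses
# higher ranks and a distributed solver; this sketch only shows the idea.

def als_rank1(ratings, n_users, n_items, reg=0.1, iterations=20):
    """Fit factors to (user, item, rating) triples by alternating updates."""
    user_f = [1.0] * n_users
    item_f = [1.0] * n_items
    for _ in range(iterations):
        # Hold item factors fixed; each user factor then has a closed-form
        # regularized least-squares solution.
        for u in range(n_users):
            seen = [(i, r) for (uu, i, r) in ratings if uu == u]
            num = sum(r * item_f[i] for i, r in seen)
            den = reg + sum(item_f[i] ** 2 for i, _ in seen)
            user_f[u] = num / den
        # Hold user factors fixed; solve for each item factor the same way.
        for i in range(n_items):
            seen = [(u, r) for (u, ii, r) in ratings if ii == i]
            num = sum(r * user_f[u] for u, r in seen)
            den = reg + sum(user_f[u] ** 2 for u, _ in seen)
            item_f[i] = num / den
    return user_f, item_f


def predict(user_f, item_f, user, item):
    """Predicted rating is the product of the user and item factors."""
    return user_f[user] * item_f[item]


def squared_error(ratings, user_f, item_f):
    """Sum of squared errors over the observed ratings."""
    return sum((r - predict(user_f, item_f, u, i)) ** 2 for u, i, r in ratings)


# Hypothetical ratings: 2 users, 3 movies; user 1 has not rated movie 2.
sample_ratings = [
    (0, 0, 1.0), (0, 1, 2.0), (0, 2, 2.5),
    (1, 0, 2.0), (1, 1, 4.0),
]

user_f, item_f = als_rank1(sample_ratings, n_users=2, n_items=3,
                           reg=0.01, iterations=50)
print("predicted rating for user 1, movie 2:", predict(user_f, item_f, 1, 2))
```

Each pass fixes one side's factors and solves for the other, which is what makes every update a simple least-squares problem. The recommendation step then amounts to ranking unseen movies by predicted rating.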

Clean Up

Files generated by running the Arc IDE can be cleaned up by running make clean.
