Skip to content

varunpusarla/Reddit_Scorer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Reddit_Scorer

A data pipeline built using luigi package that helps to retrieve and scores posts from the top subreddits.

The score of the subreddit is calculated by the following formula:

image

where the post score(PostRS) is calculated by:

image

In this project we score the top 50 subreddits by scoring them on the basis of top 5 comments of their top 10 posts.

Sample Output:

image

Schedule package of python has been used in order to run this pipeline twice a day.

Environment Setup

  1. Install the dependencies by running
pip install -r requirements.txt
  1. Add the following API credentials(you'll need to register for the reddit API) to an .env file a) Client ID b) Client Secret c) Username d) Password e) User agent

  2. Run the file:

python app.py

About

Tracking and ranking Reddit posts over time

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages