SNA Hackathon 2014 helper code

This project contains the code that builds a baseline model for the SNA Hackathon online challenge (http://sh2014.org).

Three R scripts are provided:

  • prepare.R - Pre-processes content data and counts likes by post.
  • features.R - Extracts basic text features.
  • baseline.R - Builds the baseline model.

Instructions

Extract the source data files into ./data/src/. Three files are required:

  • test_content.csv
  • train_content.csv
  • train_likes.csv
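
Before running the scripts, it can help to confirm that all three files are in place. The following is a minimal sketch, not part of the repository: the function name `check_sources` is hypothetical, and the default directory is taken from the instruction above.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repository): check that the three
# required source files exist in the given directory (default: ./data/src).
check_sources() {
    src_dir="${1:-./data/src}"
    missing=0
    for f in test_content.csv train_content.csv train_likes.csv; do
        if [ ! -f "$src_dir/$f" ]; then
            # Report each missing file on stderr and remember the failure.
            echo "missing: $src_dir/$f" >&2
            missing=1
        fi
    done
    # Exit status 0 when every file is present, 1 otherwise.
    return "$missing"
}

# Usage, from the repository root:
#   check_sources ./data/src && echo "all source files present"
```

The function only tests that the files exist; it does not validate their contents.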

Details on where to get the source data can be found here

To build the baseline model, run the following from the command line:

git clone https://github.com/snahackathon/sh2014.git
cd ./sh2014
#<unzip data to ./data>
cd R
R --vanilla < prepare.R
R --vanilla < features.R
R --vanilla < baseline.R
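
The three scripts above depend on each other's output, so it may be useful to stop as soon as one of them fails. The sketch below wraps the same commands in a function; the name `run_pipeline` is hypothetical, and it assumes that `R` returns a non-zero exit status when a script run this way hits an error. An alternative runner command can be passed as arguments, which is only there to make the function easy to try out.

```shell
#!/bin/sh
# Hypothetical wrapper (not part of the repository): run the three scripts
# in order from the R directory and stop at the first failure.
run_pipeline() {
    # Default to the command from the instructions when no runner is given.
    if [ "$#" -eq 0 ]; then
        set -- R --vanilla
    fi
    for script in prepare.R features.R baseline.R; do
        if [ ! -f "$script" ]; then
            echo "not found: $script (run this from the R directory)" >&2
            return 1
        fi
        echo "running $script"
        # Feed the script on stdin, as in the original commands.
        if ! "$@" < "$script"; then
            echo "failed: $script" >&2
            return 1
        fi
    done
    echo "all scripts finished"
}

# Usage, from the R directory:
#   run_pipeline
```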

The baseline prediction for the test dataset is now in the data/submit directory. It is only a baseline, so it does not pass the threshold.

Submit your predictions on this page. Remember that sharing your code is prohibited by the competition rules.

Good luck!
