ubiquitous-arsenal

Grab Twitter data related to all things Arsenal

Installation requirements

NLTK, nameparser and Twython are required. They can be installed via pip or in a conda environment.

For Twython:

pip install twython

Then clone this repository:

git clone git@github.com:dsmiff/ubiquitous-arsenal.git
source setup.sh

Running ubiquitous-arsenal:

source setup.sh
python fetch_tweets.py --counts <nTweets> -H <hashtag> --logging-level <logging_level>

Twython returns at most 100 tweets per search call. nTweets is therefore split into batches of up to 100 tweets, with a 5-second wait between batches. Each tweet is first filtered for squad players, then searched for human names using the nltk package. An immediate weakness is that searching on last names alone is difficult.
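The batching described above can be sketched roughly as follows. This is a minimal illustration, not the code in fetch_tweets.py: the names batch_sizes and fetch_in_batches are hypothetical, and search stands in for whatever callable wraps the Twython search request and returns a list of tweets for a requested count.

```python
import time
from typing import Callable, List


def batch_sizes(n_tweets: int, batch_size: int = 100) -> List[int]:
    """Split n_tweets into request sizes of at most batch_size each."""
    full, remainder = divmod(max(n_tweets, 0), batch_size)
    return [batch_size] * full + ([remainder] if remainder else [])


def fetch_in_batches(
    search: Callable[[int], list],      # hypothetical wrapper around the search call
    n_tweets: int,
    pause: float = 5.0,                 # seconds to wait between batches
    sleep: Callable[[float], None] = time.sleep,
) -> list:
    """Collect up to n_tweets by calling search once per batch."""
    tweets: list = []
    sizes = batch_sizes(n_tweets)
    for index, size in enumerate(sizes):
        tweets.extend(search(size))
        # Wait before the next batch, but not after the final one.
        if index < len(sizes) - 1:
            sleep(pause)
    return tweets
```

With this sketch, a request for 250 tweets becomes three calls asking for 100, 100 and 50 tweets, with two pauses in between. Passing sleep as a parameter is only a convenience so the pause can be replaced when experimenting.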

The tweets are stored in a text file in the directory from which the script is run. To analyse them:

python analyse_tweets.py --in-tweets <tweet_file>

To fetch the current list of players of a given team:

python fetch_players.py --hashtag <team>

This stores the players and their shirt numbers in the teams directory.
