juananthony/metro-big-data-unir

metro-big-data-unir

This project is a Python script that uses tweepy to create listeners and gather tweets from Twitter. A tweet is captured if it meets either of two conditions: it was posted by a specific user_id, or it mentions that user_id. After a tweet is gathered, some NLP techniques are applied (stopword removal, etc.) and the tweet is classified into one of three classes depending on its content: issue, complaint, or nothing. Each tweet is classified with a pre-trained Naive Bayes model. Finally, the tweet is stored in a MongoDB database.
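
The flow described above can be illustrated with a minimal, standard-library-only sketch. It is not the project's actual code: the tweet dictionary shape (`user.id_str`, `entities.user_mentions`) is assumed to follow the Twitter v1.1 JSON layout, and the stopword list, the English toy training set, and the names `is_relevant`, `tokenize`, `NaiveBayes`, and `process` are all hypothetical. The real project loads a pre-trained model, whereas this sketch trains a tiny multinomial Naive Bayes in place so that it can run without tweepy, MongoDB, or network access.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical stopword list; the project's real list is not shown here.
STOPWORDS = {"the", "a", "is", "at", "on", "in", "of", "and", "to", "for", "again"}

# Toy training data invented for illustration only.
TRAIN = [
    ("Line 6 service interrupted due to a technical fault", "issue"),
    ("Delays on line 10 because of a signal fault", "issue"),
    ("Terrible service, trains are always late and crowded", "complaint"),
    ("Shameful, waiting twenty minutes again, awful service", "complaint"),
    ("Good morning, have a nice day", "nothing"),
    ("Thanks for the information", "nothing"),
]


def is_relevant(tweet, user_id):
    """True if the tweet was posted by user_id or mentions user_id."""
    if tweet["user"]["id_str"] == user_id:
        return True
    mentions = tweet.get("entities", {}).get("user_mentions", [])
    return any(m["id_str"] == user_id for m in mentions)


def tokenize(text):
    """Lowercase, strip URLs and @mentions, keep words, drop stopwords."""
    text = re.sub(r"https?://\S+|@\w+", " ", text.lower())
    return [w for w in re.findall(r"[a-záéíóúñü]+", text) if w not in STOPWORDS]


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total = sum(self.class_counts.values())
        tokens = [w for w in tokenize(text) if w in self.vocab]
        scores = {}
        for label, n in self.class_counts.items():
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            # Log prior plus the sum of smoothed log likelihoods.
            scores[label] = math.log(n / total) + sum(
                math.log((self.word_counts[label][w] + 1) / denom) for w in tokens
            )
        return max(scores, key=scores.get)


def process(tweet, user_id, model):
    """Return the document to store, or None if the tweet is not relevant."""
    if not is_relevant(tweet, user_id):
        return None
    return {"text": tweet["text"], "class": model.predict(tweet["text"])}


model = NaiveBayes().fit(*zip(*TRAIN))
```

A document returned by `process` is the kind of record that could then be written to MongoDB, for example with pymongo's `collection.insert_one(doc)`; that step is left out so the sketch stays self-contained.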

About

Tweet capture tool for Metro Madrid messages
