minnakt/happysadsleepymad

😊😢😴😤 (HappySadSleepyMad)

This is an adapted version of a private repo. As such some code has been omitted.

📈 Data collection

Data was collected from Twitter from March 9th to April 12th. Around 300,000 tweets in total were collected during this roughly one-month period. Tweets were collected for every emoji except 'Symbol' emojis (a category that includes emojis such as flags). The tweets are stored in a SQL database.
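As a rough illustration of the storage step, here is a minimal sketch using Python's built-in `sqlite3` module. The table layout, the column names and the helper functions (`open_db`, `store_tweets`, `tweets_per_emoji`) are assumptions made for this example; the actual schema and SQL engine used in the project are not shown in this repo.

```python
import sqlite3

# Hypothetical schema: the real database layout is not part of this README.
SCHEMA = """
CREATE TABLE IF NOT EXISTS tweets (
    tweet_id   TEXT PRIMARY KEY,
    emoji      TEXT NOT NULL,   -- emoji the tweet was collected for
    text       TEXT NOT NULL,
    lang       TEXT,
    created_at TEXT             -- ISO 8601 date string
);
"""


def open_db(path=":memory:"):
    """Open (or create) the database and make sure the tweets table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def store_tweets(conn, rows):
    """Insert (tweet_id, emoji, text, lang, created_at) tuples, skipping duplicate ids."""
    with conn:  # commits on success, rolls back on error
        conn.executemany("INSERT OR IGNORE INTO tweets VALUES (?, ?, ?, ?, ?)", rows)


def tweets_per_emoji(conn):
    """Return (emoji, count) pairs, most frequent first."""
    cur = conn.execute(
        "SELECT emoji, COUNT(*) FROM tweets "
        "GROUP BY emoji ORDER BY COUNT(*) DESC, emoji"
    )
    return cur.fetchall()
```

Using the tweet id as the primary key together with `INSERT OR IGNORE` means the same tweet can be seen more than once during collection without being stored twice.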

🤓💻 Analyses

  • K-means clustering comparing sentiment to emotion.
  • NLP word vectorization to find words used in similar contexts to certain emojis across different languages.
  • KNN model that classifies the sentiment of a tweet based on all the emojis it contains, with roughly 75% accuracy. Used for the keywords "data science" and "Donald Trump."
  • Attempted a model to predict which emojis a person would use based on the text of a tweet. However, accuracy was only 3%.
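To make the KNN idea above concrete, here is a minimal sketch in plain Python. It is not the project's code (that part is not included in this repo): the emoji-count features, the Euclidean distance, `k=3` and the function names `emoji_vector` and `knn_predict` are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def emoji_vector(emojis, vocab):
    """Turn a list of emojis from one tweet into a count vector over vocab."""
    counts = Counter(emojis)
    return [counts[e] for e in vocab]


def knn_predict(train, query, k=3):
    """Label a query vector by majority vote of its k nearest training vectors.

    train is a list of (vector, label) pairs; distance is Euclidean.
    """
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy usage with made-up labelled tweets (not data from the project).
vocab = ["😊", "😢", "😴", "😤"]
train = [
    (emoji_vector(["😊", "😊"], vocab), "positive"),
    (emoji_vector(["😊"], vocab), "positive"),
    (emoji_vector(["😢", "😤"], vocab), "negative"),
    (emoji_vector(["😤", "😤"], vocab), "negative"),
    (emoji_vector(["😴"], vocab), "neutral"),
]
prediction = knn_predict(train, emoji_vector(["😊", "😊", "😊"], vocab))
```

A real pipeline would hold out part of the labelled tweets to measure accuracy; that evaluation step is omitted here.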

🎨 Visualizations

  • Matplotlib for K-means clustering.
  • D3 bar graph to show ratio of positive/negative/neutral tweets corresponding to each of 😊😢😴😤.
  • D3 bar graph to show language frequency.
  • Radial graphs to show words used in similar contexts to each of 😊😢😴😤.
  • Word clouds showing the words most commonly associated with a keyword, the ratio of negative to positive emojis, and which emojis are used in negative versus positive contexts for that keyword.
  • Pie chart showing top emojis for 'data science.'
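The positive/negative/neutral bar graph needs one share per sentiment for each emoji. A minimal sketch of how such ratios could be computed before handing them to a charting library is shown below; the input format, the three label names and the function name `sentiment_ratios` are assumptions for this example, not the project's actual code.

```python
from collections import Counter, defaultdict

SENTIMENTS = ("positive", "negative", "neutral")


def sentiment_ratios(labelled):
    """Map each emoji to its share of positive, negative and neutral tweets.

    labelled is an iterable of (emoji, sentiment) pairs, one per tweet.
    """
    counts = defaultdict(Counter)
    for emoji, sentiment in labelled:
        counts[emoji][sentiment] += 1

    ratios = {}
    for emoji, per_sentiment in counts.items():
        total = sum(per_sentiment.values())
        ratios[emoji] = {s: per_sentiment[s] / total for s in SENTIMENTS}
    return ratios
```

The resulting dictionary can be serialized to JSON and loaded by a D3 chart, with one group of bars per emoji.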

🎉 Poster / Results


A high-quality version is available as a PDF at https://github.com/minnakt/happysadsleepymad/blob/master/final_poster.pdf.

📝 Blog

More detailed information about the project process can be found here:

If you only want a summary, I recommend reading Part 3; it includes some interesting results that are not on the poster.

📊 Tweet Database

https://drive.google.com/file/d/1Bt_KnTEvEh1LbtU-GKYKfMVYYYYSeYAV/view?usp=sharing

🔥👩❤️ Acknowledgements

This project could not have been completed without the following lovely and brilliant women:

📚 Resources

Sources for creating the emoji json data:

D3 pie chart:

Radial graphs:
