Deciphering online censorship on twitter in the Indian context using topic modelling

This repository consists of a course project that aims to breakdown the apparatus of censorship on twitter in India using topic modelling and methods of text analysis and natural language processing.

The ultimate objective of this project is to understand if there are broader patterns or topics that emerge out of the tweets censored by the Indian government. If so, what could these topics be?

The project is broken down into two broad parts. The first part pertains to extracting details of accounts whose tweets were reported by the Indian government to Twitter between 2019 and 2021 using data collected by the Lumen database. A detailed explanation of the context of the problem along with the method used is mentioned in the pdf titled "Context_process_pt1". The python code for web scraping and organizing data from the Lumen database can be found in the file " Web_scraping_twitter_account_data".

The second part of the project attempts to extract texts of the reported tweets (if available), and then use text analysis and topic modelling to understand if there are certain patterns that emerge from the censored content, and if so, what these patterns are. The elaborate context of this part of the project can be found in "Context_process_pt2". The python code for the process can be found in the file "text_analysis_censored_tweets".

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Context_process_part 1 .pdf		Context_process_part 1 .pdf
Context_process_pt2.pdf		Context_process_pt2.pdf
README.md		README.md
Web_scraping_twitter_account_data.ipynb		Web_scraping_twitter_account_data.ipynb
master_replies.csv		master_replies.csv
text_analysis_censored_tweets.ipynb		text_analysis_censored_tweets.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deciphering online censorship on twitter in the Indian context using topic modelling

This repository consists of a course project that aims to breakdown the apparatus of censorship on twitter in India using topic modelling and methods of text analysis and natural language processing.

About

Releases

Packages

Languages

pranathiiyer/Deciphering-censored-tweets-in-India

Folders and files

Latest commit

History

Repository files navigation

Deciphering online censorship on twitter in the Indian context using topic modelling

This repository consists of a course project that aims to breakdown the apparatus of censorship on twitter in India using topic modelling and methods of text analysis and natural language processing.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages