The project aims to to produce a starter guide of philosophy reading for an outsider from a data analysis perspective.
- Find reading-friendly philosophers who use less uncommon words based on a self-defined metric 𝑈𝑛𝑐𝑜𝑚𝑚𝑜𝑛𝑊𝑜𝑟𝑑𝐷𝑒𝑛𝑠𝑖𝑡𝑦.
- Utilized TF-IDF to generate a word cloud for Smith(a reading-friendly philosopher chosen) to quickly grasp the main topics he emphasizes on.
- Explore how capitalism developed in the era based on semantic analysis with NRC Emotion lexicon
data/: NA. Github only accepts files less than 25MB, please download the dataset from https://www.kaggle.com/datasets/kouroshalizadeh/history-of-philosophy.
doc/: ipynb files, where the data story is presented with codes.
figs/: Figures
libs/: contains some functions used in the project
output/: csv files, including some middle tables and output tables.