The repo contains code for a project in my MSc course, which explains what I have done as part of the programme, which explains my ability in intermediate data analysis and visualization using Python.
This project conducts an exploratory data analysis (EDA) on a Spotify dataset to uncover insights about musical features, trends, and patterns. The analysis leverages Python and libraries like pandas
, matplotlib
, and seaborn
.
The dataset includes various musical attributes of songs sourced from Spotify’s Web API, such as acousticness, danceability, energy, and tempo.
- Data Preprocessing: Load and clean the data.
- Descriptive Statistics: Summarize numerical and categorical features.
- Data Visualization: Visualize distributions and relationships of features.
- Correlation Analysis: Examine correlations between musical attributes.
- Genre Analysis: Explore characteristics of different genres.
- Trend Analysis: Identify temporal trends in music features.
- Distribution Insights: Visualizations of feature distributions like danceability and energy.
- Correlation Findings: Analysis of relationships between different musical attributes.
- Genre Insights: Comparison of musical features across genres.
- Temporal Trends: Observations on how music features have changed over time.
- Data Manipulation: Efficient data handling with pandas.
- Visualization: Creating visual insights using
matplotlib
andseaborn
. - Statistical Analysis: Summarizing and interpreting data features.
- Python Programming: Writing clean and efficient Python code.