This is an analysis of music popularity compared with features of the music like key, mode, duration, danceability, energy, etc...
This analysis was done as a Data Analysis exercise with the idea to practice Data Cleaning, Univariate Analysis, Bivariate Analysis and SQL.
I started the project by defining the questions I wanted to answer:
- How do song properties affects their popularity?
- How popular are acoustic songs compared to the AVG?
- How popular are instrumental songs compared to the AVG?
- How popular are live songs compared to the AVG?
- What are the most popular songs?
- What are the 5 most popular songs in Europe
- What are the most popular songs songs per continent?
- What are the 5 most popular artist?
- What are the properties of the songs of the most popular artists?
- How does song properties evolve across the time? (properties vs release date)
- How does the time impact popularity? (popularity vs release date)
- How many albums were released by the most popular artists?
- When was the release of the first album of the most popular artist?
All these questions are answered through the analysis done on the Python Notebook music_popularity_analysis.ipynb and the sql queries in the file music_popularity_sql_answers.sql.
A preview of the project is also included at the included presentation Music_Popularity_Analysis.pdf including some answers to the questions raised.
This analysis is based on two datasets:
- Top Spotify Songs in 73 Countries (Daily Updated) by asaniczka
- Country and Continent Codes List by Steve Withington
The main information for the analysis is taken from the Spotify dataset but I wanted to group the information per continent instead of country for that reason I used the second dataset.
* NOTE:
This repository doesn't include any dataset (maily due to github file size limitations) to execute the code and/or to generate the database (not included for the same reason) you will need to manually download the datasets and include them as:
./dataset/universal_top_spotify_songs.csv./dataset/country-and-continent-codes-list-csv.csv
Top Spotify Songs in 73 Countries (Daily Updated) is under license ODC Attribution License (ODC-By)
This work is licensed under a Creative Commons Attribution 4.0 International License.
