Project 01: Exploring Weather Trends In Cairo
Analyzing local and global temperature data and comparing the temperature trends in Cairo to overall global temperature trends.
Exploring The Movie Database (TMDb) to answer chosen questions about movies and genres:
- How did several genres fare through the years?
- Does a high budget guarantee a high revenue? and which movie has the highest budget to revenue ratio?
- Does a movie runtime affect the revenue? and is the average movie runtime increasing over the years?
- Who is the most accomplished director in terms of total revenues and total average scores of his movies?
- Who are the most productive directors in terms of number of movies made?
Analyzing the results of an A/B test for an e-commerce website. The goal is to help the company understand if they should implement the new page, keep the old page, or perhaps run the experiment longer to make their decision.
Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations. WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog. These ratings almost always have a denominator of 10. The numerators, though? Almost always greater than 10. 11/10, 12/10, 13/10, etc. Why? Because "they're good dogs Brent." WeRateDogs has over 4 million followers and has received international media coverage. Using the data set, we will answer the following questions:
- Which are the ten most popular dog names?
- Which dog stage is most frequent in our dataset according to number of dogs it has?
- Which dog stage has the highest average rating?
- Are high rated dogs more likely to be favorited than low rated ones?
More Details | View Notebook | View Presentation
The information regarding more than 100,000 loans on Prosper with more than 80 features, were systematically explored starting from plots of single variables and building up to plots of multiple variables. We have selected some features of interest from the available 81 features, performed data cleaning to remove duplicates, missing values, and make the data more consistent for analysis.