Skip to content

Exploratory data analysis (EDA) and dataviz with Jupyter Notebooks, Matplotlib & Seaborn

Notifications You must be signed in to change notification settings

emsuru/charlie-02-data-analysis

Repository files navigation

Immo Charlie Phase 02: Data Analysis & Visualisation

Python Jupyter Notebook matplotlib seaborn

📖 Description

This Data Explorer is phase 2 of a larger project where I develop an ML pipeline for property price prediction. See also:

  • phase 1 (data collection) here
  • phase 3 (ML model training) here
  • phase 4 (API deployment) here

🌺 Features

  • univariate EDA (Jupyter notebook)
  • multivariate EDA (Jupyter notebook)

graph

graph

⏱️ Background & timeline

This project was done over the course of one week in February 2024 in Ghent (Belgium), during the AI Bootcamp by Becode.

Its main goals were to practice:

  • exploratory data analysis (EDA)
  • data cleaning & preparation
  • data visualisation
  • data storytelling & presentation

The most challenging part for me with this project was coming to terms with just how much of EDA is iterative. Many were the times I had to go back to step 1 after discovering something in step 15. To check out my exploratory work, open any of the three EDA Jupyter notebooks.

🚀 Extensions

If I have time to return to this, I'd like to:

  • use a better, clearer structure, where I split the investigation into structure, quality & content and for each section look at categorical variables and numerical variables separately and in a clear order (e.g. ordinal then nominal, discrete then continuous etc.)
  • have a different version of the exploration where I try fancier libraries and dataviz tools (PyGWalker, Bokeh, Streamlit, Dash)

⚠️ Warning

All my code is currently heavily:

  • docstringed
  • commented
  • and sometimes typed

This is to help me learn and to make my sessions with our training coach more efficient.

🤗 Thank you for visiting my project page!

Connect with me on LinkedIn 🤍

About

Exploratory data analysis (EDA) and dataviz with Jupyter Notebooks, Matplotlib & Seaborn

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published