This project analyzes the data from the Stack Overflow Annual Developer Survey and is part of the Udacity Data Science Nanodegree.
See the Medium post.
The main questions to be answered in this notebook are:
- What are the top 5 languages by community size?
- What is the average developer income for each language, and how does it relate to community size?
- Which 10 languages have the highest incomes for developers with one year of experience or less?
- Please check requirements.txt
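The first two questions above come down to counting respondents per language and averaging income per language. The following is a minimal sketch of that idea, not the notebook's actual code: it uses only the standard library so it runs without the requirements, and it assumes the column names `LanguageWorkedWith` (semicolon-separated languages) and `ConvertedComp` (annual income), with `NA` marking missing values. The helper names `language_stats` and `top_languages` and the sample rows are hypothetical.

```python
import csv
import io
from collections import defaultdict

MISSING = ("", "NA")  # assumed markers for missing values


def language_stats(rows, lang_col="LanguageWorkedWith", income_col="ConvertedComp"):
    """Return {language: (community_size, mean_income_or_None)}.

    Column names are assumptions about the survey CSV, not confirmed here.
    """
    counts = defaultdict(int)
    incomes = defaultdict(list)
    for row in rows:
        langs = (row.get(lang_col) or "").strip()
        if langs in MISSING:
            continue  # respondent listed no languages
        income = (row.get(income_col) or "").strip()
        for lang in langs.split(";"):
            counts[lang] += 1
            if income not in MISSING:
                incomes[lang].append(float(income))
    stats = {}
    for lang, size in counts.items():
        values = incomes[lang]
        stats[lang] = (size, sum(values) / len(values) if values else None)
    return stats


def top_languages(stats, n=5):
    """Return the n languages with the largest community size."""
    return sorted(stats, key=lambda lang: stats[lang][0], reverse=True)[:n]


# Tiny made-up sample standing in for the real dataset.
SAMPLE = """LanguageWorkedWith,ConvertedComp
Python;SQL,60000
Python,NA
SQL;Go,90000
Python,30000
NA,50000
"""

stats = language_stats(csv.DictReader(io.StringIO(SAMPLE)))
print(top_languages(stats, 2))
```

To try it on the real dataset, pass `csv.DictReader(open("data/survey_results_public.csv", newline="", encoding="utf-8"))` instead of the sample, adjusting the column names if they differ.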
Clone this repository
git clone https://github.com/hanslemm/Stack-Overflow-Survey-2020
Create a virtualenv
python3 -m venv ve
Activate the virtualenv
source ve/bin/activate
Install the requirements in your virtualenv:
pip install --upgrade pip
pip install -r requirements.txt
.
├── .vscode
│   └── settings.json - Settings for VSCode
├── data - Directory with all information regarding the Stack Overflow 2020 Survey
│   ├── README_2020.txt - SO 2020 details
│   ├── so_survey_2020.pdf - SO 2020 PDF
│   ├── survey_results_public.csv - survey dataset
│   └── survey_results_schema.csv - dataset descriptions
├── images
│   ├── Community Size and Average Income per Language.png
│   ├── Community Size per Language.png
│   └── Top 10 Initial Income Languages and its Income Development.png
├── ve - Virtual environment directory
├── data_analysis.ipynb - Jupyter Notebook with all the data analysis and code
├── README.md - This file
└── requirements.txt - requirements for pip