This project focuses on cleaning and preprocessing data to ensure it is ready for analysis of resumes from HeadHunter. The goal is to handle missing values, remove duplicates, and correct inconsistencies in the dataset.
data/
: Contains the raw and cleaned data CSV files.images/
: Generated reports and visualizations.
- Clone the repository:
git clone https://github.com/st-pavel/Analysis-of-Resumes.git
- Navigate to the project directory:
cd Analysis-of-Resumes
- Install the required dependencies:
pip install -r requirements.txt
Run the data cleaning script:
jupyter nbconvert --to notebook --execute --inplace Project-1_notebook_template.ipynb
You need to get CSV data file from link dataset and place it to data/ folder, or uncomment and run first cell in jupiiter notebook.
- Fork the repository.
- Create a new branch (
git checkout -b feature-branch
). - Commit your changes (
git commit -am 'Add new feature'
). - Push to the branch (
git push origin feature-branch
). - Create a new Pull Request.
This project is licensed under the MIT License.
For any questions or suggestions, please open an issue or contact the project maintainer at [email protected]
Pic 1 Pic 2 Pic 3 Pic 4 Pic 5 Pic 6 Pic 7 Pic 8 Pic 9 Pic 10 Pic 11