Skip to content

Latest commit

 

History

History
55 lines (40 loc) · 1.72 KB

README.md

File metadata and controls

55 lines (40 loc) · 1.72 KB

Customer-Churn-Analysis

Final Project of Kodluyoruz Izmir Applied Data Science 101 Bootcamp

Alt Text

This project is a binary classification problem. The aim of this project is to predict the customers that we will lose. Data consists of 51,047 instances and 58 attributes. If we focus on company goals, actually our problem is losing customers. In this situation the company needs to find churned customer in other words the customers that we will lose. Thus, the company will avoid losing profit and revenue.

Dataset Information

This is open source data (Cell2Cell Company) by Teradata center for customer relationship management at Duke University.

Dataset Link

Requirements

There are some general library requirements for the project and they are listed below.

  • Numpy
  • Pandas
  • Scikit-learn
  • Matplotlib
  • Seaborn
  • Machine Learning Librarys

The library requirements specific to some methods are:

  • DecisionTreeClassifier
  • RandomForestClassifier
  • GradientBoostingClassifier
  • XGBClassifier
  • LGBMClassifier
  • CatBoostClassifier

Steps

  • Data Understanding
  • Exploratory Data Analysis (EDA)
  • Train Test Validation Split
  • Outlier Handling
  • Missing Value Handling
  • Feature Generation
  • Encoding
  • Normalization
  • Train Test Validation(X,Y)Split
  • Feature Importance
  • Create Model

Team Members

Team Members
Sezen Duygu CEREN
Furkan KARAKUZ
Dila YAPICI