M2_SISE-EasyDate

Read this in other languages: English, Français.

This project is part of the Python Programming evaluation modalities of the Master 2 SISE at the Université Lumière Lyon 2 for the year 2022 - 2023.

Description of the git repository:

initial_datasets, folder containing the original datasets for the project
scripts_colab, folder containing the different works on the design of the predictive model
- cleaning_process, folder containing train and submissions data processing scripts and the finalized datasets
- data_exploration, folder containing the exploratory scripts on the data, relationship between variables, dimension reduction and visualizations
- models, folder containing the classification model scripts used with decisiontree and boosting
  - significance_var, folder containing the feature selection script and the finalized dataset
dash, folder containing the python scripts for creating and building a dash application

Link of the kaggle competition related to this project: https://www.kaggle.com/competitions/easydate-ai-match

In today’s busy world, finding and dating a romantic partner seems more time consuming than ever. As a result, many people have turned to speed dating as a solution that allows one to meet and interact with a large number of potential partners in a short amount of time. In this report, we want to explore what people are looking for in their speed dating matches, what it takes to become successful in getting approvals from a potential partner, if there exist any gender differences, and if any other factors influence peoples’ decisions.

About the Data

The dataset contains records from 6804 participants of Speed Dating Experiment. Data can be found in Kaggle dataset. They consists mainly of general info about each participant, attributes they look for in speed dating partner and their matching decisions.

Project Overview

The aim of this project is to get a hands-on practice on typical workflows usually seen in any machine learning project. Below mentioned is the basic workflow for this project:

Exploratory Data Analysis
Pre-processing
Modelling
Test Analysis
Dash App

Note: For more details, please refer to the notebook for each part.

Installation

google colab
Python 3.10
Python libraries(Pandas,Numpy,Matplotlib,Scikit-learn,plotly.express...)
Visual Studio Code

Name		Name	Last commit message	Last commit date
Latest commit History 88 Commits
Dash		Dash
initial_datasets		initial_datasets
scripts_colab		scripts_colab
README.fr.md		README.fr.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

M2_SISE-EasyDate

Description of the git repository:

Table of contents

Introduction

About the Data

Project Overview

Installation

About

Releases

Packages

Contributors 4

Languages

Skarbkit/M2_SISE-EasyDate

Folders and files

Latest commit

History

Repository files navigation

M2_SISE-EasyDate

Description of the git repository:

Table of contents

Introduction

About the Data

Project Overview

Installation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages