Skip to content

This repository contains all the kaggle competition projects

Notifications You must be signed in to change notification settings

sana1410/WiDS_Datathon-Kaggle_24

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Stay or Stray: Predict Course Dropout

[code](https://www.kaggle.com/code/sanayasin1410/stay-or-stray-v5)

The competition aims to tackle the challenge of predicting student dropout from Online Courses.

General Information


  • This Kaggle competition is organized by the Data Science Hub, Northeastern University.
  • The competition aims to tackle the challenge of predicting student dropout from Online Courses.
  • As the demand for online education skyrockets, understanding student behavior becomes paramount. With dropout rates posing a significant challenge, the need for accurate predictions is more pressing than ever. Imagine being able to identify at-risk students before they disengage, providing timely interventions and support to keep them on track.

Technologies Used


  • Python
  • Machine Learning
  • Scikit-Learn
    • WiDS Datathon 2024 Challenge #2


      Predict the duration of time it takes for patients to receive metastatic cancer diagnosis.

      General Information


      • Metastatic TNBC is considered the most aggressive TNBC and requires urgent and timely treatment. Unnecessary delays in diagnosis and subsequent treatment can have devastating effects in these difficult cancers. Differences in the wait time to get treatment is a good proxy for disparities in healthcare access.
      • The primary goal of building these models is to detect relationships between demographics of the patient with the likelihood of getting timely treatment. The secondary goal is to see if climate patterns impact proper diagnosis and treatment.
      • Gilead Sciences is the sponsor for this year’s WiDS Datathon. They provided a rich, real-world dataset which contains information about demographics, diagnosis and treatment options, and insurance provided about patients who were diagnosed with breast cancer. The dataset originated from Health Verity, one of the largest healthcare data ecosystems in the US. It was enriched with third party geo-demographic data to provide views into the socio economic aspects that may contribute to health equity. For this challenge, the dataset was then further enriched with zip code level climate data.

      Technologies Used


      • Data Science
      • Machine learning
      • Feature selection
      • Regression
      • Scikit-learn
      • Tableau
      • Python

      Dashboard

    About

    This repository contains all the kaggle competition projects

    Resources

    Stars

    Watchers

    Forks

    Releases

    No releases published

    Packages

    No packages published