Data Science for Social Good 2022 Summer Fellowship, Carnegie Mellon University
Erika Salomon, Liliana Millán Núñez
- Mental Health Center in Johnson County, Kansas
- Criminal Justice Coordinating Council in Douglas County, Kansas
Behavioral health crises are a pernicious issue in the US. In 2020, there were over 45K deaths by suicide, and over 1.2 million suicide attempts. In 2021, there were over 107K deaths due to drug overdoses (Center for Disease Control and Prevention, 2022).
We partnered with the Johnson and Douglas Counties in Kansas to tackle this problem in their local contexts. Currently, our partners have outreach teams that work based on referrals to offer care to those in need. Our goal is to provide a proactive solution and use machine learning to predict people at risk of a behavioral health crisis to recommend for preventative outreach.
- Linux/Bash Terminal (to run the scripts)
- Python 3.10.4
- PostgreSQL 22.1.1
To create the environment with all the necessary tools installed (including underlying C libraries and python requirements):
conda hivenv create -f environment.yml
conda activate dojo_mh
Our environment here is called 'dojo_mh'
To allow programmatic access to the database, create an environment variable as:
export DBURL="postgres://your_username:your_password@url_to_database:xxxx/database_name"
where xxxx is the port number. You can also add this to your bash profile so it is available by default when you access the terminal. To add it to your bash profile:
echo "export DBURL="postgres://your_username:your_password@url_to_database:xxxx/database_name" > ~/.bashrc
sh ~/.bashrc
The database can then be accessed using psql
or SQLAlchemy
using the connection string.
The files and workflow to extract the raw data, clean them, and upload them to a database are described in:
The files and workflow to run the pipeline are described in: