Disaster Response Pipeline Project

Project Motivation

This project is part of the Data Science Nanodegree Program by Udacity in collaboration with figure Eight. The dataset contains pre-labelled tweet and messages from real life disaster events. The aim is to design a model to categorize massages on all 36 pre-defined categoties that can be sent to the appropriate disaster relief agency.

Requirements

The code should run with no issues using Python versions 3 with the following libraries:

Machine Learning: NumPY, Scipy, Pandas, sklearn
Natural Language Process: NLTK
SQLite Database: SQLalchemy
Model Loading and Saving: Pickle
Web App and Data Visualization: Flask, Plotly

Installation Instructions

Run the following commands in the project's root directory to set up your database and model.
- To run ETL pipeline that cleans data and stores in database python data/process_data.py data/disaster_messages.csv data/disaster_categories.csv data/DisasterResponse.db
- To run ML pipeline that trains classifier and saves python models/train_classifier.py data/DisasterResponse.db models/classifier.pkl
Run the following command in the app's directory to run your web app. python run.py
Go to http://0.0.0.0:3001/ or http://localhost:3001/

File Descriptions

Data
- disaster_categories.csv + disaster_messages.csv - Datasets with all the necessary informations
- process_data.py - Code that reads and cleans the csv files and stores it in a SQL database.
- db_disaster_messages.db - Dataset created after the transformed and cleaned data from the disasters CSV files.
Models
- train_classifer.py - Code necessary to load data and run the machine learning model, this will create a pickle file at the end (classifier.pkl)
App
- run.py - Flask app and the user interface used to predict results and display them.
- templates - Folder containing the html template files

Results

This is the expected frontpage from the website:

By inputting a sentence it should be able to see the categorie result:

There are other options for the pipeline in the ML Pipeline Preparation.ipynb. Feel free to change the build_model() function in the train_classifier.py file (models folder)

Licensing, Authors, Acknowledgements

Must give credit to Figure Eigth for the data. Also, thank you the StackOverFlow community and Udacity for the training! Otherwise, feel free to use the code here as you would like!

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.ipynb_checkpoints		.ipynb_checkpoints
app		app
data		data
models		models
ETL Pipeline Preparation.ipynb		ETL Pipeline Preparation.ipynb
ML Pipeline Preparation.ipynb		ML Pipeline Preparation.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Disaster Response Pipeline Project

Table of Contents

Project Motivation

Requirements

Installation Instructions

File Descriptions

Results

Licensing, Authors, Acknowledgements

About

Releases

Packages

Languages

matsuch/Disaster-Response-Pipeline

Folders and files

Latest commit

History

Repository files navigation

Disaster Response Pipeline Project

Table of Contents

Project Motivation

Requirements

Installation Instructions

File Descriptions

Results

Licensing, Authors, Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages