ssalvatierra777/datathons

 
 

datathons

Toynbee Hall Datathon

Welcome to the first-ever Toynbee Hall datathon! If you have not registered for this event, please pop on over to the Eventbrite page so we can get an idea of how many people participated. This page has all the information you need to help us make meaningful changes in Tower Hamlets.

Background

Since our inception in 1884, evidence-based research has been central to Toynbee Hall’s mission as a charity. Over a century ago we played a key role in creating the first data visualizations mapping poverty in order to help tackle inequalities in London. 135 years later, we are turning to new technologies to help serve the community we continue to be a part of.

Continuing our history of using evidence to build narratives for social change, we thank you for joining us, in person or online, for an afternoon of analyzing and visualizing publicly available data. The materials you create will help us advocate for meaningful changes to government policy.

Below, we detail our methods, goals, and provide as much as we can to enable you to help us.

Context

While you can read more about us on our website, today's Datathon focuses on two separate issues in the Tower Hamlets community:

  1. Youth Homelessness
  2. Local Safety

To give a bit of background information, Tower Hamlets is one of 32 boroughs in London, England.

The borough of Tower Hamlets is broken up into 20 smaller wards, each with its own unique demographics.

In a previous research project looking at the needs of older people in Tower Hamlets, we identified public safety as an issue we needed to address in future work. That future work is this project.

Research Questions

In order to help guide this research, we have provided a list of prompts and questions roughly sorted from more specific to more broad in order to both give you context and guide your analysis.

Crime and Safety

  • How does Tower Hamlets compare to other boroughs in terms of crime?
  • Which wards in Tower Hamlets experience the most crime?
  • What are the patterns and trends of crime and safety in Tower Hamlets?

Social Cohesion

  • How does Tower Hamlets compare to other boroughs in terms of demographic diversity?
  • What are the patterns and trends in Tower Hamlets' population in terms of age?
  • What are the patterns and trends in Tower Hamlets' population in terms of ethnicity?

Youth Homelessness

  • How have patterns of youth homelessness changed over time and space?
  • What are the trends and patterns in terms of young people living in private rental accommodation in Tower Hamlets?

Specifically, we are interested in people aged 18 to 30 for our new PAR project on Youth and Housing.

Our goal here is to identify trends in the data, visualize them, and share them in a way that local charities can use in the literature they produce to advocate for change at the Government level.

Example

For example, in answering the prompt about differences in crime between wards, the resulting data visualization and write-up might look something like this:

"I made the following plot to compare and contrast the amount of theft in some of the Tower Hamlets wards. From the chart, we can see there are many more reported thefts happening in Spitalfields and Banglatown compared with Limehouse and Mile End."

Example Plot

The data sets are listed below along with their original sources, as well as a few tidy versions of some of the data to save you time.

Shared Area

So that not everyone has to re-invent the wheel, we have provided a shared Google Doc where people can write down ideas they think might help others.

Check it out HERE.

Your Analysis

Given the Datasets below, we ask that you analyze your data as transparently as possible. This means:

  1. For each visualization, please use a common open source software like R or Python to create your data visualizations.
  2. Save each visualization and please produce a commented script that allows your dataviz to be reproduced. This will allow us to expand upon your work, ensure the analyses are reproducible, and will let us at Toynbee Hall edit the visualizations for House Style if we adopt your creation in one of our campaigns.
  3. If possible, please try to match the House Style of the charity's data.
  4. You are free to share your creation(s) on Twitter with both the #data4toynbee and #tidytuesday hashtags.
  5. Please publish your creation under an open source license. We only ask this so that, if we do use your analysis or data visualization in our work, we don't run into any legal problems. The MIT license, which falls under this category, is reproduced below.

You can find example analyses in the Examples directory of this repository.

Datasets

In the list below we have our datasets of interest. The crime dataset has been taken from the London Data warehouse. There also exists a script to combine and merge years of London data if you are feeling bold and want to include a lot of archive data.
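For anyone working in Python rather than with the repository's own script, the idea of combining several yearly extracts can be sketched as follows. This is a minimal illustration using only the standard library. It assumes each extract is a wide table keyed by ward and category with one column per month; the column names, ward labels, and numbers are hypothetical placeholders, and the real files will look different.

```python
import csv
import io

# Hypothetical stand-ins for two yearly extracts. Real files will have
# different names, columns, and values.
EXTRACT_A = """ward,category,2017-01,2017-02
Ward A,Theft,3,5
Ward B,Theft,1,2
"""
EXTRACT_B = """ward,category,2018-01,2018-02
Ward A,Theft,4,6
Ward C,Theft,7,8
"""

KEYS = ("ward", "category")


def merge_extracts(*sources):
    """Join count tables on (ward, category), keeping every month column."""
    merged = {}   # (ward, category) -> {month: count}
    months = []   # month columns in the order they were first seen
    for source in sources:
        reader = csv.DictReader(io.StringIO(source))
        for name in reader.fieldnames:
            if name not in KEYS and name not in months:
                months.append(name)
        for row in reader:
            key = tuple(row[k] for k in KEYS)
            counts = merged.setdefault(key, {})
            for name, value in row.items():
                if name not in KEYS:
                    counts[name] = int(value)
    return months, merged


months, merged = merge_extracts(EXTRACT_A, EXTRACT_B)
for key in sorted(merged):
    # None marks a ward/month the source did not report; it is not a zero.
    print(key, [merged[key].get(m) for m in months])
```

Wards that appear in only one extract end up with gaps rather than zeros, which is worth checking for before comparing years.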

The Youth Homelessness data comes from Centrepoint.

Data Notes

  • The Youth Homelessness data varies based on how different districts report what qualifies as homeless
  • Wards changed in London across the three London Databases. If building a complete London crime dataset, these geographic changes will affect locations. See link for more details
  • Both Crime and Youth Homelessness data is COUNT data and not tidy!! Each row does not represent an INDIVIDUAL crime, but an aggregate.
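To illustrate the last note, here is a small sketch of reshaping wide count data into a tidy long format using only the Python standard library. The layout (one column per month), the column names, and the numbers are assumptions made for the example and are not taken from the actual datasets. In pandas, `DataFrame.melt` performs the same reshape.

```python
import csv
import io

# Hypothetical wide extract: one row per ward and category, one column per
# month. The values are placeholders, not real figures.
WIDE = """ward,category,2019-01,2019-02
Ward A,Theft,3,5
Ward A,Burglary,2,0
Ward B,Theft,1,2
"""

ID_COLUMNS = ("ward", "category")


def to_long(wide_text):
    """Turn each month column into its own row: ward, category, month, count."""
    long_rows = []
    for row in csv.DictReader(io.StringIO(wide_text)):
        for column, value in row.items():
            if column in ID_COLUMNS:
                continue
            long_rows.append({
                "ward": row["ward"],
                "category": row["category"],
                "month": column,
                "count": int(value),
            })
    return long_rows


long_rows = to_long(WIDE)

# Each long row is still an aggregate, so totals are sums of `count`,
# never the number of rows.
theft_by_ward = {}
for r in long_rows:
    if r["category"] == "Theft":
        theft_by_ward[r["ward"]] = theft_by_ward.get(r["ward"], 0) + r["count"]

print(len(long_rows), theft_by_ward)
```

The long format has one observation per row (ward, category, month), but the unit of observation remains a count, so any chart of "amount of crime" should plot summed counts.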

Submitting Your Visualization

When you have finished your analysis, please save the code along with the image you created. Feel free to post it on Twitter with a link to your code, à la #TidyTuesday.

You can either make a pull request of this repository where you create a new directory under the Submissions directory or send your image and code to [email protected]!

PLEASE REMEMBER TO ADD AN OPEN SOURCE LICENSE AGREEMENT TO YOUR CODE SO WE CAN USE IT IN OUR CAMPAIGNS

Schedule

Below we have sketched out our plans for the day. Times listed are BST. The schedule for the afternoon is purposely loose so that people can chat, network, and work together to discuss problems and how best to solve them.

Time   Plan                 Speaker
13:00  Welcome!             Xia Lin
13:15  Introductions        David Baker, Elliot Williams-Fletcher, Participants
13:30  Begin Datathon
15:30  Coffee Break
16:30  Wrap Up and Sharing
17:00  Event Ends

House Style

We want to use your work to help advocate for change in Tower Hamlets. In order for us to do that as efficiently as possible, the charities eventually will need to format it with their house style. Below we have listed the general style guides for both Toynbee Hall and Centrepoint.

You do not, of course, need to submit your data viz in our house style. One of the big goals of this event is simply to see what we can learn when we ask for others' help analyzing our data. If you would like to follow a house style, the details for each charity are listed below.

You will notice that both charities use Orange, Black, Grey, and Blue.

Toynbee Hall

  • Font: Arial Regular
  • Primary Color
    • Orange (#F58220)
    • Grey (#635466)
  • Text Color
    • Grey (#4C4D4F)
  • Secondary Palette
    • Blue (#175984)
    • Light Blue (#96B4DE)

Centrepoint (YHD)

  • Font: Oswald Regular (titles); Lato Light, Regular, Heavy (text)
  • Primary Color:
    • Orange (#D14414)
    • White (#FFFFFF)
    • Black (#000000)
  • Secondary Color:
    • Teal (#2EA393)
    • Dark Grey (#363636)
    • Warm Grey (#E9E6DC)
    • Yellow (#FFD141)
    • Blue (#0082C7)
    • Purple (#A54398)
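If you are working in Python, one way to keep the colors above handy is a pair of dictionaries. This is only a sketch: the hex codes are copied from the lists above, but the dictionary keys and the helper names (`hex_to_rgb`, `pick_colors`) are illustrative and not part of any existing package.

```python
# Hex codes copied from the house style lists; names are illustrative only.
TOYNBEE_HALL = {
    "orange": "#F58220",
    "grey": "#635466",
    "text_grey": "#4C4D4F",
    "blue": "#175984",
    "light_blue": "#96B4DE",
}

CENTREPOINT = {
    "orange": "#D14414",
    "white": "#FFFFFF",
    "black": "#000000",
    "teal": "#2EA393",
    "dark_grey": "#363636",
    "warm_grey": "#E9E6DC",
    "yellow": "#FFD141",
    "blue": "#0082C7",
    "purple": "#A54398",
}


def hex_to_rgb(code):
    """Convert '#RRGGBB' to an (r, g, b) tuple of floats between 0 and 1."""
    code = code.lstrip("#")
    return tuple(int(code[i:i + 2], 16) / 255 for i in (0, 2, 4))


def pick_colors(palette, names):
    """Return hex codes for the named colors, in the order requested."""
    return [palette[name] for name in names]


# Example: a two-series chart in Toynbee Hall colors.
series_colors = pick_colors(TOYNBEE_HALL, ["orange", "blue"])
print(series_colors, hex_to_rgb(TOYNBEE_HALL["orange"]))
```

Most plotting libraries accept hex strings directly, so `hex_to_rgb` is only needed for tools that expect numeric color tuples.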

Eventually we plan to create a house style package akin to bbplot, but this has not been created yet.

License Agreement

By participating in our Datathon, we ask that any data visualizations that you create are published under the MIT License.

The license is reproduced below.

Copyright (c) 2019 <YOUR NAME HERE>

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

Thank You and Future Work

We want to thank you for the work that you are doing for us and our community. If you would like your name to be added to the report that will emerge from this project, please add your name to the Google Form below. This will allow us to include your name in the report.

One of the other goals of this Datathon is to create a network of data analysts who are willing to help with future work. Unlike much of the start-up, tech, and machine learning world, many non-profits and charities do not have, or even need, the latest machine learning in order to be effective in their goals. What we do need is people who can create clear, clean, meaningful analyses to help build narratives that mobilize social change. If you enjoyed this and would like to be contacted for similar small requests in the future, please indicate that on the form below.

One of our goals is to use our infrastructure in order to build up a network where we can use data for good to help those that need it.

If you could please fill out THIS ONE MINUTE SURVEY that asks if you would like to be listed on our future reports and be contacted in the future, we would very much appreciate it! This survey also asks if we can share your information with DataKind, another charity organization looking to pair data scientists with charities.
