Welcome to the Datathon of 8th IC2S2, 2022! In this year's competition, we invite you and your team to discover and convey insights from linked data about COVID prevalence, its framing in the national and local U.S. news, behavioral and commercial activities that follow. You are tasked with submiting an original, publicly relatable impression of your discovered patterns or associations, which should include an artistically appealing and scientific accurate data-driven visualization of your findings, alongside an explanatory description of up to 500 words. Findings should relate at least two forms data described above, may but need not link additional information (e.g., Twitter activity, Google Search attention, etc.) Source code of the discovered patterns should also be submitted to verify their scientific accuracy.
Please format your submission as this template and provide a separate explanatory description of your findings of up to 500 words. Please fill this contribution statement and email all three files to [email protected] by Wednesday @ 8am CDT to complete the submission. Late submissions will not be considered.
We compose a portfolio of 5 population-scale datasets for you to explore, including Meta Business Activities, Proquest News Articles, SafeGraph Mobility, The New York Times Covid Cases Report, Urban Region Census. The download links and brief introductions are provided as below.
These datasets are provided by Meta, including business posting activities on Facebook from 2020 to 2022. The data can be used to measure how local businesses are affected by and recover from crisis events like the pandemic. Please click here for more details.
These datasets are made from ProQuest US Newsstream using ProQuest TDM Studio. These datasets were created by selecting all newspaper articles with the term 'covid' in the full text. The possible date range is from January 1st, 2019, to July 12th, 2022. Please click here for more details.
By downloading this data you agree to the following terms: https://about.proquest.com/en/about/Supplemental-Terms-of-Use-TDM-Studio
The population mobility dataset is extracted from SafeGraph mobility data, including the population movements from the end of 2019 to April 2022, between the census tracts in each US MSA. The movements were aggregated from anonymized mobile devices. Please click here for more details.
These datasets are provided by The New York Times, including the daily cumulative number of cases and deaths reported in each US MSA since the beginning of the pandemic. Please click here for more details.
By downloading this data you agree to the following terms: https://github.com/nytimes/covid-19-data/blob/master/LICENSE
This dataset is collected from the latest census, including the regional population size of different demographic categories (e.g., female, 0_to_9_years_old, asian…) in each US MSA. Please click here for more details.