Skip to content

Datasets

Ione Fine edited this page Apr 10, 2020 · 22 revisions

Covid Tracking

US only, state by state https://covidtracking.com/data/ data available as spreadsheet or via API (JSON and CSV).

NYTimes

US only, state by state https://github.com/nytimes/covid-19-data

European Centre for Disease Prevention and Control

https://www.ecdc.europa.eu/en/geographical-distribution-2019-ncov-cases

All countries, available as CSV, JSON and XML

Nextstrain

https://nextstrain.org/ncov Tracks the spread of coronavirus, inferred from tracking how the genome evolves and mutates. The data visualization is excellent, but scroll down for info on where the data comes from and how to access it. Nextstrain isn't limited to coronavirus so it could be helpful for those who were asking about data for other viruses.

Academic Data Science Alliance

https://www.academicdatascience.org/covid The Academic Data Science Alliance is working with partners to pull together data sets and data science resources related to the COVID-19 pandemic. Initiated by conversations with Columbia, NYU, UC Berkeley, and UW.

Figshare Dimensions

Dimensions has a lot of datasets related to COVID-19 as well as a repository of publications. Lots of biological stuff but also some data on Africa, data from tweets about coronavirus, and "attitude studies" in Spain about the effects of learning about the virus on attitudes towards it.

Wolrdometer

https://www.worldometers.info/coronavirus/coronavirus-cases/ The worldometer tracking of cases and deaths by country. Can also view growth rate, recoveries, and new case info.

Citymapper Mobility Index

https://citymapper.com/cmi The index is calculated by comparing trips planned using the Citymapper app. Data are available for large cities around the world and can be downloaded.

Cuebiq Mobility Index

https://www.cuebiq.com/visitation-insights-covid19/?utm_source=nyt&utm_medium=article&utm_campaign=organic Weekly mobility index calculated from cell phone location data for each county within the US. Example data usage: https://www.nytimes.com/interactive/2020/04/02/us/coronavirus-social-distancing.html

US General Health Data

https://www.cdc.gov/nchs/nhanes/about_nhanes.htm National Health & Nutrition Examination Survey. The diseases, medical conditions, and health indicators to be studied include cardiovascular disease, diabetes, infectious disease, obesity, respiratory disease & many other physical risk factors that could be useful parameters for a covid model. Combines interviews and physical examinations.

Oxford Covid-19 Evidence Service

Case Fatality Rate (CFR) estimates, the country-specific issues affecting the CFR, and provides a current best estimate of the CFR, and more importantly, the Infection Fatality Rate (IFR).

The IFR estimates the fatality rate in all those with infection: the detected disease (cases) and those with an undetected disease (asymptomatic and not tested group).

https://www.cebm.net/covid-19/global-covid-19-case-fatality-rates/ Estimates of the severity of coronavirus disease 2019: a model-based analysis, Lancet

https://www.thelancet.com/journals/laninf/article/PIIS1473-3099(20)30243-7/fulltext

Clone this wiki locally