We’re a research group based out of Carnegie Mellon University dedicated to developing the theory and practice of epidemic tracking and forecasting. Pre-pandemic we worked mostly on influenza, dengue and norovirus; we’ve now picked up a focus on COVID. We procure unique data streams that reflect epidemic (or pandemic) activity, extract relevant indicators, and make these publicly and continuously available. We and others then use these indicators for nowcasting (situational awareness) and short-term forecasting.
Public health authorities (federal, state, local), the healthcare industry, the public and private sectors, fellow researchers working on epidemic tracking and forecasting, data journalists, and the general public.
September 2023. We were selected to be a Center of Innovation in Outbreak Analytics and Disease Modeling by the Centers for Disease Control and Prevention’s Center for Forecasting and Outbreak Analytics. We are one of thirteen centers serving as the core of Insight Net.
March 2023. With the sunsetting of regular COVID-19 Case reporting by both JHU CSSE and USAFacts, we are no longer focusing on case tracking and forecasting and are now prioritizing COVID and influenza-related hospitalizations, and on gearing up for tracking other circulating and emerging pathogens.
February 2023. We commenced a new project supporting the CDC’s Center for Outbreak Forecasting and Analytics, which includes, among other goals, the prototyping of federated epidemic surveillance.
September 2022. We launched Epidata v4, prioritizing fast access to the most up-to-date data while retaining the deep data revision history needed by researchers.
June 2022. We ended data collection for the COVID-19 Trends and Impact Survey (CTIS) after more than two years, during which we received 29.5 million survey responses in the US (and well over 100m globally).
April 2022. We received the Allen Newell Award for Research Excellence in SCS at CMU, which is awarded annually and recognizes an outstanding body of work that epitomizes Allen Newell’s research style.
April 2022. We received the Policy Impact Award and the Warren J. Mitofsky Innovators Award from the the American Association of Public Opinion Research (AAPOR), along with our partners from the University of Maryland Social Data Science Center and Meta, for our work on the COVID-19 Trends and Impact Survey (CTIS).
January 2022. We added selected signals from the White House COVID-19 Data Strategy and Execution Workgroup’s Community Profile Report (CPR), including hospital admission and vaccination rates. This data source was discontinued in February 2023.
January 2022. After the CDC paused flu forecasting for the 2020-2021 season due to too little flu activity, we restarted generating flu forecasts for the 2021-2022 flu season.
December 2021. Our work was highlighted in a Proceedings of the National Academy of Sciences Special Feature, “Beyond Cases and Deaths: The Benefits of Auxiliary Data Streams In Tracking the COVID-19 Pandemic”
May 2021. We received the 2021 Statistical Partnerships Among Academe, Industry, and Government (SPAIG) award from the American Statistical Association (ASA) along with our COVIDcast collaborators.
April 2021. We launched our redesigned website, which included COVIDcast 2.0.
March 2021. We added PCR testing data from COVID Act Now. These data were discontinued in December 2021.
January 2021. We added signals on adult and pediatric COVID hospitalizations from the U.S. Department of Health & Human Services. In particular, we include the sum of all confirmed adult and pediatric COVID-19 hospital admissions. This sum is used as the “ground truth” for hospitalizations by the COVID-19 Forecast Hub.
December 2020. We added national provisional death counts, from the National Center for Health Statistics (NCHS). These data are based on death certificate data received and coded by NCHS.
December 2020. We added inpatient and outpatient COVID-19 signals based on aggregated statistics from medical claims, provided to us by Change Healthcare.
November 2020. Shifting from our original Google Health Trends data source, we added our Google Symptoms signals, which estimate the volume of searches mapped to symptom sets related to COVID-19.
October 2020. Thirteen volunteer Googlers joined our group for six months via Google.org Fellowships, bringing in professional experience and dramatically increasing our productivity.
September 2020. The COVID-19 Symptom Data Challenge was launched. Sponsored by Delphi along with Meta, University of Maryland, the Duke Margolis Center for Health Policy, and Resolve to Save Lives, the Challenge asked participants to “enable earlier detection and improved situational awareness of the outbreak” using CTIS data.
August 2020. We built upon our pre-pandemic relationship with Quidel and added signals based on positivity rates of their COVID-19 antigen tests. This data source was discontinued in August 2023.
June 2020. We added several new data sources, including:
Anonymized location data from mobile phones from Safegraph. This source was discontinued in July 2022.
County-level confirmed COVID-19 case and death data from USAFacts. This source was deactivated in January 2023.
Inpatient and outpatient COVID-19 hospitalization signals we derived from aggregated statistics from medical claims, provided to us by Optum.
May 2020. We added signals for COVID-19 Cases and Deaths, mirrored from the Center for Systems Science and Engineering at Johns Hopkins University.
April 2020. We partnered with Meta to launch the COVID-19 Trends and Impact Survey (CTIS) to monitor in real-time the spread and impact of the COVID-19 pandemic in the United States.
April 2020. We added signals from two new data sources:
Signals related to COVID-related doctor visits, derived from aggregated statistics from medical claims, provided to us by Optum.
Data from Google Health Trends, which estimate the volume of COVID-related searches in a given location, on a given day. This data source was discontinued in March 2021.
April 2020. We began supporting and advising the U.S. CDC’s community-driven COVID-19 forecasting effort, including creating and maintaining an ensemble forecast from the models submitted to the COVID-19 Forecast Hub, and a forecast evaluation dashboard.
March 2020. Working with Brett Slatkin (head of Google Surveys) and Hal Varian (Google’s Chief Economist), we launched our Google Symptom Survey, which ended in May 2020.
March 2020. We launched COVIDcast, the nation’s largest public repository of diverse, real-time indicators of COVID-19 activity, freely accessible through the Epidata API, which is updated daily with the latest data.
2019. We became a CDC National Center of Excellence for Influenza Forecasting, one of two nationally (and a 5-year designation).
2016. We developed and deployed influenza nowcasts for the CDC, state departments of public health, and the public.
2016. We developed and deployed the Epidata API, which provides real-time access to epidemiological surveillance data.
2013. We began supporting the U.S. CDC’s Influenza Division in advancing and growing a scientific community around influenza forecasting. We’ve been perennial leaders in forecasting accuracy ever since.
We're grateful for financial and other support from our collaborators and supporters: