Your browser doesn't support javascript.
An infodemiological framework for tracking the spread of SARS-CoV-2 using integrated public data.
Liu, Zhimin; Jiang, Zuodong; Kip, Geoffrey; Snigdha, Kirti; Xu, Jennings; Wu, Xiaoying; Khan, Najat; Schultz, Timothy.
  • Liu Z; Janssen R&D Data Science, Janssen Research and Development, 2341 S Whittmore St, Titusville 08560, Furlong, PA 18925, United States.
  • Jiang Z; Janssen R&D Data Science, Janssen Research and Development, 2341 S Whittmore St, Titusville 08560, Furlong, PA 18925, United States.
  • Kip G; Janssen R&D Data Science, Janssen Research and Development, 2341 S Whittmore St, Titusville 08560, Furlong, PA 18925, United States.
  • Snigdha K; Janssen R&D Data Science, Janssen Research and Development, 2341 S Whittmore St, Titusville 08560, Furlong, PA 18925, United States.
  • Xu J; Janssen R&D Data Science, Janssen Research and Development, 2341 S Whittmore St, Titusville 08560, Furlong, PA 18925, United States.
  • Wu X; Janssen R&D Data Science, Janssen Research and Development, 2341 S Whittmore St, Titusville 08560, Furlong, PA 18925, United States.
  • Khan N; Janssen R&D Data Science, Janssen Research and Development, 2341 S Whittmore St, Titusville 08560, Furlong, PA 18925, United States.
  • Schultz T; Janssen R&D Data Science, Janssen Research and Development, 2341 S Whittmore St, Titusville 08560, Furlong, PA 18925, United States.
Pattern Recognit Lett ; 158: 133-140, 2022 Jun.
Article in English | MEDLINE | ID: covidwho-1804964
ABSTRACT
The outbreak of the SARS-CoV-2 novel coronavirus has caused a health crisis of immeasurable magnitude. Signals from heterogeneous public data sources could serve as early predictors for infection waves of the pandemic, particularly in its early phases, when infection data was scarce. In this article, we characterize temporal pandemic indicators by leveraging an integrated set of public data and apply them to a Prophet model to predict COVID-19 trends. An effective natural language processing pipeline was first built to extract time-series signals of specific articles from a news corpus. Bursts of these temporal signals were further identified with Kleinberg's burst detection algorithm. Across different US states, correlations for Google Trends of COVID-19 related terms, COVID-19 news volume, and publicly available wastewater SARS-CoV-2 measurements with weekly COVID-19 case numbers were generally high with lags ranging from 0 to 3 weeks, indicating them as strong predictors of viral spread. Incorporating time-series signals of these effective predictors significantly improved the performance of the Prophet model, which was able to predict the COVID-19 case numbers between one and two weeks with average mean absolute error rates of 0.38 and 0.46 respectively across different states.
Keywords

Full text: Available Collection: International databases Database: MEDLINE Type of study: Experimental Studies / Prognostic study Language: English Journal: Pattern Recognit Lett Year: 2022 Document Type: Article Affiliation country: J.patrec.2022.04.030

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Type of study: Experimental Studies / Prognostic study Language: English Journal: Pattern Recognit Lett Year: 2022 Document Type: Article Affiliation country: J.patrec.2022.04.030