Your browser doesn't support javascript.
Predicting the impact of the third wave of COVID-19 in India using hybrid statistical machine learning models: A time series forecasting and sentiment analysis approach.
Mohan, Sumit; Solanki, Anil Kumar; Taluja, Harish Kumar; Singh, Anuj.
  • Mohan S; Department of Computer Science and Engineering, Bundelkhand Institute of Engineering and Technology, Jhansi, AKTU, Lucknow, India. Electronic address: sumitmohan@bietjhs.ac.in.
  • Solanki AK; Department of Computer Science and Engineering, Bundelkhand Institute of Engineering and Technology, Jhansi, AKTU, Lucknow, India. Electronic address: aksolanki@bietjhs.ac.in.
  • Taluja HK; Department of Computer Science and Engineering, Noida International University, Noida, India. Electronic address: harishtaluja@gmail.com.
  • Anuradha; Department of Computer Science and Engineering, Ajay Kumar Garg Engineering College, Ghaziabad, AKTU, Lucknow, India. Electronic address: anutaluja@gmail.com.
  • Singh A; Department of Computer Science and Engineering, Kamla Nehru Institute of Technology, Sultanpur, AKTU, Lucknow, India. Electronic address: anuj.2295@knit.ac.in.
Comput Biol Med ; 144: 105354, 2022 05.
Article in English | MEDLINE | ID: covidwho-1703412
ABSTRACT

BACKGROUND:

Since January 2020, India has faced two waves of COVID-19; preparation for the upcoming waves is the primary challenge for public health sectors and governments. Therefore, it is important to forecast future cumulative confirmed cases to plan and implement control measures effectively.

METHODS:

This study proposed a hybrid autoregressive integrated moving average (ARIMA) and Prophet model to predict daily confirmed and cumulative confirmed cases. The built-in auto.arima function was first used to select the optimal hyperparameter values of the ARIMA model. Then, the modified ARIMA model was used to find the best fit between the test and forecast data to find the best model parameter combinations. Articles, blog posts, and news stories from virologists, scientists, and health experts related to the third wave of COVID-19 were gathered using the Python web scraping package Beautiful Soup. Their opinions (sentiments) toward the potential third wave were analyzed using natural language processing (NLP) libraries.

RESULTS:

A spike in daily confirmed and cumulative confirmed cases was predicted in India in the next 180 days based on past time series data. The results were validated using various analytical tools and evaluation metrics, producing a root mean square error (RMSE) of 0.14 and a mean absolute percentage error (MAPE) of 0.06. The NLP processing results revealed negative sentiments in most articles and blogs, with few exceptions.

CONCLUSION:

The findings of this study suggest that there will be more active cases in the upcoming days. The proposed models can forecast future daily confirmed and cumulative confirmed cases. This study will help the country and states plan appropriate public health measures for the upcoming waves of COVID-19.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: COVID-19 Type of study: Experimental Studies / Observational study / Prognostic study Limits: Humans Country/Region as subject: Asia Language: English Journal: Comput Biol Med Year: 2022 Document Type: Article

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: COVID-19 Type of study: Experimental Studies / Observational study / Prognostic study Limits: Humans Country/Region as subject: Asia Language: English Journal: Comput Biol Med Year: 2022 Document Type: Article