News Sentiment Informed Time-series Analyzing AI (SITALA) to curb the spread of COVID-19 in Houston

•First successful implementation of multivariate CNN to forecast COVID-19 spread.•The CNN model accepts COVID-19 test positivity and news sentiment as inputs.•COVID-19 news sentiment is obtained using IBM’s Watson Discovery News.•The county-level model can aid public policymakers to curb the spread...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2021-10, Vol.180, p.115104-115104, Article 115104
1. Verfasser: Desai, Prathamesh S.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•First successful implementation of multivariate CNN to forecast COVID-19 spread.•The CNN model accepts COVID-19 test positivity and news sentiment as inputs.•COVID-19 news sentiment is obtained using IBM’s Watson Discovery News.•The county-level model can aid public policymakers to curb the spread of COVID-19.•The model predictions fare better than a published Bayesian-based SEIRD model. Coronavirus disease (COVID-19) has evolved into a pandemic with many unknowns. Houston, located in the Harris County of Texas, is becoming the next hotspot of this pandemic. With a severe decline in international and inter-state travel, a model at the county level is needed as opposed to the state or country level. Existing approaches have a few drawbacks. Firstly, the data used is the number of COVID-19 positive cases instead of positivity. The former is a function of the number of tests carried out while the number of tests normalizes the latter. Positivity gives a better picture of the spread of this pandemic as, with time, more tests are being administered. Positivity under 5% has been desired for the reopening of businesses to almost 100% capacity. Secondly, the data used by models like SEIRD (Susceptible, Exposed, Infectious, Recovered, and Deceased) lacks information about the sentiment of people concerning coronavirus. Thirdly, models that make use of social media posts might have too much noise and misinformation. On the other hand, news sentiment can capture long-term effects of hidden variables like public policy, opinions of local doctors, and disobedience of state-wide mandates. The present study introduces a new artificial intelligence (i.e., AI) model, viz., Sentiment Informed Time-series Analyzing AI (SITALA), trained on COVID-19 test positivity data and news sentiment from over 2750 news articles for Harris county. The news sentiment was obtained using IBM Watson Discovery News. SITALA is inspired by Google-Wavenet architecture and makes use of TensorFlow. The mean absolute error for the training dataset of 66 consecutive days is 2.76, and that for the test dataset of 22 consecutive days is 9.6. A cone of uncertainty is provided within which future COVID-19 test positivity has been shown to fall with high accuracy. The model predictions fare better than a published Bayesian-based SEIRD model. The model forecasts that in order to curb the spread of coronavirus in Houston, a sustained negative news sentiment (e.g., death count for COVID-19 will grow at an a
ISSN:0957-4174
1873-6793
0957-4174
DOI:10.1016/j.eswa.2021.115104