ABSTRACT
The COVID-19 outbreak, a recent major public health emergency, was the first national health crisis since China entered the era of mobile social media. In this context, the public posted many quarantine-related posts for help on social media. Most previous studies of social media during the pandemic focused only on people's emotional needs, with less analysis of quarantine help-seeking content. Based on this situation, this study analyzed the relationship between the number of quarantine help-seeking posts and the number of new diagnoses at different time points in the pandemic using Zhihu, the most comprehensive topic discussion platform in China. It showed a positive correlation between the number of help-seeking posts and the pandemic's severity. Given the diversity of people's help-seeking content, this study used topic model analysis and sentiment analysis to explore the key content of people's quarantine help-seeking posts during the pandemic. In light of the framework of uses and gratifications, we found that people posted the most questions in relation to help with information related to pandemic information and quarantine information. Interestingly, the study also found that the content of people's quarantine posts during the pandemic was primarily negative in sentiment. This study can thus help the community understand the changes in people's perceptions, attitudes, and concerns through their reactions to emergencies and then formulate relevant countermeasures to address pandemic control and information regulation, which will have implications for future responses to public health emergencies. Moreover, in terms of psychological aspects, it will help implement future mental health intervention strategies and better address the public's psychological problems.
Subject(s)
COVID-19 , Social Media , Humans , COVID-19/epidemiology , COVID-19/psychology , SARS-CoV-2 , Sentiment Analysis , Emergencies , Quarantine , China/epidemiologyABSTRACT
The vaccines against COVID-19 arrived in Spain at the end of 2020 along with vaccination campaigns which were not free of controversy. The debate was fueled by the adverse effects following the administration of the AstraZeneca-Oxford (AZ) vaccine in some European countries, eventually leading to its temporary suspension as a precautionary measure. In the present study, we analyze the healthcare professionals' conversations, sentiment, polarity, and intensity on social media during two periods in 2021: the one closest to the suspension of the AZ vaccine and the same time frame 30 days later. We also analyzed whether there were differences between Spain and the rest of the world. Results: The negative sentiment ratio was higher (U = 87; p = 0.048) in Spain in March (Med = 0.396), as well as the daily intensity (U = 86; p = 0.044; Med = 0.440). The opposite happened with polarity (U = 86; p = 0.044), which was higher in the rest of the world (Med = -0.264). Conclusions: There was a general increase in messages and interactions between March and April. In Spain, there was a higher incidence of negative messages and intensity compared to the rest of the world during the March period that disappeared in April. Finally, it was found that the dissemination of messages linked to negative emotions towards vaccines against COVID-19 from healthcare professionals contributed to a negative approach to primary prevention campaigns in the middle of the pandemic.
Subject(s)
COVID-19 , Social Media , Vaccines , Humans , Sentiment Analysis , COVID-19/epidemiology , COVID-19/prevention & control , COVID-19 Vaccines , Delivery of Health CareABSTRACT
BACKGROUND: The unprecedented speed of COVID-19 vaccine development and approval has raised public concern about its safety. However, studies on public discourses and opinions on social media focusing on adverse events (AEs) related to COVID-19 vaccine are rare. OBJECTIVE: This study aimed to analyze Korean tweets about COVID-19 vaccines (Pfizer, Moderna, AstraZeneca, Janssen, and Novavax) after the vaccine rollout, explore the topics and sentiments of tweets regarding COVID-19 vaccines, and examine their changes over time. We also analyzed topics and sentiments focused on AEs related to vaccination using only tweets with terms about AEs. METHODS: We devised a sophisticated methodology consisting of 5 steps: keyword search on Twitter, data collection, data preprocessing, data analysis, and result visualization. We used the Twitter Representational State Transfer application programming interface for data collection. A total of 1,659,158 tweets were collected from February 1, 2021, to March 31, 2022. Finally, 165,984 data points were analyzed after excluding retweets, news, official announcements, advertisements, duplicates, and tweets with <2 words. We applied a variety of preprocessing techniques that are suitable for the Korean language. We ran a suite of analyses using various Python packages, such as latent Dirichlet allocation, hierarchical latent Dirichlet allocation, and sentiment analysis. RESULTS: The topics related to COVID-19 vaccines have a very large spectrum, including vaccine-related AEs, emotional reactions to vaccination, vaccine development and supply, and government vaccination policies. Among them, the top major topic was AEs related to COVID-19 vaccination. The AEs ranged from the adverse reactions listed in the safety profile (eg, myalgia, fever, fatigue, injection site pain, myocarditis or pericarditis, and thrombosis) to unlisted reactions (eg, irregular menstruation, changes in appetite and sleep, leukemia, and deaths). Our results showed a notable difference in the topics for each vaccine brand. The topics pertaining to the Pfizer vaccine mainly mentioned AEs. Negative public opinion has prevailed since the early stages of vaccination. In the sentiment analysis based on vaccine brand, the topics related to the Pfizer vaccine expressed the strongest negative sentiment. CONCLUSIONS: Considering the discrepancy between academic evidence and public opinions related to COVID-19 vaccination, the government should provide accurate information and education. Furthermore, our study suggests the need for management to correct the misinformation related to vaccine-related AEs, especially those affecting negative sentiments. This study provides valuable insights into the public discourses and opinions regarding COVID-19 vaccination.
Subject(s)
COVID-19 Vaccines , COVID-19 , Social Media , Female , Humans , COVID-19/prevention & control , COVID-19 Vaccines/adverse effects , Republic of Korea , Sentiment Analysis , VaccinesABSTRACT
BACKGROUND: Chatbots have become a promising tool to support public health initiatives. Despite their potential, little research has examined how individuals interacted with chatbots during the COVID-19 pandemic. Understanding user-chatbot interactions is crucial for developing services that can respond to people's needs during a global health emergency. OBJECTIVE: This study examined the COVID-19 pandemic-related topics online users discussed with a commercially available social chatbot and compared the sentiment expressed by users from 5 culturally different countries. METHODS: We analyzed 19,782 conversation utterances related to COVID-19 covering 5 countries (the United States, the United Kingdom, Canada, Malaysia, and the Philippines) between 2020 and 2021, from SimSimi, one of the world's largest open-domain social chatbots. We identified chat topics using natural language processing methods and analyzed their emotional sentiments. Additionally, we compared the topic and sentiment variations in the COVID-19-related chats across countries. RESULTS: Our analysis identified 18 emerging topics, which could be categorized into the following 5 overarching themes: "Questions on COVID-19 asked to the chatbot" (30.6%), "Preventive behaviors" (25.3%), "Outbreak of COVID-19" (16.4%), "Physical and psychological impact of COVID-19" (16.0%), and "People and life in the pandemic" (11.7%). Our data indicated that people considered chatbots as a source of information about the pandemic, for example, by asking health-related questions. Users turned to SimSimi for conversation and emotional messages when offline social interactions became limited during the lockdown period. Users were more likely to express negative sentiments when conversing about topics related to masks, lockdowns, case counts, and their worries about the pandemic. In contrast, small talk with the chatbot was largely accompanied by positive sentiment. We also found cultural differences, with negative words being used more often by users in the United States than by those in Asia when talking about COVID-19. CONCLUSIONS: Based on the analysis of user-chatbot interactions on a live platform, this work provides insights into people's informational and emotional needs during a global health crisis. Users sought health-related information and shared emotional messages with the chatbot, indicating the potential use of chatbots to provide accurate health information and emotional support. Future research can look into different support strategies that align with the direction of public health policy.
Subject(s)
COVID-19 , Social Media , Humans , United States/epidemiology , COVID-19/epidemiology , COVID-19/prevention & control , COVID-19/psychology , Pandemics , SARS-CoV-2 , Sentiment Analysis , Communicable Disease ControlABSTRACT
The unprecedented rise in the number of COVID-19 cases has drawn global attention, as it has caused an adverse impact on the lives of people all over the world. As of December 31, 2021, more than 2, 86, 901, 222 people have been infected with COVID-19. The rise in the number of COVID-19 cases and deaths across the world has caused fear, anxiety and depression among individuals. Social media is the most dominant tool that disturbed human life during this pandemic. Among the social media platforms, Twitter is one of the most prominent and trusted social media platforms. To control and monitor the COVID-19 infection, it is necessary to analyze the sentiments of people expressed on their social media platforms. In this study, we proposed a deep learning approach known as a long short-term memory (LSTM) model for the analysis of tweets related to COVID-19 as positive or negative sentiments. In addition, the proposed approach makes use of the firefly algorithm to enhance the overall performance of the model. Further, the performance of the proposed model, along with other state-of-the-art ensemble and machine learning models, has been evaluated by using performance metrics such as accuracy, precision, recall, the AUC-ROC and the F1-score. The experimental results reveal that the proposed LSTM + Firefly approach obtained a better accuracy of 99.59% when compared with the other state-of-the-art models.
Subject(s)
COVID-19 , Deep Learning , Humans , Sentiment Analysis , Algorithms , FearABSTRACT
Housing safety and health problems threaten owners' and occupiers' safety and health. Nevertheless, there is no systematic review on this topic to the best of our knowledge. This study compared the academic and public opinions on housing safety and health and reviewed 982 research articles and 3,173 author works on housing safety and health published in the Web of Science Core Collection. PRISMA was used to filter the data, and natural language processing (NLP) was used to analyze emotions of the abstracts. Only 16 housing safety and health articles existed worldwide before 1998 but increased afterward. U.S. scholars published most research articles (30.76%). All top 10 most productive countries were developed countries, except China, which ranked second (16.01%). Only 25.9% of institutions have inter-institutional cooperation, and collaborators from the same institution produce most work. This study found that most abstracts were positive (n = 521), but abstracts with negative emotions attracted more citations. Despite many industries moving toward AI, housing safety and health research are exceptions as per articles published and Tweets. On the other hand, this study reviewed 8,257 Tweets to compare the focus of the public to academia. There were substantially more housing/residential safety (n = 8198) Tweets than housing health Tweets (n = 59), which is the opposite of academic research. Most Tweets about housing/residential safety were from the United Kingdom or Canada, while housing health hazards were from India. The main concern about housing safety per Twitter includes finance, people, and threats to housing safety. By contrast, people mainly concerned about costs of housing health issues, COVID, and air quality. In addition, most housing safety Tweets were neutral but positive dominated residential safety and health Tweets.
Subject(s)
COVID-19 , Social Media , Cluster Analysis , Housing , Humans , Natural Language Processing , Sentiment AnalysisABSTRACT
The spread of data on the web has increased in the last twenty years. One of the reasons is the appearance of social media. The data on social sites describe many real-life events in our daily lives. In the period of the COVID-19 pandemic, a lot of people and media organizations were writing and documenting their health status and the latest news about the coronavirus on social media. Using these tweets (sentiments) about the coronavirus and analyzing them in a computational model can help decision makers in measuring public opinion and yielding remarkable findings. In this research article, we introduce a deep learning sentiment analysis model based on Universal Sentence Encoder. The dataset used in this research was collected from Twitter, and it was classified as positive, neutral, and negative. The sentence embedding model determines the meaning of word sequences instead of individual words. The model divides the dataset into training and testing and depends on the sentence similarity in detecting sentiment class. The obtained accuracy results reached 78.062%, and this result outperforms many traditional ML classifiers based on TF-IDF applied on the same dataset and another model based on the CNN classifier.
Subject(s)
COVID-19 , Social Media , Humans , Pandemics , Semantics , Sentiment AnalysisABSTRACT
BACKGROUND: The COVID-19 pandemic has affected our society and human well-being in various ways. In this study, we investigate how the pandemic has influenced people's emotions and psychological states compared to a pre-pandemic period using real-world data from social media. METHOD: We collected Reddit social media data from 2019 (pre-pandemic) and 2020 (pandemic) from the subreddits communities associated with eight universities. We applied the pre-trained Robustly Optimized BERT pre-training approach (RoBERTa) to learn text embedding from the Reddit messages, and leveraged the relational information among posted messages to train a graph attention network (GAT) for sentiment classification. Finally, we applied model stacking to combine the prediction probabilities from RoBERTa and GAT to yield the final classification on sentiment. With the model-predicted sentiment labels on the collected data, we used a generalized linear mixed-effects model to estimate the effects of pandemic and in-person teaching during the pandemic on sentiment. RESULTS: The results suggest that the odds of negative sentiments in 2020 (pandemic) were 25.7% higher than the odds in 2019 (pre-pandemic) with a p-value < 0.001; and the odds of negative sentiments associated in-person learning were 48.3% higher than with remote learning in 2020 with a p-value of 0.029. CONCLUSIONS: Our study results are consistent with the findings in the literature on the negative impacts of the pandemic on people's emotions and psychological states. Our study contributes to the growing real-world evidence on the various negative impacts of the pandemic on our society; it also provides a good example of using both ML techniques and statistical modeling and inference to make better use of real-world data.
Subject(s)
COVID-19 , Social Media , Humans , COVID-19/epidemiology , Pandemics , Universities , Sentiment AnalysisABSTRACT
In order to analyze the evolution trend of public opinion in emergencies and explore its evolution law, this paper constructs a network sentiment analysis model based on text clustering, where the emotion analysis part is based on the pretraining BERT model and BiGRU model, in which BERT is used as the word embedding model to extract the feature vector of emotional text and BiGRU is used to extract the context of the text feature vector to accurately identify the sentiment polarity of public opinion data. In addition, the K-means clustering algorithm and Kolmogorov-Smirnov Z test were used to divide the different epidemic stages. Compared with other methods, the model proposed in this paper has a great degree of improvement in accuracy, recall, and F1 score index, which provides an opportunity reference and effective detection means for schools at all levels to carry out timely mental health education and psychological intervention for students.
Subject(s)
Epidemics , Sentiment Analysis , Attitude , Cluster Analysis , Humans , StudentsABSTRACT
BACKGROUND: The emergence of the novel coronavirus (COVID-19) and the necessary separation of populations have led to an unprecedented number of new social media users seeking information related to the pandemic. Currently, with an estimated 4.5 billion users worldwide, social media data offer an opportunity for near real-time analysis of large bodies of text related to disease outbreaks and vaccination. These analyses can be used by officials to develop appropriate public health messaging, digital interventions, educational materials, and policies. OBJECTIVE: Our study investigated and compared public sentiment related to COVID-19 vaccines expressed on 2 popular social media platforms-Reddit and Twitter-harvested from January 1, 2020, to March 1, 2022. METHODS: To accomplish this task, we created a fine-tuned DistilRoBERTa model to predict the sentiments of approximately 9.5 million tweets and 70 thousand Reddit comments. To fine-tune our model, our team manually labeled the sentiment of 3600 tweets and then augmented our data set through back-translation. Text sentiment for each social media platform was then classified with our fine-tuned model using Python programming language and the Hugging Face sentiment analysis pipeline. RESULTS: Our results determined that the average sentiment expressed on Twitter was more negative (5,215,830/9,518,270, 54.8%) than positive, and the sentiment expressed on Reddit was more positive (42,316/67,962, 62.3%) than negative. Although the average sentiment was found to vary between these social media platforms, both platforms displayed similar behavior related to the sentiment shared at key vaccine-related developments during the pandemic. CONCLUSIONS: Considering this similar trend in shared sentiment demonstrated across social media platforms, Twitter and Reddit continue to be valuable data sources that public health officials can use to strengthen vaccine confidence and combat misinformation. As the spread of misinformation poses a range of psychological and psychosocial risks (anxiety and fear, etc), there is an urgency in understanding the public perspective and attitude toward shared falsities. Comprehensive educational delivery systems tailored to a population's expressed sentiments that facilitate digital literacy, health information-seeking behavior, and precision health promotion could aid in clarifying such misinformation.
Subject(s)
COVID-19 , Social Media , Vaccines , COVID-19/prevention & control , COVID-19 Vaccines , Humans , Sentiment AnalysisABSTRACT
OBJECTIVES: This study examines the contents of official communication from United States governors' offices related to the COVID-19 pandemic to assess patterns in communication and to determine if they correlate with trends for COVID cases and deaths. METHODS: We collected text data for all COVID-19 related press releases between March 1 and December 31, 2020 from the US governors' office websites in all 50 states. An automated parsing and sentiment analyzer assessed descriptive statistics and trends in tone, including positivity and negativity. RESULTS: We included a total of 7,720 press releases in this study. We found that both positive and negative sentiments were homogenous across states at the beginning of the pandemic but became heterogeneous as the pandemic evolved. The same trend applied to the frequency and tone of press releases. Sentiments across states were overall positive with a small level of negativity. We observed a reactive official communication to the evolution of the number of COVID-19 cases rather than responsive or preventive. CONCLUSIONS: The findings of both positivity and negativity in press communications suggest that the effect of discounted importance was present in official communications. Our findings support a state-dependent optimal communication frequency and tone, agreeing with the curvilinear communication model of organizational theory and implying that feedback cycles between government officials and public response should be shortened to rapidly maximize communication efficacy during the pandemic. Future research should identify and evaluate the drivers of the large differences in communication tone across states and validate the reactive characteristics of COVID-19 official communications.
Subject(s)
COVID-19 , Social Media , COVID-19/epidemiology , Communication , Humans , Pandemics/prevention & control , SARS-CoV-2 , Sentiment Analysis , United States/epidemiologyABSTRACT
The COVID-19 pandemic fueled one of the quickest vaccine developments in history. Misinformation on online social media often leads to negative vaccine sentiment. We conducted a sentiment analysis and Latent Dirichlet Allocation topic modeling from Reddit communities focusing on the COVID-19 vaccine. Polarity analysis suggested these communities expressed positive sentiment regarding the vaccine. However, topic modeling revealed community members mainly focused on the side effects and vaccination experience.
Subject(s)
COVID-19 , Social Media , Vaccines , COVID-19/prevention & control , COVID-19 Vaccines , Humans , Pandemics/prevention & control , SARS-CoV-2 , Sentiment AnalysisABSTRACT
The COVID-19 pandemic caused by SARS-CoV-2 is still raging. Similar to other RNA viruses, SARS-COV-2 is constantly mutating, which leads to the production of many infectious and lethal strains. For instance, the omicron variant detected in November 2021 became the leading strain of infection in many countries around the world and sparked an intense public debate on social media. The aim of this study is to explore the Chinese public's perception of the omicron variants on social media. A total of 121,632 points of data relating to omicron on Sina Weibo from 0:00 27 November 2021 to 23:59:59 30 March 2022 (Beijing time) were collected and analyzed with LDA-based topic modeling and DLUT-Emotion ontology-based sentiment analysis. The results indicate that (1) the public discussion of omicron is based on five topics, including omicron's impact on the economy, the omicron infection situation in other countries/regions, the omicron infection situation in China, omicron and vaccines and pandemic prevention and control for omicron. (2) From the 3 sentiment orientations of 121,632 valid Weibo posts, 49,402 posts were judged as positive emotions, accounting for approximately 40.6%; 47,667 were negative emotions, accounting for nearly 39.2%; and 24,563 were neutral emotions, accounting for about 20.2%. (3) The result of the analysis of the temporal trend of the seven categories of emotion attribution showed that fear kept decreasing, whereas good kept increasing. This study provides more insights into public perceptions of and attitudes toward emerging SARS-CoV-2 variants. The results of this study may provide further recommendations for the Chinese government, public health authorities, and the media to promote knowledge about SARS-CoV-2 variant pandemic-resistant messages.
Subject(s)
COVID-19 , Emotions , Latent Class Analysis , Public Opinion , SARS-CoV-2 , Sentiment Analysis , Social Media , Attitude , COVID-19/economics , COVID-19/epidemiology , COVID-19/prevention & control , COVID-19/virology , COVID-19 Vaccines , China/epidemiology , Federal Government , Health Education , Humans , Internationality , Pandemics/prevention & control , Pandemics/statistics & numerical data , Public HealthABSTRACT
OBJECTIVE: Sentiment analysis is an important method for understanding emotions and opinions expressed through social media exchanges. Little work has been done to evaluate the performance of existing sentiment analysis tools on social media datasets, particularly those related to health, healthcare, or public health. This study aims to address the gap. MATERIAL AND METHODS: We evaluated 11 commonly used sentiment analysis tools on five health-related social media datasets curated in previously published studies. These datasets include Human Papillomavirus Vaccine, Health Care Reform, COVID-19 Masking, Vitals.com Physician Reviews, and the Breast Cancer Forum from MedHelp.org. For comparison, we also analyzed two non-health datasets based on movie reviews and generic tweets. We conducted a qualitative error analysis on the social media posts that were incorrectly classified by all tools. RESULTS: The existing sentiment analysis tools performed poorly with an average weighted F1 score below 0.6. The inter-tool agreement was also low; the average Fleiss Kappa score is 0.066. The qualitative error analysis identified two major causes for misclassification: (1) correct sentiment but on wrong subject(s) and (2) failure to properly interpret inexplicit/indirect sentiment expressions. DISCUSSION AND CONCLUSION: The performance of the existing sentiment analysis tools is insufficient to generate accurate sentiment classification results. The low inter-tool agreement suggests that the conclusion of a study could be entirely driven by the idiosyncrasies of the tool selected, rather than by the data. This is very concerning especially if the results may be used to inform important policy decisions such as mask or vaccination mandates.
Subject(s)
COVID-19 , Social Media , Emotions , Humans , Public Health/methods , Sentiment AnalysisABSTRACT
In previous work, we implemented a deep learning model with CamemBERT and PyTorch, and built a microservices architecture using the TorchServe serving library. Without TorchServe, inference time was three times faster when the model was loaded once in memory compared when the model was loaded each time. The preloaded model without TorchServe presented comparable inference time with the TorchServe instance. However, using a PyTorch preloaded model in a web application without TorchServe would necessitate to implement functionalities already present in TorchServe.
Subject(s)
COVID-19 , Deep Learning , Social Media , Vaccines , COVID-19/prevention & control , COVID-19 Vaccines , Humans , Sentiment AnalysisABSTRACT
The COVID-19 pandemic introduced unprecedented challenges for people and governments. Vaccines are an available solution to this pandemic. Recipients of the vaccines are of different ages, gender, and religion. Muslims follow specific Islamic guidelines that prohibit them from taking a vaccine with certain ingredients. This study aims at analyzing Facebook and Twitter data to understand the discourse related to halal vaccines using aspect-based sentiment analysis and text emotion analysis. We searched for the term "halal vaccine" and limited the timeline to the period between 1 January 2020, and 30 April 2021, and collected 6037 tweets and 3918 Facebook posts. We performed data preprocessing on tweets and Facebook posts and built the Latent Dirichlet Allocation (LDA) model to identify topics. Calculating the sentiment analysis for each topic was the next step. Finally, this study further investigates emotions in the data using the National Research Council of Canada Emotion Lexicon. Our analysis identified four topics in each of the Twitter dataset and Facebook dataset. Two topics of "COVID-19 vaccine" and "halal vaccine" are shared between the two datasets. The other two topics in tweets are "halal certificate" and "must halal", while "sinovac vaccine" and "ulema council" are two other topics in the Facebook dataset. The sentiment analysis shows that the sentiment toward halal vaccine is mostly neutral in Twitter data, whereas it is positive in Facebook data. The emotion analysis indicates that trust is the most present emotion among the top three emotions in both datasets, followed by anticipation and fear.
Subject(s)
COVID-19 , Social Media , Vaccines , COVID-19/prevention & control , Emotions , Humans , Pandemics/prevention & control , Sentiment Analysis , Vaccination/psychologyABSTRACT
Purpose: COVID-19-associated mucormycosis (CAM) was a serious public health problem during the second wave of COVID-19 in India. We planned to analyze public perceptions by sentiment analysis of Twitter data regarding CAM. Methods: In this observational study, the application programming interface (API) provided by the Twitter platform was used for extracting real-time conversations by using keywords related to mucormycosis (colloquially known as "black fungus"), from May 3 to August 29, 2021. Lexicon-based sentiment analysis of the tweets was done using the Vader sentiment analysis tool. To identify the overall sentiment of a user on any given topic, an algorithm to label a user "k" based on their sentiments was used. Results: A total of 4,01,037 tweets were collected between May 3 and August 29, 2021, and the peak frequency of 1,60,000 tweets was observed from May 17 to May 23, 2021. Positive sentiment tweets constituted a larger share as compared to negative sentiment tweets, with weekly variations. A temporal analysis of the demand for utilities showed that the demand was high in the initial period but decreased with time, which was associated with the availability of resources. Conclusion: Sentiment analysis using Twitter data revealed that social media platforms are gaining popularity to express one's emotions during the ongoing COVID-19 pandemic. In our study, time-based assessment of tweets showed a reduction over time in the frequency of negative sentiment tweets. The polarization in the retweet network of users, based on sentiment polarity, showed that the users were well connected, highlighting the fact that such issues bond our society rather than segregating it.
Subject(s)
COVID-19 , Mucormycosis , Social Media , COVID-19/epidemiology , Humans , Mucormycosis/diagnosis , Mucormycosis/epidemiology , Pandemics , Sentiment AnalysisABSTRACT
Social media is Internet-based by design, allowing people to share content quickly via electronic means. People can openly express their thoughts on social media sites such as Twitter, which can then be shared with other people. During the recent COVID-19 outbreak, public opinion analytics provided useful information for determining the best public health response. At the same time, the dissemination of misinformation, aided by social media and other digital platforms, has proven to be a greater threat to global public health than the virus itself, as the COVID-19 pandemic has shown. The public's feelings on social distancing can be discovered by analysing articulated messages from Twitter. The automated method of recognizing and classifying subjective information in text data is known as sentiment analysis. In this research work, we have proposed to use a combination of preprocessing approaches such as tokenization, filtering, stemming, and building N-gram models. Deep belief neural network (DBN) with pseudo labelling is used to classify the tweets. Top layers of the base classifiers are boosted in the pseudo labelling strategy, whereas lower levels of the base classifiers share weights for feature extraction. By introducing the pseudo boost mechanism, our suggested technique preserves the same time complexity as a DBN while achieving fast convergence to optimality. The pseudo labelling improves the performance of the classification. It extracts the keywords from the tweets with high precision. The results reveal that using the DBN classifier in conjunction with the bigram in the N-gram model outperformed other models by 90.3 percent. The proposed approach can also aid medical professionals and decision-makers in determining the best course of action for each location based on their views regarding the pandemic.
Subject(s)
COVID-19 , Social Media , Humans , Neural Networks, Computer , Pandemics , SARS-CoV-2 , Sentiment AnalysisABSTRACT
BACKGROUND AND OBJECTIVE: Social media sentiment analysis based on Twitter data can facilitate real-time monitoring of COVID-19 vaccine-related concerns. Thus, the governments can adopt proactive measures to address misinformation and inappropriate behaviors surrounding the COVID-19 vaccine, threatening the success of the national vaccination campaign. This study aims to identify the correlation between COVID-19 vaccine sentiments expressed on Twitter and COVID-19 vaccination coverage, case increase, and case fatality rate in Indonesia. METHODS: We retrieved COVID-19 vaccine-related tweets collected from Indonesian Twitter users between October 15, 2020, to April 12, 2021, using Drone Emprit Academic (DEA) platform. We collected the daily trend of COVID-19 vaccine coverage and the rate of case increase and case fatality from the Ministry of Health (MoH) official website and the KawalCOVID19 database, respectively. We identified the public sentiments, emotions, word usage, and trend of all filtered tweets 90 days before and after the national vaccination rollout in Indonesia. RESULTS: Using a total of 555,892 COVID-19 vaccine-related tweets, we observed the negative sentiments outnumbered positive sentiments for 59 days (65.50%), with the predominant emotion of anticipation among 90 days of the beginning of the study period. However, after the vaccination rollout, the positive sentiments outnumbered negative sentiments for 56 days (62.20%) with the growth of trust emotion, which is consistent with the positive appeals of the recent news about COVID-19 vaccine safety and the government's proactive risk communication. In addition, there was a statistically significant trend of vaccination sentiment scores, which strongly correlated with the increase of vaccination coverage (r = 0.71, P<.0001 both first and second doses) and the decreasing of case increase rate (r = -0.70, P<.0001) and case fatality rate (r = -0.74, P<.0001). CONCLUSIONS: Our results highlight the utility of social media sentiment analysis as government communication strategies to build public trust, affecting individual willingness to get vaccinated. This finding will be useful for countries to identify and develop strategies for speed up the vaccination rate by monitoring the dynamic netizens' reactions and expression in social media, especially Twitter, using sentiment analysis.