Dataset of discourses about COVID-19 and financial markets from Twitter.

Ngo, Vu Minh

Ngo, Vu Minh.

Ngo VM; School of Banking, Business College, University of Economics Ho Chi Minh City, 59C Nguyen Dinh Chieu st, District 3, Ho Chi Minh City, Vietnam.

Data Brief ; 43: 108428, 2022 Aug.

Article in English | MEDLINE | ID: covidwho-1914301

ABSTRACT

ABSTRACT

In this data article, a collection of 11,625,887 tweets on the topic of the COVID-19 pandemic are provided. The data from Twitter were collected through Twitter API from January 2020 to June 2020. In addition, we also provided subsets of tweets containing discourses on both COVID-19 and financial topics. In order to facilitate the research on sentiment analysis, the Sentiment140 dataset containing 1,600,000 tweets that were annotated as positive or negative sentiment was also provided (Go et al., 2009) We used Term Frequency-Inverse Document Frequency (TF-IDF) algorithm to transform documents to numeric vectors and used logistic regression classifier to train and predict sentiments of tweets. These datasets may garner interest from data science, economists, social science, natural language processing, epidemiology, and public health groups.

Keywords

COVID-19 pandemic; Cryptocurrency markets; Financial markets; Tweets

Fulltext

XML

PubMed Links

Search on Google

Full text: Available Collection: International databases Database: MEDLINE Type of study: Experimental Studies / Prognostic study / Randomized controlled trials Language: English Journal: Data Brief Year: 2022 Document Type: Article Affiliation country: J.dib.2022.108428

Similar

MEDLINE

LILACS

LIS

Fulltext

XML

PubMed Links

Search on Google