Clustered Bert Model for predicting Retweet Popularity
14th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2022
; : 7-12, 2022.
Article
in English
| Scopus | ID: covidwho-2136192
ABSTRACT
This work aims to predict retweet popularity of covid19 tweet corpus. Our work fuses unsupervised and supervised learning techniques to create retweet popularity model. In the first phase, we use a Clustered Bert model, which works on clustering the Bert embeddings using clustering algorithms on the textual data to generate novel and meaningful feature set for our model. In the second phase, we use the output of Clustered Bert model as an input to the Supervised Regression models intending to predict retweet popularity. Our work also draws a comparison between features from numeric model;emotions/sentiment model;and Clustered Bert model. Three different Regression models, belonging to different categories Nearest Neighbors, Ensemble and Stacked models are then tested on the final feature-set to generate predictions for our model. The results show higher accuracy when the Clustered Bert model is used in combination with numerical and emotion/sentiment model. The experiment shows better results for Stacked Regression models out of all the three regressors used for our study. © 2022 IEEE.
Full text:
Available
Collection:
Databases of international organizations
Database:
Scopus
Type of study:
Prognostic study
Language:
English
Journal:
14th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2022
Year:
2022
Document Type:
Article
Similar
MEDLINE
...
LILACS
LIS