Your browser doesn't support javascript.
A Practical and Empirical Comparison of Three Topic Modeling Methods using a COVID-19 Corpus: LSA, LDA, and Top2Vec
56th Annual Hawaii International Conference on System Sciences, HICSS 2023 ; 2023-January:930-939, 2023.
Article in English | Scopus | ID: covidwho-2306370
ABSTRACT
This study was prepared as a practical guide for researchers interested in using topic modeling methodologies. This study is specially designed for those with difficulty determining which methodology to use. Many topic modeling methods have been developed since the 1980s namely, latent semantic indexing or analysis (LSI/LSA), probabilistic LSI/LSA (pLSI/pLSA), naïve Bayes, the Author-Recipient-Topic (ART), Latent Dirichlet Allocation (LDA), Topic Over Time (TOT), Dynamic Topic Models (DTM), Word2Vec, Top2Vec and \variation and combination of these techniques. For researchers from disciplines other than computer science may find it challenging to select a topic modeling methodology. We compared a recently developed topic modeling algorithm-Top2Vec- with two of the most conventional and frequently-used methodologies-LSA and LDA. As a study sample, we used a corpus of 65,292 COVID-19-focused s. Among the 11 topics we identified in each methodology, we found high levels of correlation between LDA and Top2Vec results, followed by LSA and LDA and Top2Vec and LSA. We also provided information on computational resources we used to perform the analyses and provided practical guidelines and recommendations for researchers. © 2023 IEEE Computer Society. All rights reserved.
Keywords
Search on Google
Collection: Databases of international organizations Database: Scopus Language: English Journal: 56th Annual Hawaii International Conference on System Sciences, HICSS 2023 Year: 2023 Document Type: Article

Similar

MEDLINE

...
LILACS

LIS

Search on Google
Collection: Databases of international organizations Database: Scopus Language: English Journal: 56th Annual Hawaii International Conference on System Sciences, HICSS 2023 Year: 2023 Document Type: Article