Your browser doesn't support javascript.
An iterative topic model filtering framework for short and noisy user-generated data: analyzing conspiracy theories on twitter.
Kant, Gillian; Wiebelt, Levin; Weisser, Christoph; Kis-Katos, Krisztina; Luber, Mattias; Säfken, Benjamin.
  • Kant G; University of Göttingen, Göttingen, Germany.
  • Wiebelt L; University of Göttingen, Göttingen, Germany.
  • Weisser C; Campus-Institut Data Science (CIDAS), University of Göttingen, Göttingen, Germany.
  • Kis-Katos K; Campus-Institut Data Science (CIDAS), University of Göttingen, Göttingen, Germany.
  • Luber M; University of Göttingen, Göttingen, Germany.
  • Säfken B; Campus-Institut Data Science (CIDAS), University of Göttingen, Göttingen, Germany.
Int J Data Sci Anal ; : 1-21, 2022 May 06.
Article in English | MEDLINE | ID: covidwho-1943732
ABSTRACT
Conspiracy theories have seen a rise in popularity in recent years. Spreading quickly through social media, their disruptive effect can lead to a biased public view on policy decisions and events. We present a novel approach for LDA-pre-processing called Iterative Filtering to study such phenomena based on Twitter data. In combination with Hashtag Pooling as an additional pre-processing step, we are able to achieve a coherent framing of the discussion and topics of interest, despite of the inherent noisiness and sparseness of Twitter data. Our novel approach enables researchers to gain detailed insights into discourses of interest on Twitter, allowing them to identify tweets iteratively that are related to an investigated topic of interest. As an application, we study the dynamics of conspiracy-related topics on US Twitter during the last four months of 2020, which were dominated by the US-Presidential Elections and Covid-19. We monitor the public discourse in the USA with geo-spatial Twitter data to identify conspiracy-related contents by estimating Latent Dirichlet Allocation (LDA) Topic Models. We find that in this period, usual conspiracy-related topics played a marginal role in comparison with dominating topics, such as the US-Presidential Elections or the general discussions about Covid-19. The main conspiracy theories in this period were the ones linked to "Election Fraud" and the "Covid-19-hoax." Conspiracy-related keywords tended to appear together with Trump-related words and words related to his presidential campaign.
Keywords

Full text: Available Collection: International databases Database: MEDLINE Language: English Journal: Int J Data Sci Anal Year: 2022 Document Type: Article Affiliation country: S41060-022-00321-4

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Language: English Journal: Int J Data Sci Anal Year: 2022 Document Type: Article Affiliation country: S41060-022-00321-4