Written and spoken corpus of real and fake social media postings about COVID-19 (preprint)

Ng Bee Chin; Ng Zhi Ee Nicole; Kyla Kwan; Lee Yong Han Dylann; Liu Fang; Xu Hong

Este articulo es un Preprint

Los preprints son informes de investigación preliminares que no han sido certificados por revisión por pares. No deben considerarse para guiar la práctica clínica o los comportamientos relacionados con la salud y no deben publicarse en los medios como información establecida.

Los preprints publicados en línea permiten a los autores recibir comentarios rápidamente, y toda la comunidad científica puede evaluar de forma independiente el trabajo y responder adecuadamente. Estos comentarios se publican junto con los preprints para que cualquiera pueda leer y servir como una revisión pospublicación.

Written and spoken corpus of real and fake social media postings about COVID-19 (preprint)

Ng Bee Chin; Ng Zhi Ee Nicole; Kyla Kwan; Lee Yong Han Dylann; Liu Fang; Xu Hong.

arxiv; 2023.

Preprint en Inglés | PREPRINT-ARXIV | ID: ppzbmed-2310.04237v1

ABSTRACT

ABSTRACT

This study investigates the linguistic traits of fake news and real news. There are two parts to this study text data and speech data. The text data for this study consisted of 6420 COVID-19 related tweets re-filtered from Patwa et al. (2021). After cleaning, the dataset contained 3049 tweets, with 2161 labeled as 'real' and 888 as 'fake'. The speech data for this study was collected from TikTok, focusing on COVID-19 related videos. Research assistants fact-checked each video's content using credible sources and labeled them as 'Real', 'Fake', or 'Questionable', resulting in a dataset of 91 real entries and 109 fake entries from 200 TikTok videos with a total word count of 53,710 words. The data was analysed using the Linguistic Inquiry and Word Count (LIWC) software to detect patterns in linguistic data. The results indicate a set of linguistic features that distinguish fake news from real news in both written and speech data. This offers valuable insights into the role of language in shaping trust, social media interactions, and the propagation of fake news.

Asunto(s)

COVID-19

Texto completo

Imprimir

XML

Buscar en Google

Texto completo: Disponible Colección: Preprints Base de datos: PREPRINT-ARXIV Asunto principal: COVID-19 Idioma: Inglés Año: 2023 Tipo del documento: Preprint

Similares

MEDLINE

LILACS

LIS

Texto completo

Imprimir

XML

Buscar en Google

Texto completo: Disponible Colección: Preprints Base de datos: PREPRINT-ARXIV Asunto principal: COVID-19 Idioma: Inglés Año: 2023 Tipo del documento: Preprint