Your browser doesn't support javascript.
Fuzzy Matching for Symptom Detection in Tweets: Application to Covid-19 During the First Wave of the Pandemic in France.
Faviez, Carole; Foulquié, Pierre; Chen, Xiaoyi; Mebarki, Adel; Quennelle, Sophie; Texier, Nathalie; Katsahian, Sandrine; Schuck, Stéphane; Burgun, Anita.
  • Faviez C; Centre de Recherche des Cordeliers, Sorbonne Université, INSERM, Université de Paris, F-75006, Paris, France.
  • Foulquié P; Kap Code, Paris, France.
  • Chen X; Centre de Recherche des Cordeliers, Sorbonne Université, INSERM, Université de Paris, F-75006, Paris, France.
  • Mebarki A; Kap Code, Paris, France.
  • Quennelle S; Centre de Recherche des Cordeliers, Sorbonne Université, INSERM, Université de Paris, F-75006, Paris, France.
  • Texier N; M3C-Necker,Hôpital Necker-Enfants Malades, AP-HP, F-75015, Paris, France.
  • Katsahian S; Kap Code, Paris, France.
  • Schuck S; Centre de Recherche des Cordeliers, Sorbonne Université, INSERM, Université de Paris, F-75006, Paris, France.
  • Burgun A; Hôpital européen Georges Pompidou, Unité d'épidémiologie et de recherche clinique, AP-HP, F-75015, Paris, France.
Stud Health Technol Inform ; 281: 896-900, 2021 May 27.
Article in English | MEDLINE | ID: covidwho-1247818
ABSTRACT
The exhaustive automatic detection of symptoms in social media posts is made difficult by the presence of colloquial expressions, misspellings and inflected forms of words. The detection of self-reported symptoms is of major importance for emergent diseases like the Covid-19. In this study, we aimed to (1) develop an algorithm based on fuzzy matching to detect symptoms in tweets, (2) establish a comprehensive list of Covid-19-related symptoms and (3) evaluate the fuzzy matching for Covid-19-related symptom detection in French tweets. The Covid-19-related symptom list was built based on the aggregation of different data sources. French Covid-19-related tweets were automatically extracted using a dedicated data broker during the first wave of the pandemic in France. The fuzzy matching parameters were finetuned using all symptoms from MedDRA and then evaluated on a subset of 5000 Covid-19-related tweets in French for the detection of symptoms from our Covid-19-related list. The fuzzy matching improved the detection by the addition of 42% more correct matches with an 81% precision.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: Social Media / COVID-19 Type of study: Experimental Studies / Observational study Limits: Humans Country/Region as subject: Europa Language: English Journal: Stud Health Technol Inform Journal subject: Medical Informatics / Health Services Research Year: 2021 Document Type: Article Affiliation country: SHTI210308

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Social Media / COVID-19 Type of study: Experimental Studies / Observational study Limits: Humans Country/Region as subject: Europa Language: English Journal: Stud Health Technol Inform Journal subject: Medical Informatics / Health Services Research Year: 2021 Document Type: Article Affiliation country: SHTI210308