Este articulo es un Preprint
Los preprints son informes de investigación preliminares que no han sido certificados por revisión por pares. No deben considerarse para guiar la práctica clínica o los comportamientos relacionados con la salud y no deben publicarse en los medios como información establecida.
Los preprints publicados en línea permiten a los autores recibir comentarios rápidamente, y toda la comunidad científica puede evaluar de forma independiente el trabajo y responder adecuadamente. Estos comentarios se publican junto con los preprints para que cualquiera pueda leer y servir como una revisión pospublicación.
Development of a Post-Acute Sequelae of COVID-19 (PASC) Symptom Lexicon Using Electronic Health Record Clinical Notes (preprint)
medrxiv; 2021.
Preprint
en Inglés
| medRxiv | ID: ppzbmed-10.1101.2021.07.29.21261260
ABSTRACT
Objective:
To develop a comprehensive post-acute sequelae of COVID-19 (PASC) symptom lexicon from clinical notes to support PASC symptom identification and research.Methods:
We identified 26,117 COVID-19 positive patients from the Mass General Brigham's electronic health records (EHR) and extracted 328,879 clinical notes from their post-acute infection period (day 51-110 from first positive COVID-19 test). The PASC symptom lexicon incorporated Unified Medical Language System (UMLS) Metathesaurus concepts and synonyms based on selected semantic types. The MTERMS natural language processing (NLP) tool was used to automatically extract symptoms from a development dataset. The lexicon was iteratively revised with manual chart review, keyword search, concept consolidation, and evaluation of NLP output. We assessed the comprehensiveness of the lexicon and the NLP performance using a validation dataset and reported the symptom prevalence across the entire corpus.Results:
The PASC symptom lexicon included 355 symptoms consolidated from 1,520 UMLS concepts. NLP achieved an averaged precision of 0.94 and an estimated recall of 0.84. Symptoms with the highest frequency included pain (43.1%), anxiety (25.8%), depression (24.0%), fatigue (23.4%), joint pain (21.0%), shortness of breath (20.8%), headache (20.0%), nausea and/or vomiting (19.9%), myalgia (19.0%), and gastroesophageal reflux (18.6%). Discussion andConclusion:
PASC symptoms are diverse. A comprehensive PASC symptom lexicon can be derived using a data-driven, ontology-driven and NLP-assisted approach. By using unstructured data, this approach may improve identification and analysis of patient symptoms in the EHR, and inform prospective study design, preventative care strategies, and therapeutic interventions for patient care.
Texto completo:
Disponible
Colección:
Preprints
Base de datos:
medRxiv
Asunto principal:
Trastornos de Ansiedad
/
Dolor
/
Reflujo Gastroesofágico
/
Artralgia
/
Náusea y Vómito Posoperatorios
/
Trastorno Depresivo
/
Disnea
/
Fatiga
/
Mialgia
/
COVID-19
Idioma:
Inglés
Año:
2021
Tipo del documento:
Preprint
Similares
MEDLINE
...
LILACS
LIS