Non-negative matrix factorization temporal topic models and clinical text data identify COVID-19 pandemic effects on primary healthcare and community health in Toronto, Canada.
Meaney, Christopher; Escobar, Michael; Moineddin, Rahim; Stukel, Therese A; Kalia, Sumeet; Aliarzadeh, Babak; Chen, Tao; O'Neill, Braden; Greiver, Michelle.
  • Meaney C; Department of Family and Community Medicine, University of Toronto, Canada; Dalla Lana School of Public Health, University of Toronto, Canada. Electronic address: christopher.meaney@utoronto.ca.
  • Escobar M; Dalla Lana School of Public Health, University of Toronto, Canada.
  • Moineddin R; Department of Family and Community Medicine, University of Toronto, Canada; Dalla Lana School of Public Health, University of Toronto, Canada; ICES, Toronto, Canada.
  • Stukel TA; IHPME, University of Toronto, Canada; ICES, Toronto, Canada.
  • Kalia S; Department of Family and Community Medicine, University of Toronto, Canada; Dalla Lana School of Public Health, University of Toronto, Canada.
  • Aliarzadeh B; Department of Family and Community Medicine, University of Toronto, Canada.
  • Chen T; Department of Family and Community Medicine, University of Toronto, Canada.
  • O'Neill B; Department of Family and Community Medicine, University of Toronto, Canada.
  • Greiver M; Department of Family and Community Medicine, University of Toronto, Canada; Department of Family and Community Medicine, North York General Hospital and University of Toronto, Canada.
J Biomed Inform ; 128: 104034, 2022 04.
Article in English | MEDLINE | ID: covidwho-1703628
ABSTRACT

OBJECTIVE:

To demonstrate how non-negative matrix factorization can be used to learn a temporal topic model over a large collection of primary care clinical notes, characterizing diverse COVID-19 pandemic effects on the physical/mental/social health of residents of Toronto, Canada.

MATERIALS AND METHODS:

The study employs a retrospective open cohort design, consisting of 382,666 primary care progress notes from 44,828 patients, 54 physicians, and 12 clinics collected 01/01/2017 through 31/12/2020. Non-negative matrix factorization uncovers a meaningful latent topical structure permeating the corpus of primary care notes. The learned latent topical basis is transformed into a multivariate time series data structure. Time series methods and plots showcase the evolution/dynamics of learned topics over the study period and allow the identification of COVID-19 pandemic effects. We perform several post-hoc checks of model robustness to increase trust that descriptive/unsupervised inferences are stable over hyper-parameter configurations and/or data perturbations.
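
The pipeline described above (factorize a note-by-term matrix, then turn the topic weights into a time series) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the solver, the term weighting, or the aggregation rule. The sketch assumes Lee-Seung multiplicative updates for the factorization and a simple per-period mean of row-normalized topic weights; the vocabulary, counts, period labels, and the function names `nmf` and `topic_time_series` are all hypothetical.

```python
import numpy as np

def nmf(V, k, n_iter=200, seed=0, eps=1e-9):
    """Approximate non-negative V (notes x terms) as W @ H.

    Uses Lee-Seung multiplicative updates for the Frobenius loss
    (an assumption; the abstract does not name a solver).
    """
    rng = np.random.default_rng(seed)
    n_docs, n_terms = V.shape
    W = rng.random((n_docs, k)) + eps   # note-by-topic weights
    H = rng.random((k, n_terms)) + eps  # topic-by-term weights
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def topic_time_series(W, periods):
    """Average row-normalized topic weights within each time period."""
    P = W / np.maximum(W.sum(axis=1, keepdims=True), 1e-12)
    labels = sorted(set(periods))
    periods = np.asarray(periods)
    series = np.vstack([P[periods == p].mean(axis=0) for p in labels])
    return labels, series

# Hypothetical toy corpus: 6 notes over a 6-term vocabulary.
vocab = ["cough", "fever", "visit", "anxiety", "sleep", "phone"]
V = np.array([
    [3, 2, 1, 0, 0, 0],
    [2, 3, 1, 0, 0, 0],
    [4, 1, 2, 0, 0, 0],
    [0, 0, 0, 3, 2, 1],
    [0, 0, 0, 2, 3, 2],
    [0, 0, 0, 4, 1, 1],
], dtype=float)
periods = ["2019", "2019", "2019", "2020", "2020", "2020"]

W, H = nmf(V, k=2)
labels, series = topic_time_series(W, periods)

# Each row of `series` is the mean topic mix for one period.
for label, row in zip(labels, series):
    print(label, np.round(row, 2))

# Top terms per topic, read off the rows of H.
for t, row in enumerate(H):
    top = [vocab[i] for i in np.argsort(row)[::-1][:3]]
    print(f"topic {t}:", top)
```

Plotting each column of `series` against `labels` gives one curve per topic. On a real corpus, a shift in such a curve around early 2020 is the kind of signal the study looks for.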

RESULTS:

Temporal topic modelling uncovers a myriad of pandemic-related effects from the expressive clinical text data. In terms of direct effects on patient health, topics encoding respiratory disease symptoms display altered dynamics during the pandemic year. Further, the pandemic was associated with a multitude of indirect patient-level effects on topical domains representing mental health, sleep, social and familial dynamics, measurement of vitals/labs, uptake of prevention/screening maneuvers, and referrals to medical specialists. Finally, topic models capture changes in primary care practice patterns resulting from the pandemic, including changes in EMR documentation strategies and the uptake of telemedicine.

CONCLUSION:

Temporal topic modelling applied to a large corpus of rich primary care clinical text data can identify a meaningful topical/thematic summarization, providing policymakers and public health stakeholders with a passive, cost-effective technology for understanding the holistic impacts of the COVID-19 pandemic on the primary healthcare system and on community/public health.

Full text: Available Collection: International databases Database: MEDLINE Main subject: Pandemics / COVID-19 Type of study: Cohort study / Experimental Studies / Observational study / Prognostic study Limits: Humans Country/Region as subject: North America Language: English Journal: J Biomed Inform Journal subject: Medical Informatics Year: 2022 Document Type: Article
