Your browser doesn't support javascript.
Risk stratification for COVID-19 hospitalization: a multivariable model based on gradient-boosting decision trees.
Gutierrez, Jahir M; Volkovs, Maksims; Poutanen, Tomi; Watson, Tristan; Rosella, Laura C.
  • Gutierrez JM; Layer 6 AI (Gutierrez, Volkovs, Poutanen); ICES (Volkovs, Watson, Rosella); Dalla Lana School of Public Health (Watson, Rosella), University of Toronto; Vector Institute (Rosella), Toronto, Ont.
  • Volkovs M; Layer 6 AI (Gutierrez, Volkovs, Poutanen); ICES (Volkovs, Watson, Rosella); Dalla Lana School of Public Health (Watson, Rosella), University of Toronto; Vector Institute (Rosella), Toronto, Ont.
  • Poutanen T; Layer 6 AI (Gutierrez, Volkovs, Poutanen); ICES (Volkovs, Watson, Rosella); Dalla Lana School of Public Health (Watson, Rosella), University of Toronto; Vector Institute (Rosella), Toronto, Ont.
  • Watson T; Layer 6 AI (Gutierrez, Volkovs, Poutanen); ICES (Volkovs, Watson, Rosella); Dalla Lana School of Public Health (Watson, Rosella), University of Toronto; Vector Institute (Rosella), Toronto, Ont.
  • Rosella LC; Layer 6 AI (Gutierrez, Volkovs, Poutanen); ICES (Volkovs, Watson, Rosella); Dalla Lana School of Public Health (Watson, Rosella), University of Toronto; Vector Institute (Rosella), Toronto, Ont. laura.rosella@utoronto.ca.
CMAJ Open ; 9(4): E1223-E1231, 2021.
Article in English | MEDLINE | ID: covidwho-1593829
ABSTRACT

BACKGROUND:

The COVID-19 pandemic has led to an increased demand for health care resources and, in some cases, shortage of medical equipment and staff. Our objective was to develop and validate a multivariable model to predict risk of hospitalization for patients infected with SARS-CoV-2.

METHODS:

We used routinely collected health records in a patient cohort to develop and validate our prediction model. This cohort included adult patients (age ≥ 18 yr) from Ontario, Canada, who tested positive for SARS-CoV-2 ribonucleic acid by polymerase chain reaction between Feb. 2 and Oct. 5, 2020, and were followed up through Nov. 5, 2020. Patients living in long-term care facilities were excluded, as they were all assumed to be at high risk of hospitalization for COVID-19. Risk of hospitalization within 30 days of diagnosis of SARS-CoV-2 infection was estimated via gradient-boosting decision trees, and variable importance examined via Shapley values. We built a gradient-boosting model using the Extreme Gradient Boosting (XGBoost) algorithm and compared its performance against 4 empirical rules commonly used for risk stratifications based on age and number of comorbidities.

RESULTS:

The cohort included 36 323 patients with 2583 hospitalizations (7.1%). Hospitalized patients had a higher median age (64 yr v. 43 yr), were more likely to be male (56.3% v. 47.3%) and had a higher median number of comorbidities (3, interquartile range [IQR] 2-6 v. 1, IQR 0-3) than nonhospitalized patients. Patients were split into development (n = 29 058, 80.0%) and held-out validation (n = 7265, 20.0%) cohorts. The gradient-boosting model achieved high discrimination (development cohort area under the receiver operating characteristic curve across the 5 folds of 0.852; validation cohort 0.8475) and strong calibration (slope = 1.01, intercept = -0.01). The patients who scored at the top 10% captured 47.4% of hospitalizations, and those who scored at the top 30% captured 80.6%.

INTERPRETATION:

We developed and validated an accurate risk stratification model using routinely collected health administrative data. We envision that modelling such risk stratification based on routinely collected health data could support management of COVID-19 on a population health level.
Subject(s)

Full text: Available Collection: International databases Database: MEDLINE Main subject: Decision Trees / Risk Assessment / COVID-19 / Hospitalization Type of study: Cohort study / Observational study / Prognostic study Limits: Adult / Aged / Female / Humans / Male / Middle aged Country/Region as subject: North America Language: English Journal: CMAJ Open Year: 2021 Document Type: Article

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Decision Trees / Risk Assessment / COVID-19 / Hospitalization Type of study: Cohort study / Observational study / Prognostic study Limits: Adult / Aged / Female / Humans / Male / Middle aged Country/Region as subject: North America Language: English Journal: CMAJ Open Year: 2021 Document Type: Article