A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization.

Yan, Yao; Schaffter, Thomas; Bergquist, Timothy; Yu, Thomas; Prosser, Justin; Aydin, Zafer; Jabeer, Amhar; Brugere, Ivan; Gao, Jifan; Chen, Guanhua; Causey, Jason; Yao, Yuxin; Bryson, Kevin; Long, Dustin R; Jarvik, Jeffrey G; Lee, Christoph I; Wilcox, Adam; Guinney, Justin; Mooney, Sean

Yan Y; Sage Bionetworks, Seattle, Washington.
Schaffter T; Molecular Engineering and Sciences Institute, University of Washington, Seattle.
Bergquist T; Sage Bionetworks, Seattle, Washington.
Yu T; Sage Bionetworks, Seattle, Washington.
Prosser J; Department of Biomedical Informatics and Medical Education, University of Washington, Seattle.
Aydin Z; Sage Bionetworks, Seattle, Washington.
Jabeer A; Institute of Translational Health Sciences, University of Washington, Seattle.
Brugere I; Department of Computer Engineering, Faculty of Engineering, Abdullah Gul University, Kayseri, Turkey.
Gao J; Department of Computer Engineering, Faculty of Engineering, Abdullah Gul University, Kayseri, Turkey.
Chen G; Department of Computer Science, University of Illinois at Chicago, Chicago.
Causey J; Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison.
Yao Y; Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison.
Bryson K; Computer Science Department, College of Engineering and Computer Science, Arkansas State University, Jonesboro.
Long DR; Arkansas AI-Campus, Center for No-Boundary Thinking, Arkansas State University, Jonesboro.
Jarvik JG; Department of Computer Science, University College London, London, United Kingdom.
Lee CI; Department of Computer Science, University College London, London, United Kingdom.
Wilcox A; Division of Critical Care Medicine, Department of Anesthesiology and Pain Medicine, University of Washington, Seattle.
Guinney J; The University of Washington Clinical Learning, Evidence And Research Center for Musculoskeletal Disorders, Seattle.
Mooney S; Department of Radiology, University of Washington School of Medicine, Seattle.

JAMA Netw Open ; 4(10): e2124946, 2021 10 01.

Article in English | MEDLINE | ID: covidwho-1460117

ABSTRACT

Importance:

Machine learning could be used to predict the likelihood of diagnosis and severity of illness. Lack of COVID-19 patient data has hindered the data science community in developing models to aid in the response to the pandemic.

Objectives:

To describe the rapid development and evaluation of clinical algorithms to predict COVID-19 diagnosis and hospitalization using patient data by citizen scientists, provide an unbiased assessment of model performance, and benchmark model performance on subgroups.

Design, Setting, and Participants:

This diagnostic and prognostic study operated a continuous, crowdsourced challenge using a model-to-data approach to securely enable the use of regularly updated COVID-19 patient data from the University of Washington by participants from May 6 to December 23, 2020. A postchallenge analysis was conducted from December 24, 2020, to April 7, 2021, to assess the generalizability of models on the cumulative data set as well as subgroups stratified by age, sex, race, and time of COVID-19 test. By December 23, 2020, this challenge engaged 482 participants from 90 teams and 7 countries.

Main Outcomes and Measures:

Machine learning algorithms used patient data and output a score that represented the probability of patients receiving a positive COVID-19 test result or being hospitalized within 21 days after receiving a positive COVID-19 test result. Algorithms were evaluated using area under the receiver operating characteristic curve (AUROC) and area under the precision recall curve (AUPRC) scores. Ensemble models aggregating models from the top challenge teams were developed and evaluated.
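The two evaluation metrics and a simple ensemble can be illustrated with a minimal Python sketch. It is not the challenge's evaluation code: the function names are hypothetical, AUPRC is approximated here by average precision (the record does not say which estimator was used), and the ensemble is a plain mean of model scores, which is only one of several possible aggregation rules.

```python
from typing import List, Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the fraction of positive/negative pairs ranked correctly.

    Ties count as half a correct pair. Quadratic in the number of
    patients, which is fine for a sketch but not for large cohorts.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Average precision, used here as an assumed stand-in for AUPRC.

    Patients are ranked by descending score; tied scores keep input order.
    """
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("need at least one positive label")
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    total = 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i] == 1:
            hits += 1
            total += hits / rank  # precision at this positive's rank
    return total / n_pos


def ensemble_mean(model_scores: Sequence[Sequence[float]]) -> List[float]:
    """Average the per-patient scores of several models (assumed rule)."""
    return [sum(col) / len(col) for col in zip(*model_scores)]


# Toy labels and scores, invented for illustration only.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(auroc(labels, scores))              # 0.75
print(average_precision(labels, scores))  # about 0.833
```

Because positive cases are rare in both prediction tasks, a precision-based metric such as AUPRC is a useful complement to AUROC, which can look favorable even when most flagged patients are false positives.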

Results:

In the analysis using the cumulative data set, the best performance for COVID-19 diagnosis prediction was an AUROC of 0.776 (95% CI, 0.775-0.777) and an AUPRC of 0.297, and for hospitalization prediction, an AUROC of 0.796 (95% CI, 0.794-0.798) and an AUPRC of 0.188. Analysis on top models submitting to the challenge showed consistently better model performance on the female group than the male group. Among all age groups, the best performance was obtained for the 25- to 49-year age group, and the worst performance was obtained for the group aged 17 years or younger.

Conclusions and Relevance:

In this diagnostic and prognostic study, models submitted by citizen scientists achieved high performance for the prediction of COVID-19 testing and hospitalization outcomes. Evaluation of challenge models on demographic subgroups and prospective data revealed performance discrepancies, providing insights into the potential bias and limitations in the models.
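A subgroup comparison with confidence intervals, of the kind reported in the Results, might be sketched as follows. This is an illustration under stated assumptions, not the study's analysis: the record does not describe how the 95% CIs were obtained, so a percentile bootstrap is assumed, and all function names, group labels, and data are hypothetical.

```python
import random
from collections import defaultdict
from typing import Dict, Sequence, Tuple


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Pairwise AUROC; ties between a positive and a negative count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def auroc_by_group(labels: Sequence[int], scores: Sequence[float],
                   groups: Sequence[str]) -> Dict[str, float]:
    """Compute AUROC separately within each subgroup (e.g. sex or age band)."""
    buckets = defaultdict(lambda: ([], []))
    for y, s, g in zip(labels, scores, groups):
        buckets[g][0].append(y)
        buckets[g][1].append(s)
    return {g: auroc(ys, ss) for g, (ys, ss) in buckets.items()}


def bootstrap_ci(labels: Sequence[int], scores: Sequence[float],
                 n_boot: int = 1000, alpha: float = 0.05,
                 seed: int = 0) -> Tuple[float, float]:
    """Percentile bootstrap CI for AUROC (an assumed method, see lead-in)."""
    n = len(labels)
    if not 0 < sum(labels) < n:
        raise ValueError("need both classes to bootstrap AUROC")
    rng = random.Random(seed)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:  # skip resamples that contain a single class
            stats.append(auroc(ys, [scores[i] for i in idx]))
    stats.sort()
    lo = stats[int((alpha / 2) * n_boot)]
    hi = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lo, hi


# Toy data, invented for illustration; not taken from the study.
labels = [0, 1, 0, 1, 0, 1, 0, 1]
scores = [0.2, 0.9, 0.3, 0.7, 0.6, 0.5, 0.1, 0.8]
groups = ["group_a"] * 4 + ["group_b"] * 4
print(auroc_by_group(labels, scores, groups))  # group_a: 1.0, group_b: 0.75
print(bootstrap_ci(labels, scores, n_boot=200))
```

Stratifying before scoring, rather than scoring once on the pooled cohort, is what lets differences between groups surface; a single pooled AUROC can hide a model that ranks patients well in one group and poorly in another.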

Subject(s)

Algorithms; Benchmarking; COVID-19/diagnosis; Clinical Decision Rules; Crowdsourcing; Hospitalization/statistics & numerical data; Machine Learning; Adolescent; Adult; Aged; Aged, 80 and over; Area Under Curve; COVID-19/epidemiology; COVID-19/therapy; COVID-19 Testing; Child; Child, Preschool; Female; Humans; Infant; Infant, Newborn; Male; Middle Aged; Models, Statistical; Prognosis; ROC Curve; Severity of Illness Index; Washington/epidemiology; Young Adult

Full text: Available
Collection: International databases
Database: MEDLINE
Main subject: Algorithms / Benchmarking / Crowdsourcing / Machine Learning / Clinical Decision Rules / COVID-19 / Hospitalization
Type of study: Diagnostic study / Experimental Studies / Observational study / Prognostic study / Randomized controlled trials
Limits: Adolescent / Adult / Aged / Child / Child, preschool / Female / Humans / Infant / Male / Middle aged
Country/Region as subject: North America
Language: English
Journal: JAMA Netw Open
Year: 2021
Document Type: Article
