Your browser doesn't support javascript.
Distributed learning for heterogeneous clinical data with application to integrating COVID-19 data across 230 sites.
Tong, Jiayi; Luo, Chongliang; Islam, Md Nazmul; Sheils, Natalie E; Buresh, John; Edmondson, Mackenzie; Merkel, Peter A; Lautenbach, Ebbing; Duan, Rui; Chen, Yong.
  • Tong J; Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA.
  • Luo C; Division of Public Health Sciences, Department of Surgery, Washington University in St. Louis, St. Louis, MO, USA.
  • Islam MN; Optum Labs, UnitedHealth Group, Minnetonka, MN, USA.
  • Sheils NE; Optum Labs, UnitedHealth Group, Minnetonka, MN, USA.
  • Buresh J; Optum Labs, UnitedHealth Group, Minnetonka, MN, USA.
  • Edmondson M; Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA.
  • Merkel PA; Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA.
  • Lautenbach E; Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA.
  • Duan R; Harvard T. H. Chan School of Public Health, Harvard University, Boston, MA, USA.
  • Chen Y; Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA. ychen123@upenn.edu.
NPJ Digit Med ; 5(1): 76, 2022 Jun 14.
Article in English | MEDLINE | ID: covidwho-1890278
ABSTRACT
Integrating real-world data (RWD) from several clinical sites offers great opportunities to improve estimation with a more general population compared to analyses based on a single clinical site. However, sharing patient-level data across sites is practically challenging due to concerns about maintaining patient privacy. We develop a distributed algorithm to integrate heterogeneous RWD from multiple clinical sites without sharing patient-level data. The proposed distributed conditional logistic regression (dCLR) algorithm can effectively account for between-site heterogeneity and requires only one round of communication. Our simulation study and data application with the data of 14,215 COVID-19 patients from 230 clinical sites in the UnitedHealth Group Clinical Research Database demonstrate that the proposed distributed algorithm provides an estimator that is robust to heterogeneity in event rates when efficiently integrating data from multiple clinical sites. Our algorithm is therefore a practical alternative to both meta-analysis and existing distributed algorithms for modeling heterogeneous multi-site binary outcomes.

Full text: Available Collection: International databases Database: MEDLINE Type of study: Prognostic study / Reviews Language: English Journal: NPJ Digit Med Year: 2022 Document Type: Article Affiliation country: S41746-022-00615-8

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Type of study: Prognostic study / Reviews Language: English Journal: NPJ Digit Med Year: 2022 Document Type: Article Affiliation country: S41746-022-00615-8