Your browser doesn't support javascript.
Multisite learning of high-dimensional heterogeneous data with applications to opioid use disorder study of 15,000 patients across 5 clinical sites.
Liu, Xiaokang; Duan, Rui; Luo, Chongliang; Ogdie, Alexis; Moore, Jason H; Kranzler, Henry R; Bian, Jiang; Chen, Yong.
  • Liu X; Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, 423 Guardian Drive, Philadelphia, PA, 19104, USA.
  • Duan R; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA.
  • Luo C; Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, 423 Guardian Drive, Philadelphia, PA, 19104, USA.
  • Ogdie A; Division of Public Health Sciences, Washington University School of Medicine in St. Louis, St. Louis, MO, USA.
  • Moore JH; Department of Medicine, Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
  • Kranzler HR; Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, 90096, USA.
  • Bian J; Department of Psychiatry, University of Pennsylvania Perelman School of Medicine and the VISN 4 MIRECC, Crescenz VAMC, Philadelphia, PA, USA.
  • Chen Y; Department of Health Outcomes and Biomedical Informatics, University of Florida Health Cancer Center, Gainesville, FL, USA.
Sci Rep ; 12(1): 11073, 2022 06 30.
Article in English | MEDLINE | ID: covidwho-1921704
ABSTRACT
Integrating data across institutions can improve learning efficiency. To integrate data efficiently while protecting privacy, we propose A one-shot, summary-statistics-based, Distributed Algorithm for fitting Penalized (ADAP) regression models across multiple datasets. ADAP utilizes patient-level data from a lead site and incorporates the first-order (ADAP1) and second-order gradients (ADAP2) of the objective function from collaborating sites to construct a surrogate objective function at the lead site, where model fitting is then completed with proper regularizations applied. We evaluate the performance of the proposed method using both simulation and a real-world application to study risk factors for opioid use disorder (OUD) using 15,000 patient data from the OneFlorida Clinical Research Consortium. Our results show that ADAP performs nearly the same as the pooled estimator but achieves higher estimation accuracy and better variable selection than the local and average estimators. Moreover, ADAP2 successfully handles heterogeneity in covariate distributions.
Subject(s)

Full text: Available Collection: International databases Database: MEDLINE Main subject: Algorithms / Opioid-Related Disorders Type of study: Experimental Studies / Observational study / Prognostic study Topics: Vaccines Limits: Humans Language: English Journal: Sci Rep Year: 2022 Document Type: Article Affiliation country: S41598-022-14029-9

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Algorithms / Opioid-Related Disorders Type of study: Experimental Studies / Observational study / Prognostic study Topics: Vaccines Limits: Humans Language: English Journal: Sci Rep Year: 2022 Document Type: Article Affiliation country: S41598-022-14029-9