Your browser doesn't support javascript.
Statistical biases due to anonymization evaluated in an open clinical dataset from COVID-19 patients.
Koll, Carolin E M; Hopff, Sina M; Meurers, Thierry; Lee, Chin Huang; Kohls, Mirjam; Stellbrink, Christoph; Thibeault, Charlotte; Reinke, Lennart; Steinbrecher, Sarah; Schreiber, Stefan; Mitrov, Lazar; Frank, Sandra; Miljukov, Olga; Erber, Johanna; Hellmuth, Johannes C; Reese, Jens-Peter; Steinbeis, Fridolin; Bahmer, Thomas; Hagen, Marina; Meybohm, Patrick; Hansch, Stefan; Vadász, István; Krist, Lilian; Jiru-Hillmann, Steffi; Prasser, Fabian; Vehreschild, Jörg Janne.
  • Koll CEM; University of Cologne, Faculty of Medicine and University Hospital Cologne, Department I of Internal Medicine, Center for Integrated Oncology Aachen Bonn Cologne Duesseldorf, Cologne, Germany. carolin.koll@uk-koeln.de.
  • Hopff SM; University of Cologne, Faculty of Medicine and University Hospital Cologne, Department I of Internal Medicine, Center for Integrated Oncology Aachen Bonn Cologne Duesseldorf, Cologne, Germany.
  • Meurers T; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Charitéplatz 1, 10117, Berlin, Germany.
  • Lee CH; University of Cologne, Faculty of Medicine and University Hospital Cologne, Department I of Internal Medicine, Center for Integrated Oncology Aachen Bonn Cologne Duesseldorf, Cologne, Germany.
  • Kohls M; University of Wuerzburg, Faculty of Medicine, Institute for Clinical Epidemiology and Biometry, Wuerzburg, Germany.
  • Stellbrink C; Department of Cardiology and Intensive Care Medicine, Bielefeld Medical Centre, Medical Faculty OWL, University of Bielefeld, Bielefeld, Germany.
  • Thibeault C; Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Berlin, Germany.
  • Reinke L; Internal Medicine Department I, University Medical Center Schleswig-Holstein Campus Kiel, Kiel, Germany.
  • Steinbrecher S; Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Berlin, Germany.
  • Schreiber S; Internal Medicine Department I, University Medical Center Schleswig-Holstein Campus Kiel, Kiel, Germany.
  • Mitrov L; University of Cologne, Faculty of Medicine and University Hospital Cologne, Department I of Internal Medicine, Center for Integrated Oncology Aachen Bonn Cologne Duesseldorf, Cologne, Germany.
  • Frank S; Department of Anesthesiology, University Hospital of Ludwig-Maximilians-University (LMU), Munich, Germany.
  • Miljukov O; Department of Medicine III, University Hospital, LMU Munich, Munich, Germany.
  • Erber J; University of Wuerzburg, Faculty of Medicine, Institute for Clinical Epidemiology and Biometry, Wuerzburg, Germany.
  • Hellmuth JC; Technical University of Munich, School of Medicine, University Hospital rechts der Isar, Department of Internal Medicine II, Munich, Germany.
  • Reese JP; Department of Medicine III, University Hospital, LMU Munich, Munich, Germany.
  • Steinbeis F; COVID-19 Registry of the LMU Munich (CORKUM), University Hospital, LMU Munich, Munich, Germany.
  • Bahmer T; University of Wuerzburg, Faculty of Medicine, Institute for Clinical Epidemiology and Biometry, Wuerzburg, Germany.
  • Hagen M; Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Berlin, Germany.
  • Meybohm P; Internal Medicine Department I, University Medical Center Schleswig-Holstein Campus Kiel, Kiel, Germany.
  • Hansch S; Airway Research Center North (ARCN), German Center for Lung Research (DZL), Großhansdorf, Germany.
  • Vadász I; Department II for Internal Medicine, Hematology/Oncology, University Hospital Frankfurt, Frankfurt am Main, Germany.
  • Krist L; Department of Anaesthesiology, Intensive Care, Emergency and Pain Medicine, University Hospital Wuerzburg, Wuerzburg, Germany.
  • Jiru-Hillmann S; Department of Infection Prevention and Infectious Diseases, University Hospital Regensburg, Regensburg, Germany.
  • Prasser F; Department of Internal Medicine, Justus Liebig University, Universities of Giessen and Marburg Lung Center (UGMLC), Member of the German Center for Lung Research (DZL), Giessen, Germany.
  • Vehreschild JJ; The Cardio-Pulmonary Institute (CPI), Giessen, Germany.
Sci Data ; 9(1): 776, 2022 12 21.
Article in English | MEDLINE | ID: covidwho-2185972
ABSTRACT
Anonymization has the potential to foster the sharing of medical data. State-of-the-art methods use mathematical models to modify data to reduce privacy risks. However, the degree of protection must be balanced against the impact on statistical properties. We studied an extreme case of this trade-off the statistical validity of an open medical dataset based on the German National Pandemic Cohort Network (NAPKON), which was prepared for publication using a strong anonymization procedure. Descriptive statistics and results of regression analyses were compared before and after anonymization of multiple variants of the original dataset. Despite significant differences in value distributions, the statistical bias was found to be small in all cases. In the regression analyses, the median absolute deviations of the estimated adjusted odds ratios for different sample sizes ranged from 0.01 [minimum = 0, maximum = 0.58] to 0.52 [minimum = 0.25, maximum = 0.91]. Disproportionate impact on the statistical properties of data is a common argument against the use of anonymization. Our analysis demonstrates that anonymization can actually preserve validity of statistical results in relatively low-dimensional data.
Subject(s)

Full text: Available Collection: International databases Database: MEDLINE Main subject: COVID-19 Type of study: Cohort study / Experimental Studies / Observational study / Prognostic study / Systematic review/Meta Analysis Topics: Variants Limits: Humans Language: English Journal: Sci Data Year: 2022 Document Type: Article Affiliation country: S41597-022-01669-9

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: COVID-19 Type of study: Cohort study / Experimental Studies / Observational study / Prognostic study / Systematic review/Meta Analysis Topics: Variants Limits: Humans Language: English Journal: Sci Data Year: 2022 Document Type: Article Affiliation country: S41597-022-01669-9