Your browser doesn't support javascript.
loading
Tuning Privacy-Utility Tradeoff in Genomic Studies Using Selective SNP Hiding.
Alserr, Nour Almadhoun; Kale, Gulce; Mutlu, Onur; Tastan, Oznur; Ayday, Erman.
Affiliation
  • Alserr NA; Department of Information Technology and Electrical Engineering, ETH Zurich, Zurich 8006, Switzerland.
  • Kale G; Computer Engineering Department, Bilkent University, Ankara 06800, Turkey.
  • Mutlu O; Department of Information Technology and Electrical Engineering, ETH Zurich, Zurich 8006, Switzerland.
  • Tastan O; Computer Engineering Department, Bilkent University, Ankara 06800, Turkey.
  • Ayday E; Computer Science and Engineering, Sabanci University, Istanbul 34956, Turkey.
Article in En | MEDLINE | ID: mdl-37383349
Researchers need a rich trove of genomic datasets that they can leverage to gain a better understanding of the genetic basis of the human genome and identify associations between phenol-types and specific parts of DNA. However, sharing genomic datasets that include sensitive genetic or medical information of individuals can lead to serious privacy-related consequences if data lands in the wrong hands. Restricting access to genomic datasets is one solution, but this greatly reduces their usefulness for research purposes. To allow sharing of genomic datasets while addressing these privacy concerns, several studies propose privacy-preserving mechanisms for data sharing. Differential privacy is one of such mechanisms that formalize rigorous mathematical foundations to provide privacy guarantees while sharing aggregated statistical information about a dataset. Nevertheless, it has been shown that the original privacy guarantees of DP-based solutions degrade when there are dependent tuples in the dataset, which is a common scenario for genomic datasets (due to the existence of family members). In this work, we introduce a new mechanism to mitigate the vulnerabilities of the inference attacks on differentially private query results from genomic datasets including dependent tuples. We propose a utility-maximizing and privacy-preserving approach for sharing statistics by hiding selective SNPs of the family members as they participate in a genomic dataset. By evaluating our mechanism on a real-world genomic dataset, we empirically demonstrate that our proposed mechanism can achieve up to 40% better privacy than state-of-the-art DP-based solutions, while near-optimally minimizing utility loss.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Proc Asia Pac Bioinform Conf Year: 2023 Document type: Article Affiliation country: Switzerland Country of publication: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Proc Asia Pac Bioinform Conf Year: 2023 Document type: Article Affiliation country: Switzerland Country of publication: United kingdom