Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 377
Filter
1.
AMIA Jt Summits Transl Sci Proc ; 2024: 575-583, 2024.
Article in English | MEDLINE | ID: mdl-38827044

ABSTRACT

Genome-wide association studies (GWAS) remain a popular method for identifying novel genetic associations with human phenotypes and have provided many insights into the etiology of many diseases. However, GWAS provide limited support for how a genetic association might contribute to disease due to inherent limitations, such as linkage disequilibrium. As such, many methods that operate on GWAS summary statistics have been developed to generate evidence for functional pathways or for variants of interest, but they require defining the genomic region bounds for loci of interest. At present, there are limited methods for determining these bounds in a rigorous, reproducible way. We present a novel statistical method, Statistical Analysis for Bayesian Estimation of Regions (SABER), that uses Bayesian Gaussian mixture models to reproducibly generate ratios that quantify whether particular genomic positions represent the bounds of loci of interest and can be used to delineate genomic regions for downstream analyses.

2.
AMIA Jt Summits Transl Sci Proc ; 2024: 211-220, 2024.
Article in English | MEDLINE | ID: mdl-38827072

ABSTRACT

Fairness is crucial in machine learning to prevent bias based on sensitive attributes in classifier predictions. However, the pursuit of strict fairness often sacrifices accuracy, particularly when significant prevalence disparities exist among groups, making classifiers less practical. For example, Alzheimer's disease (AD) is more prevalent in women than men, making equal treatment inequitable for females. Accounting for prevalence ratios among groups is essential for fair decision-making. In this paper, we introduce prior knowledge for fairness, which incorporates prevalence ratio information into the fairness constraint within the Empirical Risk Minimization (ERM) framework. We develop the Prior-knowledge-guided Fair ERM (PFERM) framework, aiming to minimize expected risk within a specified function class while adhering to a prior-knowledge-guided fairness constraint. This approach strikes a flexible balance between accuracy and fairness. Empirical results confirm its effectiveness in preserving fairness without compromising accuracy.

3.
AMIA Jt Summits Transl Sci Proc ; 2024: 623-631, 2024.
Article in English | MEDLINE | ID: mdl-38827078

ABSTRACT

Alzheimer's disease (AD) is the most prevalent neurodegenerative disease worldwide, with one in nine people over the age of 65 living with the disease in 2023. In this study, we used a phenome wide association study (PheWAS) approach to identify cross-phenotype between previously identified genetic associations for AD and electronic health record (EHR) diagnoses from the UK Biobank (UKBB) (n=361,194 of European ancestry) and the eMERGE Network (n=105,108 of diverse ancestry). Based on 497 previously identified AD-associated variants from the Alzheimer's Disease Variant Portal (ADVP), we found significant associations primarily in immune and cardiac related diseases in our PheWAS. Replicating variants have widespread impacts on immune genes in diverse tissue types. This study demonstrates the potential of using the PheWAS strategy to improve our understanding of AD progression as well as identify potential drug repurposing opportunities for new treatment and disease prevention strategies.

4.
medRxiv ; 2024 Apr 10.
Article in English | MEDLINE | ID: mdl-38645167

ABSTRACT

Apart from ancestry, personal or environmental covariates may contribute to differences in polygenic score (PGS) performance. We analyzed effects of covariate stratification and interaction on body mass index (BMI) PGS (PGSBMI) across four cohorts of European (N=491,111) and African (N=21,612) ancestry. Stratifying on binary covariates and quintiles for continuous covariates, 18/62 covariates had significant and replicable R2 differences among strata. Covariates with the largest differences included age, sex, blood lipids, physical activity, and alcohol consumption, with R2 being nearly double between best and worst performing quintiles for certain covariates. 28 covariates had significant PGSBMI-covariate interaction effects, modifying PGSBMI effects by nearly 20% per standard deviation change. We observed overlap between covariates that had significant R2 differences among strata and interaction effects - across all covariates, their main effects on BMI were correlated with their maximum R2 differences and interaction effects (0.56 and 0.58, respectively), suggesting high-PGSBMI individuals have highest R2 and increase in PGS effect. Using quantile regression, we show the effect of PGSBMI increases as BMI itself increases, and that these differences in effects are directly related to differences in R2 when stratifying by different covariates. Given significant and replicable evidence for context-specific PGSBMI performance and effects, we investigated ways to increase model performance taking into account non-linear effects. Machine learning models (neural networks) increased relative model R2 (mean 23%) across datasets. Finally, creating PGSBMI directly from GxAge GWAS effects increased relative R2 by 7.8%. These results demonstrate that certain covariates, especially those most associated with BMI, significantly affect both PGSBMI performance and effects across diverse cohorts and ancestries, and we provide avenues to improve model performance that consider these effects.

5.
J Med Internet Res ; 26: e46777, 2024 Apr 18.
Article in English | MEDLINE | ID: mdl-38635981

ABSTRACT

BACKGROUND: As global populations age and become susceptible to neurodegenerative illnesses, new therapies for Alzheimer disease (AD) are urgently needed. Existing data resources for drug discovery and repurposing fail to capture relationships central to the disease's etiology and response to drugs. OBJECTIVE: We designed the Alzheimer's Knowledge Base (AlzKB) to alleviate this need by providing a comprehensive knowledge representation of AD etiology and candidate therapeutics. METHODS: We designed the AlzKB as a large, heterogeneous graph knowledge base assembled using 22 diverse external data sources describing biological and pharmaceutical entities at different levels of organization (eg, chemicals, genes, anatomy, and diseases). AlzKB uses a Web Ontology Language 2 ontology to enforce semantic consistency and allow for ontological inference. We provide a public version of AlzKB and allow users to run and modify local versions of the knowledge base. RESULTS: AlzKB is freely available on the web and currently contains 118,902 entities with 1,309,527 relationships between those entities. To demonstrate its value, we used graph data science and machine learning to (1) propose new therapeutic targets based on similarities of AD to Parkinson disease and (2) repurpose existing drugs that may treat AD. For each use case, AlzKB recovers known therapeutic associations while proposing biologically plausible new ones. CONCLUSIONS: AlzKB is a new, publicly available knowledge resource that enables researchers to discover complex translational associations for AD drug discovery. Through 2 use cases, we show that it is a valuable tool for proposing novel therapeutic hypotheses based on public biomedical knowledge.


Subject(s)
Alzheimer Disease , Humans , Alzheimer Disease/drug therapy , Alzheimer Disease/genetics , Pattern Recognition, Automated , Knowledge Bases , Machine Learning , Knowledge
6.
Nat Commun ; 15(1): 2604, 2024 Mar 23.
Article in English | MEDLINE | ID: mdl-38521789

ABSTRACT

The complex biological mechanisms underlying human brain aging remain incompletely understood. This study investigated the genetic architecture of three brain age gaps (BAG) derived from gray matter volume (GM-BAG), white matter microstructure (WM-BAG), and functional connectivity (FC-BAG). We identified sixteen genomic loci that reached genome-wide significance (P-value < 5×10-8). A gene-drug-disease network highlighted genes linked to GM-BAG for treating neurodegenerative and neuropsychiatric disorders and WM-BAG genes for cancer therapy. GM-BAG displayed the most pronounced heritability enrichment in genetic variants within conserved regions. Oligodendrocytes and astrocytes, but not neurons, exhibited notable heritability enrichment in WM and FC-BAG, respectively. Mendelian randomization identified potential causal effects of several chronic diseases on brain aging, such as type 2 diabetes on GM-BAG and AD on WM-BAG. Our results provide insights into the genetics of human brain aging, with clinical implications for potential lifestyle and therapeutic interventions. All results are publicly available at https://labs.loni.usc.edu/medicine .


Subject(s)
Diabetes Mellitus, Type 2 , White Matter , Humans , Brain , Gray Matter , Magnetic Resonance Imaging/methods , White Matter/physiology , Mendelian Randomization Analysis
7.
medRxiv ; 2024 Jan 10.
Article in English | MEDLINE | ID: mdl-38260403

ABSTRACT

Genome-wide association studies (GWAS) have been instrumental in identifying genetic associations for various diseases and traits. However, uncovering genetic underpinnings among traits beyond univariate phenotype associations remains a challenge. Multi-phenotype associations (MPA), or genetic pleiotropy, offer important insights into shared genes and pathways among traits, enhancing our understanding of genetic architectures of complex diseases. GWAS of biobank-linked electronic health record (EHR) data are increasingly being utilized to identify MPA among various traits and diseases. However, methodologies that can efficiently take advantage of distributed EHR to detect MPA are still lacking. Here, we introduce mixWAS, a novel algorithm that efficiently and losslessly integrates multiple EHRs via summary statistics, allowing the detection of MPA among mixed phenotypes while accounting for heterogeneities across EHRs. Simulations demonstrate that mixWAS outperforms the widely used MPA detection method, Phenome-wide association study (PheWAS), across diverse scenarios. Applying mixWAS to data from seven EHRs in the US, we identified 4,534 MPA among blood lipids, BMI, and circulatory diseases. Validation in an independent EHR data from UK confirmed 97.7% of the associations. mixWAS fundamentally improves the detection of MPA and is available as a free, open-source software.

8.
Radiology ; 310(1): e223170, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38259208

ABSTRACT

Despite recent advancements in machine learning (ML) applications in health care, there have been few benefits and improvements to clinical medicine in the hospital setting. To facilitate clinical adaptation of methods in ML, this review proposes a standardized framework for the step-by-step implementation of artificial intelligence into the clinical practice of radiology that focuses on three key components: problem identification, stakeholder alignment, and pipeline integration. A review of the recent literature and empirical evidence in radiologic imaging applications justifies this approach and offers a discussion on structuring implementation efforts to help other hospital practices leverage ML to improve patient care. Clinical trial registration no. 04242667 © RSNA, 2024 Supplemental material is available for this article.


Subject(s)
Artificial Intelligence , Radiology , Humans , Radiography , Algorithms , Machine Learning
9.
Br J Nutr ; 131(1): 156-162, 2024 01 14.
Article in English | MEDLINE | ID: mdl-37519237

ABSTRACT

Though diet quality is widely recognised as linked to risk of chronic disease, health systems have been challenged to find a user-friendly, efficient way to obtain information about diet. The Penn Healthy Diet (PHD) survey was designed to fill this void. The purposes of this pilot project were to assess the patient experience with the PHD, to validate the accuracy of the PHD against related items in a diet recall and to explore scoring algorithms with relationship to the Healthy Eating Index (HEI)-2015 computed from the recall data. A convenience sample of participants in the Penn Health BioBank was surveyed with the PHD, the Automated Self-Administered 24-hour recall (ASA24) and experience questions. Kappa scores and Spearman correlations were used to compare related questions in the PHD to the ASA24. Numerical scoring, regression tree and weighted regressions were computed for scoring. Participants assessed the PHD as easy to use and were willing to repeat the survey at least annually. The three scoring algorithms were strongly associated with HEI-2015 scores using National Health and Nutrition Examination Survey 2017-2018 data from which the PHD was developed and moderately associated with the pilot replication data. The PHD is acceptable to participants and at least moderately correlated with the HEI-2015. Further validation in a larger sample will enable the selection of the strongest scoring approach.


Subject(s)
Diet, Healthy , Diet , Humans , Nutrition Surveys , Pilot Projects , Diet Surveys
10.
Pac Symp Biocomput ; 29: 594-610, 2024.
Article in English | MEDLINE | ID: mdl-38160309

ABSTRACT

Access to safe and effective antiretroviral therapy (ART) is a cornerstone in the global response to the HIV pandemic. Among people living with HIV, there is considerable interindividual variability in absolute CD4 T-cell recovery following initiation of virally suppressive ART. The contribution of host genetics to this variability is not well understood. We explored the contribution of a polygenic score which was derived from large, publicly available summary statistics for absolute lymphocyte count from individuals in the general population (PGSlymph) due to a lack of publicly available summary statistics for CD4 T-cell count. We explored associations with baseline CD4 T-cell count prior to ART initiation (n=4959) and change from baseline to week 48 on ART (n=3274) among treatment-naïve participants in prospective, randomized ART studies of the AIDS Clinical Trials Group. We separately examined an African-ancestry-derived and a European-ancestry-derived PGSlymph, and evaluated their performance across all participants, and also in the African and European ancestral groups separately. Multivariate models that included PGSlymph, baseline plasma HIV-1 RNA, age, sex, and 15 principal components (PCs) of genetic similarity explained ∼26-27% of variability in baseline CD4 T-cell count, but PGSlymph accounted for <1% of this variability. Models that also included baseline CD4 T-cell count explained ∼7-9% of variability in CD4 T-cell count increase on ART, but PGSlymph accounted for <1% of this variability. In univariate analyses, PGSlymph was not significantly associated with baseline or change in CD4 T-cell count. Among individuals of African ancestry, the African PGSlymph term in the multivariate model was significantly associated with change in CD4 T-cell count while not significant in the univariate model. When applied to lymphocyte count in a general medical biobank population (Penn Medicine BioBank), PGSlymph explained ∼6-10% of variability in multivariate models (including age, sex, and PCs) but only ∼1% in univariate models. In summary, a lymphocyte count PGS derived from the general population was not consistently associated with CD4 T-cell recovery on ART. Nonetheless, adjusting for clinical covariates is quite important when estimating such polygenic effects.


Subject(s)
Anti-HIV Agents , HIV Infections , Humans , CD4-Positive T-Lymphocytes , Prospective Studies , Anti-HIV Agents/therapeutic use , Computational Biology , HIV Infections/drug therapy , HIV Infections/genetics , CD4 Lymphocyte Count , Viral Load
11.
Pac Symp Biocomput ; 29: 611-626, 2024.
Article in English | MEDLINE | ID: mdl-38160310

ABSTRACT

Polygenic risk scores (PRS) have predominantly been derived from genome-wide association studies (GWAS) conducted in European ancestry (EUR) individuals. In this study, we present an in-depth evaluation of PRS based on multi-ancestry GWAS for five cardiometabolic phenotypes in the Penn Medicine BioBank (PMBB) followed by a phenome-wide association study (PheWAS). We examine the PRS performance across all individuals and separately in African ancestry (AFR) and EUR ancestry groups. For AFR individuals, PRS derived using the multi-ancestry LD panel showed a higher effect size for four out of five PRSs (DBP, SBP, T2D, and BMI) than those derived from the AFR LD panel. In contrast, for EUR individuals, the multi-ancestry LD panel PRS demonstrated a higher effect size for two out of five PRSs (SBP and T2D) compared to the EUR LD panel. These findings underscore the potential benefits of utilizing a multi-ancestry LD panel for PRS derivation in diverse genetic backgrounds and demonstrate overall robustness in all individuals. Our results also revealed significant associations between PRS and various phenotypic categories. For instance, CAD PRS was linked with 18 phenotypes in AFR and 82 in EUR, while T2D PRS correlated with 84 phenotypes in AFR and 78 in EUR. Notably, associations like hyperlipidemia, renal failure, atrial fibrillation, coronary atherosclerosis, obesity, and hypertension were observed across different PRSs in both AFR and EUR groups, with varying effect sizes and significance levels. However, in AFR individuals, the strength and number of PRS associations with other phenotypes were generally reduced compared to EUR individuals. Our study underscores the need for future research to prioritize 1) conducting GWAS in diverse ancestry groups and 2) creating a cosmopolitan PRS methodology that is universally applicable across all genetic backgrounds. Such advances will foster a more equitable and personalized approach to precision medicine.


Subject(s)
Diabetes Mellitus, Type 2 , Hypertension , Humans , Genetic Risk Score , Genome-Wide Association Study/methods , Genetic Predisposition to Disease , Precision Medicine , Multifactorial Inheritance , Computational Biology , Phenotype , Hypertension/genetics , Diabetes Mellitus, Type 2/genetics , Risk Factors
12.
Proc Natl Acad Sci U S A ; 120(52): e2300842120, 2023 Dec 26.
Article in English | MEDLINE | ID: mdl-38127979

ABSTRACT

Normal and pathologic neurobiological processes influence brain morphology in coordinated ways that give rise to patterns of structural covariance (PSC) across brain regions and individuals during brain aging and diseases. The genetic underpinnings of these patterns remain largely unknown. We apply a stochastic multivariate factorization method to a diverse population of 50,699 individuals (12 studies and 130 sites) and derive data-driven, multi-scale PSCs of regional brain size. PSCs were significantly correlated with 915 genomic loci in the discovery set, 617 of which are newly identified, and 72% were independently replicated. Key pathways influencing PSCs involve reelin signaling, apoptosis, neurogenesis, and appendage development, while pathways of breast cancer indicate potential interplays between brain metastasis and PSCs associated with neurodegeneration and dementia. Using support vector machines, multi-scale PSCs effectively derive imaging signatures of several brain diseases. Our results elucidate genetic and biological underpinnings that influence structural covariance patterns in the human brain.


Subject(s)
Brain Neoplasms , Magnetic Resonance Imaging , Humans , Magnetic Resonance Imaging/methods , Brain/pathology , Brain Mapping/methods , Genomics , Brain Neoplasms/pathology
13.
medRxiv ; 2023 Nov 07.
Article in English | MEDLINE | ID: mdl-37986758

ABSTRACT

Alzheimer's disease (AD) is the most prevalent neurodegenerative disease worldwide, with one in nine people over the age of 65 living with the disease in 2023. In this study, we used a phenome wide association study (PheWAS) approach to identify cross-phenotype associations between previously identified genetic AD and for electronic health record (EHR) diagnoses from the UK Biobank (UKBB) (n=361,194 of European ancestry) and the eMERGE Network (n=105,108 of diverse ancestry). Based on 497 previously identified AD-associated variants from the Alzheimer's Disease Variant Portal (ADVP), we found significant associations primarily in immune and cardiac related diseases in our PheWAS. Replicating variants have widespread impacts on immune genes in diverse tissue types. This study demonstrates the potential of using the PheWAS strategy to improve our understanding of AD progression as well as identify potential drug repurposing opportunities for new treatment and disease prevention strategies.

15.
medRxiv ; 2023 Oct 02.
Article in English | MEDLINE | ID: mdl-37873439

ABSTRACT

Background: High on-treatment platelet reactivity (HTPR) with clopidogrel is predictive of ischemic events in adults with coronary artery disease. Despite strong data suggesting HTPR varies with ethnicity, including clinical and genetic variables, no genome-wide association study (GWAS) of clopidogrel response has been performed among Caribbean Hispanics. This study aimed to identify genetic predictors of HTPR in a cohort of Caribbean Hispanic cardiovascular patients from Puerto Rico. Methods: Local Ancestry inference (LAI) and traditional GWASs were performed on a cohort of 511 clopidogrel-treated patients, stratified based on their P2Y12 reaction units (PRU) into responders and non-responders (HTPR). Results: The LAI GWAS identified variants within the CYP2C19 region associated with HTPR, predominantly driven by individuals of European ancestry and absent in those with native ancestry. Incorporating local ancestry adjustment notably enhanced our ability to detect associations. While no loci reached traditional GWAS significance, three variants showed suggestive significance at chromosomes 3, 14 and 22 (OSBPL10 rs1376606, DERL3 rs5030613, and RGS6 rs9323567). In addition, a variant in the UNC5C gene on chromosome 4 was associated with an increased risk of HTPR. These findings were not identified in other cohorts, highlighting the unique genetic landscape of Caribbean Hispanics. Conclusion: This is the first GWAS of clopidogrel response in Hispanics, confirming the relevance of the CYP2C19 cluster, particularly among those with European ancestry, and also identifying novel markers in a diverse patient population. Further studies are warranted to replicate our findings in other diverse cohorts and meta-analyses.

16.
Sci Rep ; 13(1): 18532, 2023 10 28.
Article in English | MEDLINE | ID: mdl-37898691

ABSTRACT

Clostridioides difficile (C. diff.) infection (CDI) is a leading cause of hospital acquired diarrhea in North America and Europe and a major cause of morbidity and mortality. Known risk factors do not fully explain CDI susceptibility, and genetic susceptibility is suggested by the fact that some patients with colons that are colonized with C. diff. do not develop any infection while others develop severe or recurrent infections. To identify common genetic variants associated with CDI, we performed a genome-wide association analysis in 19,861 participants (1349 cases; 18,512 controls) from the Electronic Medical Records and Genomics (eMERGE) Network. Using logistic regression, we found strong evidence for genetic variation in the DRB locus of the MHC (HLA) II region that predisposes individuals to CDI (P > 1.0 × 10-14; OR 1.56). Altered transcriptional regulation in the HLA region may play a role in conferring susceptibility to this opportunistic enteric pathogen.


Subject(s)
Clostridium Infections , Genome-Wide Association Study , Humans , Clostridium Infections/genetics , Diarrhea , Histocompatibility Antigens , HLA Antigens/genetics , Histocompatibility Antigens Class II , Genetic Variation
17.
Am J Hum Genet ; 110(10): 1628-1647, 2023 10 05.
Article in English | MEDLINE | ID: mdl-37757824

ABSTRACT

Pharmacogenomics (PGx) is an integral part of precision medicine and contributes to the maximization of drug efficacy and reduction of adverse drug event risk. Accurate information on PGx allele frequencies improves the implementation of PGx. Nonetheless, curating such information from published allele data is time and resource intensive. The limited number of allelic variants in most studies leads to an underestimation of certain alleles. We applied the Pharmacogenomics Clinical Annotation Tool (PharmCAT) on an integrated 200K UK Biobank genetic dataset (N = 200,044). Based on PharmCAT results, we estimated PGx frequencies (alleles, diplotypes, phenotypes, and activity scores) for 17 pharmacogenes in five biogeographic groups: European, Central/South Asian, East Asian, Afro-Caribbean, and Sub-Saharan African. PGx frequencies were distinct for each biogeographic group. Even biogeographic groups with similar proportions of phenotypes were driven by different sets of dominant PGx alleles. PharmCAT also identified "no-function" alleles that were rare or seldom tested in certain groups by previous studies, e.g., SLCO1B1∗31 in the Afro-Caribbean (3.0%) and Sub-Saharan African (3.9%) groups. Estimated PGx frequencies are disseminated via the PharmGKB (The Pharmacogenomics Knowledgebase: www.pharmgkb.org). We demonstrate that genetic biobanks such as the UK Biobank are a robust resource for estimating PGx frequencies. Improving our understanding of PGx allele and phenotype frequencies provides guidance for future PGx studies and clinical genetic test panel design, and better serves individuals from wider biogeographic backgrounds.


Subject(s)
Biological Specimen Banks , Pharmacogenetics , Humans , Pharmacogenetics/methods , Alleles , Precision Medicine/methods , Gene Frequency/genetics , Liver-Specific Organic Anion Transporter 1
18.
Nat Commun ; 14(1): 5562, 2023 09 09.
Article in English | MEDLINE | ID: mdl-37689782

ABSTRACT

Genes act in concert with each other in specific contexts to perform their functions. Determining how these genes influence complex traits requires a mechanistic understanding of expression regulation across different conditions. It has been shown that this insight is critical for developing new therapies. Transcriptome-wide association studies have helped uncover the role of individual genes in disease-relevant mechanisms. However, modern models of the architecture of complex traits predict that gene-gene interactions play a crucial role in disease origin and progression. Here we introduce PhenoPLIER, a computational approach that maps gene-trait associations and pharmacological perturbation data into a common latent representation for a joint analysis. This representation is based on modules of genes with similar expression patterns across the same conditions. We observe that diseases are significantly associated with gene modules expressed in relevant cell types, and our approach is accurate in predicting known drug-disease pairs and inferring mechanisms of action. Furthermore, using a CRISPR screen to analyze lipid regulation, we find that functionally important players lack associations but are prioritized in trait-associated modules by PhenoPLIER. By incorporating groups of co-expressed genes, PhenoPLIER can contextualize genetic associations and reveal potential targets missed by single-gene strategies.


Subject(s)
Clustered Regularly Interspaced Short Palindromic Repeats , Epistasis, Genetic , Causality , Gene Regulatory Networks , Transcriptome
19.
medRxiv ; 2023 Aug 10.
Article in English | MEDLINE | ID: mdl-37609227

ABSTRACT

Loss of function variants in the NF1 gene cause neurofibromatosis type 1 (NF1), a genetic disorder characterized by complete penetrance, prevalence of 1 in 3,000, characteristic physical exam findings, and a substantially increased risk for malignancy. However, our understanding of the disorder is entirely based on patients ascertained through phenotype-first approaches. Leveraging a genotype-first approach in two large patient cohorts, we demonstrate unexpectedly high prevalence (1 in 450-750) of NF1 pathogenic variants. Half were identified in individuals lacking clinical features of NF1, with many appearing to have post-zygotic mosaicism for the identified variant. Incidentally discovered variants were not associated with classic NF1 features but were associated with an increased incidence of malignancy compared to a control population. Our findings suggest that NF1 pathogenic variants are substantially more common than previously thought, often characterized by somatic mosaicism and reduced penetrance, and are important contributors to cancer risk in the general population.

20.
medRxiv ; 2023 Oct 02.
Article in English | MEDLINE | ID: mdl-37503172

ABSTRACT

Heart failure (HF) is a complex trait, influenced by environmental and genetic factors, that affects over 30 million individuals worldwide. Historically, the genetics of HF have been studied in Mendelian forms of disease, where rare genetic variants have been linked to familial cardiomyopathies. More recently, genome-wide association studies (GWAS) have successfully identified common genetic variants associated with risk of HF. However, the relative importance of genetic variants across the allele-frequency spectrum remains incompletely characterized. Here, we report the results of common- and rare-variant association studies of all-cause heart failure, applying recently developed methods to quantify the heritability of HF attributable to different classes of genetic variation. We combine GWAS data across multiple populations including 207,346 individuals with HF and 2,151,210 without, identifying 176 risk loci at genome-wide significance (p < 5×10-8). Signals at newly identified common-variant loci include coding variants in Mendelian cardiomyopathy genes (MYBPC3, BAG3), as well as regulators of lipoprotein (LPL) and glucose metabolism (GIPR, GLP1R), and are enriched in cardiac, muscle, nerve, and vascular tissues, as well as myocyte and adipocyte cell types. Gene burden studies across three biobanks (PMBB, UKB, AOU) including 27,208 individuals with HF and 349,126 without uncover exome-wide significant (p < 3.15×10-6) associations for HF and rare predicted loss-of-function (pLoF) variants in TTN, MYBPC3, FLNC, and BAG3. Total burden heritability of rare coding variants (2.2%, 95% CI 0.99-3.5%) is highly concentrated in a small set of Mendelian cardiomyopathy genes, and is lower than heritability attributable to common variants (4.3%, 95% CI 3.9-4.7%) which is more diffusely spread throughout the genome. Finally, we demonstrate that common-variant background, in the form of a polygenic risk score (PRS), significantly modifies the risk of HF among carriers of pathogenic truncating variants in the Mendelian cardiomyopathy gene TTN. These findings suggest a significant polygenic component to HF exists that is not captured by current clinical genetic testing.

SELECTION OF CITATIONS
SEARCH DETAIL
...