Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 195
Filter
1.
Diabetes Care ; 47(6): 1042-1047, 2024 Jun 01.
Article in English | MEDLINE | ID: mdl-38652672

ABSTRACT

OBJECTIVE: To identify genetic risk factors for incident cardiovascular disease (CVD) among people with type 2 diabetes (T2D). RESEARCH DESIGN AND METHODS: We conducted a multiancestry time-to-event genome-wide association study for incident CVD among people with T2D. We also tested 204 known coronary artery disease (CAD) variants for association with incident CVD. RESULTS: Among 49,230 participants with T2D, 8,956 had incident CVD events (event rate 18.2%). We identified three novel genetic loci for incident CVD: rs147138607 (near CACNA1E/ZNF648, hazard ratio [HR] 1.23, P = 3.6 × 10-9), rs77142250 (near HS3ST1, HR 1.89, P = 9.9 × 10-9), and rs335407 (near TFB1M/NOX3, HR 1.25, P = 1.5 × 10-8). Among 204 known CAD loci, 5 were associated with incident CVD in T2D (multiple comparison-adjusted P < 0.00024, 0.05/204). A standardized polygenic score of these 204 variants was associated with incident CVD with HR 1.14 (P = 1.0 × 10-16). CONCLUSIONS: The data point to novel and known genomic regions associated with incident CVD among individuals with T2D.


Subject(s)
Cardiovascular Diseases , Diabetes Mellitus, Type 2 , Genome-Wide Association Study , Humans , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/epidemiology , Diabetes Mellitus, Type 2/complications , Cardiovascular Diseases/genetics , Cardiovascular Diseases/epidemiology , Female , Male , Middle Aged , Aged , Polymorphism, Single Nucleotide
2.
Diabetes ; 73(6): 993-1001, 2024 Jun 01.
Article in English | MEDLINE | ID: mdl-38470993

ABSTRACT

African Americans (AAs) have been underrepresented in polygenic risk score (PRS) studies. Here, we integrated genome-wide data from multiple observational studies on type 2 diabetes (T2D), encompassing a total of 101,987 AAs, to train and optimize an AA-focused T2D PRS (PRSAA), using a Bayesian polygenic modeling method. We further tested the score in three independent studies with a total of 7,275 AAs and compared the PRSAA with other published scores. Results show that a 1-SD increase in the PRSAA was associated with 40-60% increase in the odds of T2D (odds ratio [OR] 1.60, 95% CI 1.37-1.88; OR 1.40, 95% CI 1.16-1.70; and OR 1.45, 95% CI 1.30-1.62) across three testing cohorts. These models captured 1.0-2.6% of the variance (R2) in T2D on the liability scale. The positive predictive values for three calculated score thresholds (the top 2%, 5%, and 10%) ranged from 14 to 35%. The PRSAA, in general, performed similarly to existing T2D PRS. The need remains for larger data sets to continue to evaluate the utility of within-ancestry scores in the AA population.


Subject(s)
Black or African American , Diabetes Mellitus, Type 2 , Genetic Predisposition to Disease , Genome-Wide Association Study , Multifactorial Inheritance , Humans , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/epidemiology , Black or African American/genetics , Multifactorial Inheritance/genetics , Male , Female , Middle Aged , Bayes Theorem , Risk Factors , Polymorphism, Single Nucleotide , Adult , Aged
3.
Nat Metab ; 6(4): 659-669, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38499766

ABSTRACT

Metformin is a widely prescribed anti-diabetic medicine that also reduces body weight. There is ongoing debate about the mechanisms that mediate metformin's effects on energy balance. Here, we show that metformin is a powerful pharmacological inducer of the anorexigenic metabolite N-lactoyl-phenylalanine (Lac-Phe) in cells, in mice and two independent human cohorts. Metformin drives Lac-Phe biosynthesis through the inhibition of complex I, increased glycolytic flux and intracellular lactate mass action. Intestinal epithelial CNDP2+ cells, not macrophages, are the principal in vivo source of basal and metformin-inducible Lac-Phe. Genetic ablation of Lac-Phe biosynthesis in male mice renders animals resistant to the effects of metformin on food intake and body weight. Lastly, mediation analyses support a role for Lac-Phe as a downstream effector of metformin's effects on body mass index in participants of a large population-based observational cohort, the Multi-Ethnic Study of Atherosclerosis. Together, these data establish Lac-Phe as a critical mediator of the body weight-lowering effects of metformin.


Subject(s)
Body Weight , Eating , Metformin , Metformin/pharmacology , Animals , Humans , Body Weight/drug effects , Mice , Eating/drug effects , Male , Hypoglycemic Agents/pharmacology , Hypoglycemic Agents/therapeutic use , Phenylalanine/pharmacology , Phenylalanine/metabolism , Dipeptides/pharmacology
4.
medRxiv ; 2024 Feb 28.
Article in English | MEDLINE | ID: mdl-38464285

ABSTRACT

Background: Studies have identified individual blood biomarkers associated with chronic obstructive pulmonary disease (COPD) and related phenotypes. However, complex diseases such as COPD typically involve changes in multiple molecules with interconnections that may not be captured when considering single molecular features. Methods: Leveraging proteomic data from 3,173 COPDGene Non-Hispanic White (NHW) and African American (AA) participants, we applied sparse multiple canonical correlation network analysis (SmCCNet) to 4,776 proteins assayed on the SomaScan v4.0 platform to derive sparse networks of proteins associated with current vs. former smoking status, airflow obstruction, and emphysema quantitated from high-resolution computed tomography scans. We then used NetSHy, a dimension reduction technique leveraging network topology, to produce summary scores of each proteomic network, referred to as NetSHy scores. We next performed genome-wide association study (GWAS) to identify variants associated with the NetSHy scores, or network quantitative trait loci (nQTLs). Finally, we evaluated the replicability of the networks in an independent cohort, SPIROMICS. Results: We identified networks of 13 to 104 proteins for each phenotype and exposure in NHW and AA, and the derived NetSHy scores significantly associated with the variable of interests. Networks included known (sRAGE, ALPP, MIP1) and novel molecules (CA10, CPB1, HIS3, PXDN) and interactions involved in COPD pathogenesis. We observed 7 nQTL loci associated with NetSHy scores, 4 of which remained after conditional analysis. Networks for smoking status and emphysema, but not airflow obstruction, demonstrated a high degree of replicability across race groups and cohorts. Conclusions: In this work, we apply state-of-the-art molecular network generation and summarization approaches to proteomic data from COPDGene participants to uncover protein networks associated with COPD phenotypes. We further identify genetic associations with networks. This work discovers protein networks containing known and novel proteins and protein interactions associated with clinically relevant COPD phenotypes across race groups and cohorts.

5.
bioRxiv ; 2023 Nov 04.
Article in English | MEDLINE | ID: mdl-37961394

ABSTRACT

Metformin is a widely prescribed anti-diabetic medicine that also reduces body weight. The mechanisms that mediate metformin's effects on energy balance remain incompletely defined. Here we show that metformin is a powerful pharmacological inducer of the anorexigenic metabolite Lac-Phe in mice as well as in two independent human cohorts. In cell culture, metformin drives Lac-Phe biosynthesis via inhibition of complex I, increased glycolytic flux, and intracellular lactate mass action. Other biguanides and structurally distinct inhibitors of oxidative phosphorylation also increase Lac-Phe levels in vitro. Genetic ablation of CNDP2, the principal biosynthetic enzyme for Lac-Phe, in mice renders animals resistant to metformin's anorexigenic and anti-obesity effects. Mediation analyses also support a role for Lac-Phe in metformin's effect on body mass index in humans. These data establish the CNDP2/Lac-Phe pathway as a critical mediator of the effects of metformin on energy balance.

6.
Circ Genom Precis Med ; 16(6): e004176, 2023 Dec.
Article in English | MEDLINE | ID: mdl-38014529

ABSTRACT

BACKGROUND: Individuals with type 2 diabetes (T2D) have an increased risk of coronary artery disease (CAD), but questions remain about the underlying pathology. Identifying which CAD loci are modified by T2D in the development of subclinical atherosclerosis (coronary artery calcification [CAC], carotid intima-media thickness, or carotid plaque) may improve our understanding of the mechanisms leading to the increased CAD in T2D. METHODS: We compared the common and rare variant associations of known CAD loci from the literature on CAC, carotid intima-media thickness, and carotid plaque in up to 29 670 participants, including up to 24 157 normoglycemic controls and 5513 T2D cases leveraging whole-genome sequencing data from the Trans-Omics for Precision Medicine program. We included first-order T2D interaction terms in each model to determine whether CAD loci were modified by T2D. The genetic main and interaction effects were assessed using a joint test to determine whether a CAD variant, or gene-based rare variant set, was associated with the respective subclinical atherosclerosis measures and then further determined whether these loci had a significant interaction test. RESULTS: Using a Bonferroni-corrected significance threshold of P<1.6×10-4, we identified 3 genes (ATP1B1, ARVCF, and LIPG) associated with CAC and 2 genes (ABCG8 and EIF2B2) associated with carotid intima-media thickness and carotid plaque, respectively, through gene-based rare variant set analysis. Both ATP1B1 and ARVCF also had significantly different associations for CAC in T2D cases versus controls. No significant interaction tests were identified through the candidate single-variant analysis. CONCLUSIONS: These results highlight T2D as an important modifier of rare variant associations in CAD loci with CAC.


Subject(s)
Atherosclerosis , Coronary Artery Disease , Diabetes Mellitus, Type 2 , Plaque, Atherosclerotic , Humans , Coronary Artery Disease/genetics , Diabetes Mellitus, Type 2/complications , Diabetes Mellitus, Type 2/genetics , Carotid Intima-Media Thickness , Risk Factors , Atherosclerosis/genetics , Genomics
7.
Nat Commun ; 14(1): 7836, 2023 Nov 30.
Article in English | MEDLINE | ID: mdl-38036523

ABSTRACT

African Americans have a significantly higher risk of developing chronic kidney disease, especially focal segmental glomerulosclerosis -, than European Americans. Two coding variants (G1 and G2) in the APOL1 gene play a major role in this disparity. While 13% of African Americans carry the high-risk recessive genotypes, only a fraction of these individuals develops FSGS or kidney failure, indicating the involvement of additional disease modifiers. Here, we show that the presence of the APOL1 p.N264K missense variant, when co-inherited with the G2 APOL1 risk allele, substantially reduces the penetrance of the G1G2 and G2G2 high-risk genotypes by rendering these genotypes low-risk. These results align with prior functional evidence showing that the p.N264K variant reduces the toxicity of the APOL1 high-risk alleles. These findings have important implications for our understanding of the mechanisms of APOL1-associated nephropathy, as well as for the clinical management of individuals with high-risk genotypes that include the G2 allele.


Subject(s)
Glomerulosclerosis, Focal Segmental , Humans , Glomerulosclerosis, Focal Segmental/genetics , Apolipoprotein L1/genetics , Genetic Predisposition to Disease , Risk Factors , Genotype , Apolipoproteins/genetics
8.
Diabetes Care ; 46(11): 1978-1985, 2023 11 01.
Article in English | MEDLINE | ID: mdl-37756531

ABSTRACT

OBJECTIVE: Clonal hematopoiesis of indeterminate potential (CHIP) is an aging-related accumulation of somatic mutations in hematopoietic stem cells, leading to clonal expansion. CHIP presence has been implicated in atherosclerotic coronary heart disease (CHD) and all-cause mortality, but its association with incident type 2 diabetes (T2D) is unknown. We hypothesized that CHIP is associated with elevated risk of T2D. RESEARCH DESIGN AND METHODS: CHIP was derived from whole-genome sequencing of blood DNA in the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine (TOPMed) prospective cohorts. We performed analysis for 17,637 participants from six cohorts, without prior T2D, cardiovascular disease, or cancer. We evaluated baseline CHIP versus no CHIP prevalence with incident T2D, including associations with DNMT3A, TET2, ASXL1, JAK2, and TP53 variants. We estimated multivariable-adjusted hazard ratios (HRs) and 95% CIs with adjustment for age, sex, BMI, smoking, alcohol, education, self-reported race/ethnicity, and combined cohorts' estimates via fixed-effects meta-analysis. RESULTS: Mean (SD) age was 63.4 (11.5) years, 76% were female, and CHIP prevalence was 6.0% (n = 1,055) at baseline. T2D was diagnosed in n = 2,467 over mean follow-up of 9.8 years. Participants with CHIP had 23% (CI 1.04, 1.45) higher risk of T2D than those with no CHIP. Specifically, higher risk was for TET2 (HR 1.48; CI 1.05, 2.08) and ASXL1 (HR 1.76; CI 1.03, 2.99) mutations; DNMT3A was nonsignificant (HR 1.15; CI 0.93, 1.43). Statistical power was limited for JAK2 and TP53 analyses. CONCLUSIONS: CHIP was associated with higher incidence of T2D. CHIP mutations located on genes implicated in CHD and mortality were also related to T2D, suggesting shared aging-related pathology.


Subject(s)
Coronary Disease , Diabetes Mellitus, Type 2 , Humans , Female , Middle Aged , Male , Clonal Hematopoiesis/genetics , Diabetes Mellitus, Type 2/epidemiology , Diabetes Mellitus, Type 2/genetics , Prospective Studies , Hematopoiesis/genetics , Clonal Evolution , Coronary Disease/epidemiology , Coronary Disease/genetics , Mutation
9.
Nutrients ; 15(16)2023 Aug 16.
Article in English | MEDLINE | ID: mdl-37630778

ABSTRACT

Obesity has been linked to the gut microbiome, epigenome, and diet, yet these factors have not been studied together during obesity treatment. Our objective was to evaluate associations among gut microbiota (MB), DNA methylation (DNAme), and diet prior to and during a behavioral weight loss intervention. Adults (n = 47, age 40.9 ± 9.7 years, body mass index (BMI) 33.5 ± 4.5 kg/m2, 77% female) with data collected at baseline (BL) and 3 months (3 m) were included. Fecal MB was assessed via 16S sequencing and whole blood DNAme via the Infinium EPIC array. Food group and nutrient intakes and Healthy Eating Index (HEI) scores were calculated from 7-day diet records. Linear models were used to test for the effect of taxa relative abundance on DNAme and diet cross-sectionally at each time point, adjusting for confounders and a false discovery rate of 5%. Mean weight loss was 6.2 ± 3.9% at 3 m. At BL, one MB taxon, Ruminiclostridium, was associated with DNAme of the genes COL20A1 (r = 0.651, p = 0.029), COL18A1 (r = 0.578, p = 0.044), and NT5E (r = 0.365, p = 0.043). At 3 m, there were 14 unique MB:DNAme associations, such as Akkermansia with DNAme of GUSB (r = -0.585, p = 0.003), CRYL1 (r = -0.419, p = 0.007), C9 (r = -0.439, p = 0.019), and GMDS (r = -0.559, p = 0.046). Among taxa associated with DNAme, no significant relationships were seen with dietary intakes of relevant nutrients, food groups, or HEI scores. Our findings indicate that microbes linked to mucin degradation, short-chain fatty acid production, and body weight are associated with DNAme of phenotypically relevant genes. These relationships offer an initial understanding of the possible routes by which alterations in gut MB may influence metabolism during weight loss.


Subject(s)
Gastrointestinal Microbiome , Microbiota , Adult , Humans , Female , Middle Aged , Male , Epigenome , Diet , Obesity
10.
medRxiv ; 2023 Jul 28.
Article in English | MEDLINE | ID: mdl-37546893

ABSTRACT

BACKGROUND: Type 2 diabetes mellitus (T2D) confers a two- to three-fold increased risk of cardiovascular disease (CVD). However, the mechanisms underlying increased CVD risk among people with T2D are only partially understood. We hypothesized that a genetic association study among people with T2D at risk for developing incident cardiovascular complications could provide insights into molecular genetic aspects underlying CVD. METHODS: From 16 studies of the Cohorts for Heart & Aging Research in Genomic Epidemiology (CHARGE) Consortium, we conducted a multi-ancestry time-to-event genome-wide association study (GWAS) for incident CVD among people with T2D using Cox proportional hazards models. Incident CVD was defined based on a composite of coronary artery disease (CAD), stroke, and cardiovascular death that occurred at least one year after the diagnosis of T2D. Cohort-level estimated effect sizes were combined using inverse variance weighted fixed effects meta-analysis. We also tested 204 known CAD variants for association with incident CVD among patients with T2D. RESULTS: A total of 49,230 participants with T2D were included in the analyses (31,118 European ancestries and 18,112 non-European ancestries) which consisted of 8,956 incident CVD cases over a range of mean follow-up duration between 3.2 and 33.7 years (event rate 18.2%). We identified three novel, distinct genetic loci for incident CVD among individuals with T2D that reached the threshold for genome-wide significance (P<5.0×10-8): rs147138607 (intergenic variant between CACNA1E and ZNF648) with a hazard ratio (HR) 1.23, 95% confidence interval (CI) 1.15 - 1.32, P=3.6×10-9, rs11444867 (intergenic variant near HS3ST1) with HR 1.89, 95% CI 1.52 - 2.35, P=9.9×10-9, and rs335407 (intergenic variant between TFB1M and NOX3) HR 1.25, 95% CI 1.16 - 1.35, P=1.5×10-8. Among 204 known CAD loci, 32 were associated with incident CVD in people with T2D with P<0.05, and 5 were significant after Bonferroni correction (P<0.00024, 0.05/204). A polygenic score of these 204 variants was significantly associated with incident CVD with HR 1.14 (95% CI 1.12 - 1.16) per 1 standard deviation increase (P=1.0×10-16). CONCLUSIONS: The data point to novel and known genomic regions associated with incident CVD among individuals with T2D.

11.
medRxiv ; 2023 Aug 04.
Article in English | MEDLINE | ID: mdl-37577628

ABSTRACT

Black Americans have a significantly higher risk of developing chronic kidney disease (CKD), especially focal segmental glomerulosclerosis (FSGS), than European Americans. Two coding variants (G1 and G2) in the APOL1 gene play a major role in this disparity. While 13% of Black Americans carry the high-risk recessive genotypes, only a fraction of these individuals develops FSGS or kidney failure, indicating the involvement of additional disease modifiers. Here, we show that the presence of the APOL1 p.N264K missense variant, when co-inherited with the G2 APOL1 risk allele, substantially reduces the penetrance of the G1G2 and G2G2 high-risk genotypes by rendering these genotypes low-risk. These results align with prior functional evidence showing that the p.N264K variant reduces the toxicity of the APOL1 high-risk alleles. These findings have important implications for our understanding of the mechanisms of APOL1 -associated nephropathy, as well as for the clinical management of individuals with high-risk genotypes that include the G2 allele.

12.
J Clin Invest ; 133(18)2023 09 15.
Article in English | MEDLINE | ID: mdl-37471144

ABSTRACT

Protease-activated receptor 4 (PAR4) (gene F2RL3) harbors a functional dimorphism, rs773902 A/G (encoding Thr120/Ala120, respectively) and is associated with greater platelet aggregation. The A allele frequency is more common in Black individuals, and Black individuals have a higher incidence of ischemic stroke than White individuals. However, it is not known whether the A allele is responsible for worse stroke outcomes. To directly test the in vivo effect of this variant on stroke, we generated mice in which F2rl3 was replaced by F2RL3, thereby expressing human PAR4 (hPAR4) with either Thr120 or Ala120. Compared with hPAR4 Ala120 mice, hPAR4 Thr120 mice had worse stroke outcomes, mediated in part by enhanced platelet activation and platelet-neutrophil interactions. Analyses of 7,620 Black subjects with 487 incident ischemic strokes demonstrated the AA genotype was a risk for incident ischemic stroke and worse functional outcomes. In humanized mice, ticagrelor with or without aspirin improved stroke outcomes in hPAR4 Ala120 mice, but not in hPAR4 Thr120 mice. P selectin blockade improved stroke outcomes and reduced platelet-neutrophil interactions in hPAR4 Thr120 mice. Our results may explain some of the racial disparity in stroke and support the need for studies of nonstandard antiplatelet therapies for patients expressing PAR4 Thr120.


Subject(s)
Ischemic Stroke , Stroke , Humans , Animals , Mice , Receptors, Thrombin/genetics , Platelet Aggregation/genetics , Blood Platelets/physiology , Platelet Aggregation Inhibitors/pharmacology , Stroke/genetics , Receptor, PAR-1
13.
Alzheimers Dement ; 19(10): 4367-4376, 2023 10.
Article in English | MEDLINE | ID: mdl-37417779

ABSTRACT

INTRODUCTION: Diabetes and dementia are diseases of high health-care burden worldwide. Individuals with diabetes have 1.4 to 2.2 times higher risk of dementia. Our objective was to evaluate evidence of causality between these two common diseases. METHODS: We conducted a one-sample Mendelian randomization (MR) analysis in the US Department of Veterans Affairs Million Veteran program. The study included 334,672 participants ≥65 years of age with type 2 diabetes and dementia case-control status and genotype data. RESULTS: For each standard deviation increase in genetically predicted diabetes, we found increased odds of three dementia diagnoses in non-Hispanic White participants (all-cause: odds ratio [OR] = 1.07 [1.05-1.08], P = 3.40E-18; vascular: OR = 1.11 [1.07-1.15], P = 3.63E-09, Alzheimer's disease [AD]: OR = 1.06 [1.02-1.09], P = 6.84E-04) and non-Hispanic Black participants (all-cause: OR = 1.06 [1.02-1.10], P = 3.66E-03, vascular: OR = 1.11 [1.04-1.19], P = 2.20E-03, AD: OR = 1.12 [1.02-1.23], P = 1.60E-02) but not in Hispanic participants (all P > 0.05). DISCUSSION: We found evidence of causality between diabetes and dementia using a one-sample MR study, with access to individual level data, overcoming limitations of prior studies using two-sample MR techniques.


Subject(s)
Alzheimer Disease , Diabetes Mellitus, Type 2 , Veterans , Humans , Alzheimer Disease/epidemiology , Alzheimer Disease/genetics , Diabetes Mellitus, Type 2/epidemiology , Diabetes Mellitus, Type 2/genetics , Genome-Wide Association Study , Mendelian Randomization Analysis , Polymorphism, Single Nucleotide , Risk Factors , Aged
14.
J Peripher Nerv Syst ; 28(3): 460-470, 2023 09.
Article in English | MEDLINE | ID: mdl-37341347

ABSTRACT

BACKGROUND AND AIMS: The lack of easily measurable biomarkers remains a challenge in executing clinical trials for diabetic neuropathy (DN). Plasma Neurofilament light chain (NFL) concentration is a promising biomarker in immune-mediated neuropathies. Longitudinal studies evaluating NFL in DN have not been performed. METHODS: A nested case-control study was performed on participants with youth-onset type 2 diabetes enrolled in the prospective Treatment Options for Type 2 Diabetes in Adolescents and Youth (TODAY) study. Plasma NFL concentrations were measured at 4-year intervals from 2008 to 2020 in 50 participants who developed DN and 50 participants with type 2 diabetes who did not develop DN. RESULTS: NFL concentrations were similar in the DN and no DN groups at the first assessment. Concentrations were higher in DN participants at all subsequent assessment periods (all p < .01). NFL concentrations increased over time in both groups, with higher degrees of change in DN participants (interaction p = .045). A doubling of the NFL value at Assessment 2 in those without DN increased the odds of ultimate DN outcome by an estimated ratio of 2.86 (95% CI: [1.30, 6.33], p = .0046). At the final study visit, positive Spearman correlations (controlled for age, sex, diabetes duration, and BMI) were observed between NFL and HbA1c (0.48, p < .0001), total cholesterol (0.25, p = .018), and low-density lipoprotein (LDL (0.30, p = .0037)). Negative correlations were observed with measures of heart rate variability (-0.42 to -0.46, p = <.0001). INTERPRETATION: The findings that NFL concentrations are elevated in individuals with youth-onset type 2 diabetes, and increase more rapidly in those who develop DN, suggest that NFL could be a valuable biomarker for DN.


Subject(s)
Diabetes Mellitus, Type 2 , Diabetic Neuropathies , Humans , Adolescent , Case-Control Studies , Intermediate Filaments , Neurofilament Proteins , Biomarkers
15.
Sci Rep ; 13(1): 9254, 2023 06 07.
Article in English | MEDLINE | ID: mdl-37286633

ABSTRACT

Privacy protection is a core principle of genomic but not proteomic research. We identified independent single nucleotide polymorphism (SNP) quantitative trait loci (pQTL) from COPDGene and Jackson Heart Study (JHS), calculated continuous protein level genotype probabilities, and then applied a naïve Bayesian approach to link SomaScan 1.3K proteomes to genomes for 2812 independent subjects from COPDGene, JHS, SubPopulations and InteRmediate Outcome Measures In COPD Study (SPIROMICS) and Multi-Ethnic Study of Atherosclerosis (MESA). We correctly linked 90-95% of proteomes to their correct genome and for 95-99% we identify the 1% most likely links. The linking accuracy in subjects with African ancestry was lower (~ 60%) unless training included diverse subjects. With larger profiling (SomaScan 5K) in the Atherosclerosis Risk Communities (ARIC) correct identification was > 99% even in mixed ancestry populations. We also linked proteomes-to-proteomes and used the proteome only to determine features such as sex, ancestry, and first-degree relatives. When serial proteomes are available, the linking algorithm can be used to identify and correct mislabeled samples. This work also demonstrates the importance of including diverse populations in omics research and that large proteomic datasets (> 1000 proteins) can be accurately linked to a specific genome through pQTL knowledge and should not be considered unidentifiable.


Subject(s)
Atherosclerosis , Proteome , Humans , Proteome/genetics , Bayes Theorem , Privacy , Genome-Wide Association Study , Atherosclerosis/genetics , Polymorphism, Single Nucleotide
16.
PLoS Genet ; 19(5): e1010517, 2023 05.
Article in English | MEDLINE | ID: mdl-37216410

ABSTRACT

Integrative approaches that simultaneously model multi-omics data have gained increasing popularity because they provide holistic system biology views of multiple or all components in a biological system of interest. Canonical correlation analysis (CCA) is a correlation-based integrative method designed to extract latent features shared between multiple assays by finding the linear combinations of features-referred to as canonical variables (CVs)-within each assay that achieve maximal across-assay correlation. Although widely acknowledged as a powerful approach for multi-omics data, CCA has not been systematically applied to multi-omics data in large cohort studies, which has only recently become available. Here, we adapted sparse multiple CCA (SMCCA), a widely-used derivative of CCA, to proteomics and methylomics data from the Multi-Ethnic Study of Atherosclerosis (MESA) and Jackson Heart Study (JHS). To tackle challenges encountered when applying SMCCA to MESA and JHS, our adaptations include the incorporation of the Gram-Schmidt (GS) algorithm with SMCCA to improve orthogonality among CVs, and the development of Sparse Supervised Multiple CCA (SSMCCA) to allow supervised integration analysis for more than two assays. Effective application of SMCCA to the two real datasets reveals important findings. Applying our SMCCA-GS to MESA and JHS, we identified strong associations between blood cell counts and protein abundance, suggesting that adjustment of blood cell composition should be considered in protein-based association studies. Importantly, CVs obtained from two independent cohorts also demonstrate transferability across the cohorts. For example, proteomic CVs learned from JHS, when transferred to MESA, explain similar amounts of blood cell count phenotypic variance in MESA, explaining 39.0% ~ 50.0% variation in JHS and 38.9% ~ 49.1% in MESA. Similar transferability was observed for other omics-CV-trait pairs. This suggests that biologically meaningful and cohort-agnostic variation is captured by CVs. We anticipate that applying our SMCCA-GS and SSMCCA on various cohorts would help identify cohort-agnostic biologically meaningful relationships between multi-omics data and phenotypic traits.


Subject(s)
Canonical Correlation Analysis , Proteomics , Humans , Proteomics/methods , Multiomics , Cohort Studies
17.
medRxiv ; 2023 Mar 09.
Article in English | MEDLINE | ID: mdl-36945581

ABSTRACT

INTRODUCTION: Diabetes and dementia are diseases of high healthcare burden worldwide. Individuals with diabetes have 1.4 to 2.2 times higher risk of dementia. Our objective was to evaluate evidence of causality between these two common diseases. METHODS: We conducted a one-sample Mendelian randomization (MR) analysis in the U.S. Department of Veterans Affairs Million Veteran program. The study included 334,672 participants ≥65 years of age with type 2 diabetes and dementia case-control status and genotype data. RESULTS: For each standard deviation increase in genetically-predicted diabetes, we found increased odds of three dementia diagnoses in non-Hispanic White participants (all-cause: OR=1.07[1.05-1.08], P =3.40E-18; vascular: OR=1.11[1.07-1.15], P =3.63E-09, Alzheimer's: OR=1.06[1.02-1.09], P =6.84E-04) and non-Hispanic Black participants (all-cause: OR=1.06[1.02-1.10], P =3.66E-03, vascular: OR=1.11[1.04-1.19], P =2.20E-03, Alzheimer's: OR=1.12 [1.02-1.23], P =1.60E-02) but not in Hispanic participants (all P >.05). DISCUSSION: We found evidence of causality between diabetes and dementia using a one-sample MR study, with access to individual level data, overcoming limitations of prior studies utilizing two-sample MR techniques.

18.
Nat Genet ; 55(1): 154-164, 2023 01.
Article in English | MEDLINE | ID: mdl-36564505

ABSTRACT

Meta-analysis of whole genome sequencing/whole exome sequencing (WGS/WES) studies provides an attractive solution to the problem of collecting large sample sizes for discovering rare variants associated with complex phenotypes. Existing rare variant meta-analysis approaches are not scalable to biobank-scale WGS data. Here we present MetaSTAAR, a powerful and resource-efficient rare variant meta-analysis framework for large-scale WGS/WES studies. MetaSTAAR accounts for relatedness and population structure, can analyze both quantitative and dichotomous traits and boosts the power of rare variant tests by incorporating multiple variant functional annotations. Through meta-analysis of four lipid traits in 30,138 ancestrally diverse samples from 14 studies of the Trans Omics for Precision Medicine (TOPMed) Program, we show that MetaSTAAR performs rare variant meta-analysis at scale and produces results comparable to using pooled data. Additionally, we identified several conditionally significant rare variant associations with lipid traits. We further demonstrate that MetaSTAAR is scalable to biobank-scale cohorts through meta-analysis of TOPMed WGS data and UK Biobank WES data of ~200,000 samples.


Subject(s)
Genome-Wide Association Study , Lipids , Genome-Wide Association Study/methods , Whole Genome Sequencing/methods , Exome Sequencing , Phenotype , Lipids/genetics
19.
Int J Obes (Lond) ; 47(2): 109-116, 2023 02.
Article in English | MEDLINE | ID: mdl-36463326

ABSTRACT

BACKGROUND/OBJECTIVES: Obesity, defined as excessive fat accumulation that represents a health risk, is increasing in adults and children, reaching global epidemic proportions. Body mass index (BMI) correlates with body fat and future health risk, yet differs in prediction by fat distribution, across populations and by age. Nonetheless, few genetic studies of BMI have been conducted in ancestrally diverse populations. Gene expression association with BMI was assessed in the Multi-Ethnic Study of Atherosclerosis (MESA) in four self-identified race and ethnicity (SIRE) groups to identify genes associated with obesity. SUBJECTS/METHODS: RNA-sequencing was performed on 1096 MESA participants (37.8% white, 24.3% Hispanic, 28.4% African American, and 9.5% Chinese American) and linear models were used to assess the association of expression from each gene for its effect on BMI, adjusting for age, sex, sequencing center, study site, five expression and four genetic principal components in each self-identified race group. Sample-size-weighted meta-analysis was performed to identify genes with BMI-associated expression across ancestry groups. RESULTS: Within individual SIRE groups, there were zero to three genes whose expression is significantly (p < 1.97 × 10-6) associated with BMI. Across all groups, 45 genes were identified by meta-analysis whose expression was significantly associated with BMI, explaining 29.7% of BMI variation. The 45 genes are expressed in a variety of tissues and cell types and are enriched for obesity-related processes including erythrocyte function, oxygen binding and transport, and JAK-STAT signaling. CONCLUSIONS: We have identified genes whose expression is significantly associated with obesity in a multi-ethnic cohort. We have identified novel genes associated with BMI as well as confirmed previously identified genes from earlier genetic analyses. These novel genes and their biological pathways represent new targets for understanding the biology of obesity as well as new therapeutic intervention to reduce obesity and improve global public health.


Subject(s)
Body Mass Index , Gene Expression , Obesity , Adult , Child , Humans , Atherosclerosis , Obesity/epidemiology , Obesity/genetics
20.
Hum Mol Genet ; 32(6): 1048-1060, 2023 03 06.
Article in English | MEDLINE | ID: mdl-36444934

ABSTRACT

Diabetic kidney disease (DKD) is recognized as an important public health challenge. However, its genomic mechanisms are poorly understood. To identify rare variants for DKD, we conducted a whole-exome sequencing (WES) study leveraging large cohorts well-phenotyped for chronic kidney disease and diabetes. Our two-stage WES study included 4372 European and African ancestry participants from the Chronic Renal Insufficiency Cohort and Atherosclerosis Risk in Communities studies (stage 1) and 11 487 multi-ancestry Trans-Omics for Precision Medicine participants (stage 2). Generalized linear mixed models, which accounted for genetic relatedness and adjusted for age, sex and ancestry, were used to test associations between single variants and DKD. Gene-based aggregate rare variant analyses were conducted using an optimized sequence kernel association test implemented within our mixed model framework. We identified four novel exome-wide significant DKD-related loci through initiating diabetes. In single-variant analyses, participants carrying a rare, in-frame insertion in the DIS3L2 gene (rs141560952) exhibited a 193-fold increased odds [95% confidence interval (CI): 33.6, 1105] of DKD compared with noncarriers (P = 3.59 × 10-9). Likewise, each copy of a low-frequency KRT6B splice-site variant (rs425827) conferred a 5.31-fold higher odds (95% CI: 3.06, 9.21) of DKD (P = 2.72 × 10-9). Aggregate gene-based analyses further identified ERAP2 (P = 4.03 × 10-8) and NPEPPS (P = 1.51 × 10-7), which are both expressed in the kidney and implicated in renin-angiotensin-aldosterone system modulated immune response. In the largest WES study of DKD, we identified novel rare variant loci attaining exome-wide significance. These findings provide new insights into the molecular mechanisms underlying DKD.


Subject(s)
Diabetes Mellitus , Diabetic Nephropathies , Renal Insufficiency, Chronic , Humans , Aminopeptidases , Diabetic Nephropathies/genetics , Exome Sequencing , Kidney , Renal Insufficiency, Chronic/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...