Search | VHL Regional Portal

1.

Multi-ancestry polygenic risk scores for venous thromboembolism.

Jee, Yon Ho; Thibord, Florian; Dominguez, Alicia; Sept, Corriene; Boulier, Kristin; Venkateswaran, Vidhya; Ding, Yi; Cherlin, Tess; Verma, Shefali Setia; Faro, Valeria Lo; Bartz, Traci M; Boland, Anne; Brody, Jennifer A; Deleuze, Jean-Francois; Emmerich, Joseph; Germain, Marine; Johnson, Andrew D; Kooperberg, Charles; Morange, Pierre-Emmanuel; Pankratz, Nathan; Psaty, Bruce M; Reiner, Alexander P; Smadja, David M; Sitlani, Colleen M; Suchon, Pierre; Tang, Weihong; Trégouët, David-Alexandre; Zöllner, Sebastian; Pasaniuc, Bogdan; Damrauer, Scott M; Sanna, Serena; Snieder, Harold; Kabrhel, Christopher; Smith, Nicholas L; Kraft, Peter.

Hum Mol Genet ; 2024 Jun 16.

Article in English | MEDLINE | ID: mdl-38879759

ABSTRACT

Venous thromboembolism (VTE) is a significant contributor to morbidity and mortality, with large disparities in incidence rates between Black and White Americans. Polygenic risk scores (PRSs) limited to variants discovered in genome-wide association studies in European-ancestry samples can identify European-ancestry individuals at high risk of VTE. However, there is limited evidence on whether high-dimensional PRSs constructed using more sophisticated methods and more diverse training data can enhance their predictive ability and utility across diverse populations. We developed PRSs for VTE using summary statistics from the International Network against Venous Thrombosis (INVENT) consortium genome-wide association studies meta-analyses of European- (71 771 cases and 1 059 740 controls) and African-ancestry samples (7482 cases and 129 975 controls). We used LDpred2 and PRS-CSx to construct ancestry-specific and multi-ancestry PRSs and evaluated their performance in an independent European- (6781 cases and 103 016 controls) and African-ancestry sample (1385 cases and 12 569 controls). Multi-ancestry PRSs with weights tuned in European-ancestry samples slightly outperformed ancestry-specific PRSs in European-ancestry test samples (e.g. the area under the receiver operating characteristic curve [AUC] was 0.609 for PRS-CSx_combinedEUR and 0.608 for PRS-CSxEUR [P = 0.00029]). Multi-ancestry PRSs with weights tuned in African-ancestry samples also outperformed ancestry-specific PRSs in African-ancestry test samples (PRS-CSxAFR: AUC = 0.58, PRS-CSx_combinedAFR: AUC = 0.59), although this difference was not statistically significant (P = 0.34). The highest fifth percentile of the best-performing PRS was associated with 1.9-fold and 1.68-fold increased risk for VTE among European- and African-ancestry subjects, respectively, relative to those in the middle stratum. 
These findings suggest that the multi-ancestry PRS might be used to improve performance across diverse populations to identify individuals at highest risk for VTE.
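
The AUC values quoted in this abstract can be read as the probability that a randomly chosen case has a higher score than a randomly chosen control. The following is a minimal sketch of that calculation on simulated scores; the sample sizes, the 0.4 SD mean shift, and the function name `auc` are illustrative assumptions and are not taken from the study.

```python
import random

def auc(scores_cases, scores_controls):
    """Fraction of case/control pairs in which the case scores higher (ties count 1/2)."""
    wins = 0.0
    for c in scores_cases:
        for k in scores_controls:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(scores_cases) * len(scores_controls))

# Simulated standardized scores (hypothetical): cases shifted upward by 0.4 SD.
rng = random.Random(0)
controls = [rng.gauss(0.0, 1.0) for _ in range(2000)]
cases = [rng.gauss(0.4, 1.0) for _ in range(300)]

simulated_auc = auc(cases, controls)
print(round(simulated_auc, 3))
```

A modest shift in the mean score between cases and controls is enough to produce an AUC close to 0.6, which is the range reported above.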

2.

Longitudinal method comparison: modeling polygenic risk for post-traumatic stress disorder over time in individuals of African and European ancestry.

Passero, Kristin; Noll, Jennie G; Verma, Shefali Setia; Selin, Claire; Hall, Molly A.

Front Genet ; 15: 1203577, 2024.

Article in English | MEDLINE | ID: mdl-38818035

ABSTRACT

Cross-sectional data allow the investigation of how genetics influence health at a single time point, but to understand how the genome impacts phenotype development, one must use repeated measures data. Ignoring the dependency inherent in repeated measures can inflate false positives, so methods other than general or generalized linear models are required. Many methods can accommodate longitudinal data, including the commonly used linear mixed model and generalized estimating equation, as well as the less popular fixed-effects model, cluster-robust standard error adjustment, and aggregate regression. We simulated longitudinal data and applied these five methods alongside naïve linear regression, which ignored the dependency and served as a baseline, to compare their power, false positive rate, estimation accuracy, and precision. The results showed that the naïve linear regression and fixed-effects models incurred high false positive rates when analyzing a predictor that is fixed over time, making them unviable for studying time-invariant genetic effects. The linear mixed models maintained low false positive rates and unbiased estimation. The generalized estimating equation was similar to the former in terms of power and estimation, but it had increased false positives when the sample size was low, as did cluster-robust standard error adjustment. Aggregate regression produced biased estimates when predictor effects varied over time. To show how the method choice affects downstream results, we performed longitudinal analyses in an adolescent cohort of African and European ancestry. We examined how developing post-traumatic stress symptoms were predicted by polygenic risk, traumatic events, exposure to sexual abuse, and income using four approaches: linear mixed models, generalized estimating equations, cluster-robust standard error adjustment, and aggregate regression. 
While the directions of effect were generally consistent, coefficient magnitudes and statistical significance differed across methods. Our in-depth comparison of longitudinal methods showed that linear mixed models and generalized estimating equations were applicable in most scenarios requiring longitudinal modeling, but no approach produced identical results even if fit to the same data. Since result discrepancies can result from methodological choices, it is crucial that researchers determine their model a priori, refrain from testing multiple approaches to obtain favorable results, and utilize as similar as possible methods when seeking to replicate results.
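
To illustrate why ignoring within-person dependency inflates false positives for a time-invariant predictor, the sketch below compares the naive ordinary-least-squares standard error with a cluster-robust (sandwich) standard error on simulated repeated measures. The simulation settings (200 people, 5 visits, unit-variance person-level intercept, no true effect) are assumptions chosen for illustration and are not the settings used in the paper; no small-sample correction is applied.

```python
import math
import random

rng = random.Random(1)
n_people, n_visits = 200, 5

rows = []  # (person id, predictor, outcome)
for person in range(n_people):
    g = rng.gauss(0.0, 1.0)  # time-invariant predictor (e.g. a polygenic score)
    u = rng.gauss(0.0, 1.0)  # person-level intercept, induces within-person correlation
    for _ in range(n_visits):
        rows.append((person, g, u + rng.gauss(0.0, 1.0)))  # true effect of g is zero

n = len(rows)
x_bar = sum(r[1] for r in rows) / n
y_bar = sum(r[2] for r in rows) / n
sxx = sum((r[1] - x_bar) ** 2 for r in rows)
slope = sum((r[1] - x_bar) * (r[2] - y_bar) for r in rows) / sxx
intercept = y_bar - slope * x_bar
resid = [r[2] - intercept - slope * r[1] for r in rows]

# Naive standard error: treats all n observations as independent.
se_naive = math.sqrt(sum(e * e for e in resid) / (n - 2) / sxx)

# Cluster-robust standard error: sum score contributions within each person first.
cluster_score = [0.0] * n_people
for (person, x, _), e in zip(rows, resid):
    cluster_score[person] += (x - x_bar) * e
se_cluster = math.sqrt(sum(s * s for s in cluster_score)) / sxx

print(round(se_naive, 4), round(se_cluster, 4))
```

Because the naive standard error is too small here, test statistics built on it are too large, which is the mechanism behind the inflated false positive rate described in the abstract.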

3.

Multi-ancestry polygenic risk scores for venous thromboembolism.

Jee, Yon Ho; Thibord, Florian; Dominguez, Alicia; Sept, Corriene; Boulier, Kristin; Venkateswaran, Vidhya; Ding, Yi; Cherlin, Tess; Verma, Shefali Setia; Faro, Valeria Lo; Bartz, Traci M; Boland, Anne; Brody, Jennifer A; Deleuze, Jean-Francois; Emmerich, Joseph; Germain, Marine; Johnson, Andrew D; Kooperberg, Charles; Morange, Pierre-Emmanuel; Pankratz, Nathan; Psaty, Bruce M; Reiner, Alexander P; Smadja, David M; Sitlani, Colleen M; Suchon, Pierre; Tang, Weihong; Trégouët, David-Alexandre; Zöllner, Sebastian; Pasaniuc, Bogdan; Damrauer, Scott M; Sanna, Serena; Snieder, Harold; Kabrhel, Christopher; Smith, Nicholas L; Kraft, Peter.

medRxiv ; 2024 Jan 10.

Article in English | MEDLINE | ID: mdl-38260294

ABSTRACT

Venous thromboembolism (VTE) is a significant contributor to morbidity and mortality, with large disparities in incidence rates between Black and White Americans. Polygenic risk scores (PRSs) limited to variants discovered in genome-wide association studies in European-ancestry samples can identify European-ancestry individuals at high risk of VTE. However, there is limited evidence on whether high-dimensional PRS constructed using more sophisticated methods and more diverse training data can enhance the predictive ability and their utility across diverse populations. We developed PRSs for VTE using summary statistics from the International Network against Venous Thrombosis (INVENT) consortium GWAS meta-analyses of European- (71,771 cases and 1,059,740 controls) and African-ancestry samples (7,482 cases and 129,975 controls). We used LDpred2 and PRSCSx to construct ancestry-specific and multi-ancestry PRSs and evaluated their performance in an independent European- (6,261 cases and 88,238 controls) and African-ancestry sample (1,385 cases and 12,569 controls). Multi-ancestry PRSs with weights tuned in European- and African-ancestry samples, respectively, outperformed ancestry-specific PRSs in European- (PRSCSXEUR: AUC=0.61 (0.60, 0.61), PRSCSX_combinedEUR: AUC=0.61 (0.60, 0.62)) and African-ancestry test samples (PRSCSXAFR: AUC=0.58 (0.57, 0.6), PRSCSX_combined AFR: AUC=0.59 (0.57, 0.60)). The highest fifth percentile of the best-performing PRS was associated with 1.9-fold and 1.68-fold increased risk for VTE among European- and African-ancestry subjects, respectively, relative to those in the middle stratum. These findings suggest that the multi-ancestry PRS may be used to identify individuals at highest risk for VTE and provide guidance for the most effective treatment strategy across diverse populations.
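
For readers unfamiliar with how a polygenic risk score is applied once per-variant weights are available, the following is a minimal sketch: the score for one person is the weighted sum of their risk-allele dosages. The weights and dosages below are hypothetical; in the study the weights were derived with LDpred2 and PRS-CSx, whose estimation steps are not reproduced here.

```python
def polygenic_score(dosages, weights):
    """Weighted sum of allele dosages (0, 1, or 2 copies, or imputed fractions)."""
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must have the same length")
    return sum(d * w for d, w in zip(dosages, weights))

# Hypothetical per-variant weights (log odds ratios) and one person's dosages.
example_weights = [0.10, -0.20, 0.30]
example_dosages = [0, 1, 2]

print(polygenic_score(example_dosages, example_weights))
```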

4.

Genome-Wide Association Study of Breast Density among Women of African Ancestry.

Verma, Shefali Setia; Guare, Lindsay; Ehsan, Sarah; Gastounioti, Aimilia; Scales, Gabrielle; Ritchie, Marylyn D; Kontos, Despina; McCarthy, Anne Marie.

Cancers (Basel) ; 15(10), 2023 May 16.

Article in English | MEDLINE | ID: mdl-37345113

ABSTRACT

Breast density, the amount of fibroglandular versus fatty tissue in the breast, is a strong breast cancer risk factor. Understanding genetic factors associated with breast density may help in clarifying mechanisms by which breast density increases cancer risk. To date, 50 genetic loci have been associated with breast density, however, these studies were performed among predominantly European ancestry populations. We utilized a cohort of women aged 40-85 years who underwent screening mammography and had genetic information available from the Penn Medicine BioBank to conduct a Genome-Wide Association Study (GWAS) of breast density among 1323 women of African ancestry. For each mammogram, the publicly available "LIBRA" software was used to quantify dense area and area percent density. We identified 34 significant loci associated with dense area and area percent density, with the strongest signals in GACAT3, CTNNA3, HSD17B6, UGDH, TAAR8, ARHGAP10, BOD1L2, and NR3C2. There was significant overlap between previously identified breast cancer SNPs and SNPs identified as associated with breast density. Our results highlight the importance of breast density GWAS among diverse populations, including African ancestry populations. They may provide novel insights into genetic factors associated with breast density and help in elucidating mechanisms by which density increases breast cancer risk.

5.

Gene Interactions in Human Disease Studies - Evidence Is Mounting.

Singhal, Pankhuri; Verma, Shefali Setia; Ritchie, Marylyn D.

Annu Rev Biomed Data Sci ; 6: 377-395, 2023 08 10.

Article in English | MEDLINE | ID: mdl-37196359

ABSTRACT

Despite monumental advances in molecular technology to generate genome sequence data at scale, there is still a considerable proportion of heritability in most complex diseases that remains unexplained. Because many of the discoveries have been single-nucleotide variants with small to moderate effects on disease, the functional implication of many of the variants is still unknown and, thus, we have limited new drug targets and therapeutics. We, and many others, posit that one primary factor that has limited our ability to identify novel drug targets from genome-wide association studies may be due to gene interactions (epistasis), gene-environment interactions, network/pathway effects, or multiomic relationships. We propose that many of these complex models explain much of the underlying genetic architecture of complex disease. In this review, we discuss the evidence from multiple research avenues, ranging from pairs of alleles to multiomic integration studies and pharmacogenomics, that supports the need for further investigation of gene interactions (or epistasis) in genetic and genomic studies of human disease. Our goal is to catalog the mounting evidence for epistasis in genetic studies and the connections between genetic interactions and human health and disease that could enable precision medicine of the future.

Subject(s)

Epistasis, Genetic , Genome-Wide Association Study , Humans , Epistasis, Genetic/genetics , Genome , Genomics

6.

Polygenic prediction of preeclampsia and gestational hypertension.

Honigberg, Michael C; Truong, Buu; Khan, Raiyan R; Xiao, Brenda; Bhatta, Laxmi; Vy, Ha My T; Guerrero, Rafael F; Schuermans, Art; Selvaraj, Margaret Sunitha; Patel, Aniruddh P; Koyama, Satoshi; Cho, So Mi Jemma; Vellarikkal, Shamsudheen Karuthedath; Trinder, Mark; Urbut, Sarah M; Gray, Kathryn J; Brumpton, Ben M; Patil, Snehal; Zöllner, Sebastian; Antopia, Mariah C; Saxena, Richa; Nadkarni, Girish N; Do, Ron; Yan, Qi; Pe'er, Itsik; Verma, Shefali Setia; Gupta, Rajat M; Haas, David M; Martin, Hilary C; van Heel, David A; Laisk, Triin; Natarajan, Pradeep.

Nat Med ; 29(6): 1540-1549, 2023 Jun.

Article in English | MEDLINE | ID: mdl-37248299

ABSTRACT

Preeclampsia and gestational hypertension are common pregnancy complications associated with adverse maternal and child outcomes. Current tools for prediction, prevention and treatment are limited. Here we tested the association of maternal DNA sequence variants with preeclampsia in 20,064 cases and 703,117 control individuals and with gestational hypertension in 11,027 cases and 412,788 control individuals across discovery and follow-up cohorts using multi-ancestry meta-analysis. Altogether, we identified 18 independent loci associated with preeclampsia/eclampsia and/or gestational hypertension, 12 of which are new (for example, MTHFR-CLCN6, WNT3A, NPR3, PGR and RGL3), including two loci (PLCE1 and FURIN) identified in the multitrait analysis. Identified loci highlight the role of natriuretic peptide signaling, angiogenesis, renal glomerular function, trophoblast development and immune dysregulation. We derived genome-wide polygenic risk scores that predicted preeclampsia/eclampsia and gestational hypertension in external cohorts, independent of clinical risk factors, and reclassified eligibility for low-dose aspirin to prevent preeclampsia. Collectively, these findings provide mechanistic insights into the hypertensive disorders of pregnancy and have the potential to advance pregnancy risk stratification.

Subject(s)

Eclampsia , Hypertension, Pregnancy-Induced , Hypertension , Pre-Eclampsia , Pregnancy , Female , Child , Humans , Hypertension, Pregnancy-Induced/genetics , Pre-Eclampsia/genetics , Pre-Eclampsia/prevention & control , Aspirin , Risk Factors

7.

Genetic Predictors of Blood Pressure Traits are Associated with Preeclampsia.

Jasper, Elizabeth A; Hellwege, Jacklyn N; Breeyear, Joseph H; Xiao, Brenda; Jarvik, Gail P; Stanaway, Ian B; Leppig, Kathleen A; Chittoor, Geetha; Hayes, M Geoffrey; Dikilitas, Ozan; Kullo, Iftikhar J; Holm, Ingrid A; Verma, Shefali Setia; Edwards, Todd L; Velez Edwards, Digna R.

medRxiv ; 2023 Feb 14.

Article in English | MEDLINE | ID: mdl-36824881

ABSTRACT

Background: Preeclampsia, a pregnancy complication characterized by hypertension after 20 gestational weeks, is a major cause of maternal and neonatal morbidity and mortality. The mechanisms leading to preeclampsia are unclear; however, there is evidence that preeclampsia is highly heritable. We evaluated the association of polygenic risk scores (PRS) for blood pressure traits and preeclampsia to assess whether there is shared genetic architecture. Methods: Participants were obtained from Vanderbilt University's BioVU, the Electronic Medical Records and Genomics network, and the Penn Medicine Biobank. Non-Hispanic Black and White females of reproductive age with indications of pregnancy and genotype information were included. Preeclampsia was defined by ICD codes. Summary statistics for diastolic blood pressure (DBP), systolic blood pressure (SBP), and pulse pressure (PP) PRS were obtained from Giri et al 2019. Associations between preeclampsia and each PRS were evaluated separately by race and study population before evidence was meta-analyzed. Prediction models were developed and evaluated using 10-fold cross validation. Results: In the 3,504 Black and 5,009 White individuals included, the rate of preeclampsia was 15.49%. The DBP and SBP PRSs were associated with preeclampsia in Whites but not Blacks. The PP PRS was significantly associated with preeclampsia in Blacks and Whites. In trans-ancestry meta-analysis, all PRSs were associated with preeclampsia (OR_DBP = 1.10, 95% CI = 1.02-1.17, p = 7.68×10^-3; OR_SBP = 1.16, 95% CI = 1.09-1.23, p = 2.23×10^-6; OR_PP = 1.14, 95% CI = 1.07-1.27, p = 9.86×10^-5). However, addition of PRSs to clinical prediction models did not improve predictive performance. Conclusions: Genetic factors contributing to blood pressure regulation in the general population also predispose to preeclampsia.
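
The abstract above describes estimating associations separately by race and study population and then meta-analyzing them. It does not state which meta-analysis model was used, so the sketch below assumes a fixed-effect, inverse-variance-weighted combination on the log odds ratio scale, with standard errors recovered from 95% confidence limits. The function name and the stratum-level inputs are hypothetical.

```python
import math

Z_95 = 1.959963984540054  # two-sided 95% normal quantile

def fixed_effect_meta(odds_ratios, ci_lows, ci_highs):
    """Pool odds ratios by inverse-variance weighting on the log scale.

    Returns (pooled OR, lower 95% limit, upper 95% limit).
    """
    num = 0.0
    den = 0.0
    for or_, lo, hi in zip(odds_ratios, ci_lows, ci_highs):
        log_or = math.log(or_)
        se = (math.log(hi) - math.log(lo)) / (2.0 * Z_95)  # SE implied by the CI width
        weight = 1.0 / se ** 2
        num += weight * log_or
        den += weight
    pooled = num / den
    se_pooled = math.sqrt(1.0 / den)
    return (math.exp(pooled),
            math.exp(pooled - Z_95 * se_pooled),
            math.exp(pooled + Z_95 * se_pooled))

# Two hypothetical strata (not values from the study).
pooled_or, pooled_lo, pooled_hi = fixed_effect_meta(
    [1.10, 1.20], [1.00, 1.05], [1.21, 1.37]
)
print(round(pooled_or, 3), round(pooled_lo, 3), round(pooled_hi, 3))
```

The pooled estimate lies between the stratum estimates and has a narrower interval than either, which is the usual motivation for combining strata before testing.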

8.

Inference of Causal Relationships Between Genetic Risk Factors for Cardiometabolic Phenotypes and Female-Specific Health Conditions.

Xiao, Brenda; Velez Edwards, Digna R; Lucas, Anastasia; Drivas, Theodore; Gray, Kathryn; Keating, Brendan; Weng, Chunhua; Jarvik, Gail P; Hakonarson, Hakon; Kottyan, Leah; Elhadad, Noemie; Wei, Wei-Qi; Luo, Yuan; Kim, Dokyoon; Ritchie, Marylyn; Verma, Shefali Setia.

J Am Heart Assoc ; 12(5): e026561, 2023 03 07.

Article in English | MEDLINE | ID: mdl-36846987

ABSTRACT

Background Cardiometabolic diseases are highly comorbid, but their relationship with female-specific or overwhelmingly female-predominant health conditions (breast cancer, endometriosis, pregnancy complications) is understudied. This study aimed to estimate the cross-trait genetic overlap and influence of genetic burden of cardiometabolic traits on health conditions unique to women. Methods and Results Using electronic health record data from 71 008 ancestrally diverse women, we examined relationships between 23 obstetrical/gynecological conditions and 4 cardiometabolic phenotypes (body mass index, coronary artery disease, type 2 diabetes, and hypertension) by performing 4 analyses: (1) cross-trait genetic correlation analyses to compare genetic architecture, (2) polygenic risk score-based association tests to characterize shared genetic effects on disease risk, (3) Mendelian randomization for significant associations to assess cross-trait causal relationships, and (4) chronology analyses to visualize the timeline of events unique to groups of women with high and low genetic burden for cardiometabolic traits and highlight the disease prevalence in risk groups by age. We observed 27 significant associations between cardiometabolic polygenic scores and obstetrical/gynecological conditions (body mass index and endometrial cancer, body mass index and polycystic ovarian syndrome, type 2 diabetes and gestational diabetes, type 2 diabetes and polycystic ovarian syndrome). Mendelian randomization analysis provided additional evidence of independent causal effects. We also identified an inverse association between coronary artery disease and breast cancer. High cardiometabolic polygenic scores were associated with early development of polycystic ovarian syndrome and gestational hypertension. Conclusions We conclude that polygenic susceptibility to cardiometabolic traits is associated with elevated risk of certain female-specific health conditions.

Subject(s)

Coronary Artery Disease , Diabetes Mellitus, Type 2 , Polycystic Ovary Syndrome , Humans , Female , Diabetes Mellitus, Type 2/epidemiology , Diabetes Mellitus, Type 2/genetics , Coronary Artery Disease/epidemiology , Coronary Artery Disease/genetics , Polycystic Ovary Syndrome/epidemiology , Polycystic Ovary Syndrome/genetics , Risk Factors , Phenotype

9.

Session Introduction: SALUD: Scalable Applications of cLinical risk Utility and preDiction.

Singhal, Pankhuri; Veturi, Yogasudha; Judy, Renae; Park, Yoson; Vujkovic, Marijana; Veatch, Olivia; Kember, Rachel; Verma, Shefali Setia.

Pac Symp Biocomput ; 28: 407-412, 2023.

Article in English | MEDLINE | ID: mdl-36540995

ABSTRACT

This PSB 2023 session discusses challenges in clinical implication and application of risk prediction models, which includes but is not limited to: implementation of risk models, responsible use of polygenic risk scores (PGS), and other risk prediction strategies. We focus on the development and use of new, scalable methods for harmonizing and refining risk prediction models by incorporating genetic and non-genetic risk factors, applying new phenotyping strategies, and integrating clinical factors and biomarkers. Lastly, we will discuss innovation in expanding the utility of these prediction models to underrepresented populations. This session focuses on the overarching theme of enabling early diagnosis, and treatment and preventive measures related to complex diseases and comorbidities.

Subject(s)

Computational Biology , Multifactorial Inheritance , Humans , Risk Factors , Genetic Predisposition to Disease

10.

Leveraging deep phenotyping from health check-up cohort with 10,000 Korean individuals for phenome-wide association study of 136 traits.

Choe, Eun Kyung; Shivakumar, Manu; Verma, Anurag; Verma, Shefali Setia; Choi, Seung Ho; Kim, Joo Sung; Kim, Dokyoon.

Sci Rep ; 12(1): 1930, 2022 02 04.

Article in English | MEDLINE | ID: mdl-35121771

ABSTRACT

The expanding use of the phenome-wide association study (PheWAS) faces challenges in the context of using International Classification of Diseases billing codes for phenotype definition, imbalanced study population ethnicity, and constrained application of the results in research. We performed a PheWAS utilizing 136 deep phenotypes corroborated by comprehensive health check-ups in a Korean population, along with trans-ethnic comparisons using the UK Biobank and Biobank Japan Project. A meta-analysis of the Korean and Japanese populations was performed. The PheWAS associated 65 phenotypes with 14,101 significant variants (P < 4.92 × 10^-10). Network analysis, visualization of cross-phenotype mapping, and causal inference mapping with Mendelian randomization were conducted. Among phenotype pairs from the genotype-driven cross-phenotype associations, we evaluated penetrance in correlation analysis using a clinical database. We focused on the application of PheWAS in order to make it robust and to aid the derivation of biological meaning post-PheWAS. This comprehensive analysis of PheWAS results based on a health check-up database will provide researchers and clinicians with a panoramic overview of the networks among multiple phenotypes and genetic variants, laying groundwork for the practical application of precision medicine.

Subject(s)

Genetic Variation , Penetrance , Case-Control Studies , Gene Regulatory Networks , Gene-Environment Interaction , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Mendelian Randomization Analysis , Phenotype , Republic of Korea

11.

SARS-CoV-2 Seropositivity and Seroconversion in Patients Undergoing Active Cancer-Directed Therapy.

Sun, Lova; Surya, Sanjna; Goodman, Noah G; Le, Anh N; Kelly, Gregory; Owoyemi, Olutosin; Desai, Heena; Zheng, Cathy; DeLuca, Shannon; Good, Madeline L; Hussain, Jasmin; Jeffries, Seth D; Kry, Yolanda R; Kugler, Emily M; Mansour, Maikel; Ndicu, John; Osei-Akoto, AnnaClaire; Prior, Timothy; Pundock, Stacy L; Varughese, Lisa A; Weaver, JoEllen; Doucette, Abigail; Dudek, Scott; Verma, Shefali Setia; Gouma, Sigrid; Weirick, Madison E; McAllister, Christopher M; Bange, Erin; Gabriel, Peter; Ritchie, Marylyn; Rader, Daniel J; Vonderheide, Robert H; Schuchter, Lynn M; Verma, Anurag; Maillard, Ivan; Mamtani, Ronac; Hensley, Scott E; Gross, Robert; Wileyto, E Paul; Huang, Alexander C; Maxwell, Kara N; DeMichele, Angela.

JCO Oncol Pract ; 17(12): e1879-e1886, 2021 12.

Article in English | MEDLINE | ID: mdl-34133219

ABSTRACT

PURPOSE: Multiple studies have demonstrated the negative impact of cancer care delays during the COVID-19 pandemic, and transmission mitigation techniques are imperative for continued cancer care delivery. We aimed to gauge the effectiveness of these measures at the University of Pennsylvania. METHODS: We conducted a longitudinal study of SARS-CoV-2 antibody seropositivity and seroconversion in patients presenting to infusion centers for cancer-directed therapy between May 21, 2020, and October 8, 2020. Participants completed questionnaires and had up to five serial blood collections. RESULTS: Of 124 enrolled patients, only two (1.6%) had detectable SARS-CoV-2 antibodies on initial blood draw, and no initially seronegative patients developed newly detectable antibodies on subsequent blood draw(s), corresponding to a seroconversion rate of 0% (95% CI, 0.0 to 4.1%) over 14.8 person-years of follow up, with a median of 13 health care visits per patient. CONCLUSION: These results suggest that patients with cancer receiving in-person care at a facility with aggressive mitigation efforts have an extremely low likelihood of COVID-19 infection.

Subject(s)

COVID-19 , Neoplasms , Humans , Longitudinal Studies , Neoplasms/therapy , Pandemics , SARS-CoV-2 , Seroconversion
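
A seroconversion rate of 0% still carries a non-zero upper confidence limit, as in the interval reported in the abstract above. The sketch below shows one common way such a limit can be obtained when no events are observed, using the exact (Clopper-Pearson) two-sided bound. The abstract does not report how many initially seronegative patients had a follow-up draw or which interval method was used, so the denominators below are placeholders and the method is an assumption.

```python
def upper_bound_zero_events(n, alpha=0.05):
    """Exact two-sided (Clopper-Pearson) upper limit for a proportion when 0 of n events occur."""
    if n <= 0:
        raise ValueError("n must be positive")
    return 1.0 - (alpha / 2.0) ** (1.0 / n)

# Hypothetical denominators, not taken from the study.
for n_followed in (80, 100, 122):
    print(n_followed, round(upper_bound_zero_events(n_followed), 4))
```

The bound shrinks as the number of people followed grows, which is why a zero-event result from a small cohort is compatible with a seroconversion risk of a few percent.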

12.

Medical records-based chronic kidney disease phenotype for clinical care and "big data" observational and genetic studies.

Shang, Ning; Khan, Atlas; Polubriaginof, Fernanda; Zanoni, Francesca; Mehl, Karla; Fasel, David; Drawz, Paul E; Carrol, Robert J; Denny, Joshua C; Hathcock, Matthew A; Arruda-Olson, Adelaide M; Peissig, Peggy L; Dart, Richard A; Brilliant, Murray H; Larson, Eric B; Carrell, David S; Pendergrass, Sarah; Verma, Shefali Setia; Ritchie, Marylyn D; Benoit, Barbara; Gainer, Vivian S; Karlson, Elizabeth W; Gordon, Adam S; Jarvik, Gail P; Stanaway, Ian B; Crosslin, David R; Mohan, Sumit; Ionita-Laza, Iuliana; Tatonetti, Nicholas P; Gharavi, Ali G; Hripcsak, George; Weng, Chunhua; Kiryluk, Krzysztof.

NPJ Digit Med ; 4(1): 70, 2021 Apr 13.

Article in English | MEDLINE | ID: mdl-33850243

ABSTRACT

Chronic Kidney Disease (CKD) represents a slowly progressive disorder that is typically silent until late stages, but early intervention can significantly delay its progression. We designed a portable and scalable electronic CKD phenotype to facilitate early disease recognition and empower large-scale observational and genetic studies of kidney traits. The algorithm uses a combination of rule-based and machine-learning methods to automatically place patients on the staging grid of albuminuria by glomerular filtration rate ("A-by-G" grid). We manually validated the algorithm by 451 chart reviews across three medical systems, demonstrating overall positive predictive value of 95% for CKD cases and 97% for healthy controls. Independent case-control validation using 2350 patient records demonstrated diagnostic specificity of 97% and sensitivity of 87%. Application of the phenotype to 1.3 million patients demonstrated that over 80% of CKD cases are undetected using ICD codes alone. We also demonstrated several large-scale applications of the phenotype, including identifying stage-specific kidney disease comorbidities, in silico estimation of kidney trait heritability in thousands of pedigrees reconstructed from medical records, and biobank-based multicenter genome-wide and phenome-wide association studies.
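
The validation figures in this abstract (positive predictive value, sensitivity, specificity) all derive from a two-by-two table of algorithm calls against chart review. A minimal sketch of those definitions follows; the counts are hypothetical and were chosen only to make the arithmetic easy to follow, not to reproduce the study's tables.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard accuracy measures from true/false positive and negative counts."""
    return {
        "sensitivity": tp / (tp + fn),  # share of true cases the algorithm flags
        "specificity": tn / (tn + fp),  # share of true non-cases it leaves unflagged
        "ppv": tp / (tp + fp),          # share of flagged patients who are true cases
        "npv": tn / (tn + fn),          # share of unflagged patients who are true non-cases
    }

# Hypothetical counts: 100 true cases and 100 true controls.
metrics = diagnostic_metrics(tp=87, fp=3, fn=13, tn=97)
for name, value in metrics.items():
    print(name, round(value, 3))
```

Unlike sensitivity and specificity, the predictive values depend on how common the condition is in the validation sample, so they do not transfer directly to populations with a different prevalence.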

13.

SARS-CoV-2 seropositivity and seroconversion in patients undergoing active cancer-directed therapy.

Sun, Lova; Surya, Sanjna; Goodman, Noah G; Le, Anh N; Kelly, Gregory; Owoyemi, Olutosin; Desai, Heena; Zheng, Cathy; DeLuca, Shannon; Good, Madeline L; Hussain, Jasmin; Jeffries, Seth D; Kry, Yolanda R; Kugler, Emily M; Mansour, Maikel; Ndicu, John; Osei-Akoto, AnnaClaire; Prior, Timothy; Pundock, Stacy L; Varughese, Lisa A; Weaver, JoEllen; Doucette, Abigail; Dudek, Scott; Verma, Shefali Setia; Gouma, Sigrid; Weirick, Madison E; McAllister, Christopher M; Bange, Erin; Gabriel, Peter; Ritchie, Marylyn; Rader, Daniel J; Vonderheide, Robert H; Schuchter, Lynn M; Verma, Anurag; Maillard, Ivan; Mamtani, Ronac; Hensley, Scott E; Gross, Robert; Wileyto, E Paul; Huang, Alexander C; Maxwell, Kara N; DeMichele, Angela.

medRxiv ; 2021 Jan 16.

Article in English | MEDLINE | ID: mdl-33469597

ABSTRACT

Multiple studies have demonstrated the negative impact of cancer care delays during the COVID-19 pandemic, and transmission mitigation techniques are imperative for continued cancer care delivery. To gauge the effectiveness of these measures at the University of Pennsylvania, we conducted a longitudinal study of SARS-CoV-2 antibody seropositivity and seroconversion in patients presenting to infusion centers for cancer-directed therapy between 5/21/2020 and 10/8/2020. Participants completed questionnaires and had up to five serial blood collections. Of 124 enrolled patients, only two (1.6%) had detectable SARS-CoV-2 antibodies on initial blood draw, and no initially seronegative patients developed newly detectable antibodies on subsequent blood draw(s), corresponding to a seroconversion rate of 0% (95%CI 0.0-4.1%) over 14.8 person-years of follow up, with a median of 13 healthcare visits per patient. These results suggest that cancer patients receiving in-person care at a facility with aggressive mitigation efforts have an extremely low likelihood of COVID-19 infection.

14.

Genetic Architecture of Abdominal Aortic Aneurysm in the Million Veteran Program.

Klarin, Derek; Verma, Shefali Setia; Judy, Renae; Dikilitas, Ozan; Wolford, Brooke N; Paranjpe, Ishan; Levin, Michael G; Pan, Cuiping; Tcheandjieu, Catherine; Spin, Joshua M; Lynch, Julie; Assimes, Themistocles L; Åldstedt Nyrønning, Linn; Mattsson, Erney; Edwards, Todd L; Denny, Josh; Larson, Eric; Lee, Ming Ta Michael; Carrell, David; Zhang, Yanfei; Jarvik, Gail P; Gharavi, Ali G; Harley, John; Mentch, Frank; Pacheco, Jennifer A; Hakonarson, Hakon; Skogholt, Anne Heidi; Thomas, Laurent; Gabrielsen, Maiken Elvestad; Hveem, Kristian; Nielsen, Jonas Bille; Zhou, Wei; Fritsche, Lars; Huang, Jie; Natarajan, Pradeep; Sun, Yan V; DuVall, Scott L; Rader, Daniel J; Cho, Kelly; Chang, Kyong-Mi; Wilson, Peter W F; O'Donnell, Christopher J; Kathiresan, Sekar; Scali, Salvatore T; Berceli, Scott A; Willer, Cristen; Jones, Gregory T; Bown, Matthew J; Nadkarni, Girish; Kullo, Iftikhar J.

Circulation ; 142(17): 1633-1646, 2020 10 27.

Article in English | MEDLINE | ID: mdl-32981348

ABSTRACT

BACKGROUND: Abdominal aortic aneurysm (AAA) is an important cause of cardiovascular mortality; however, its genetic determinants remain incompletely defined. In total, 10 previously identified risk loci explain a small fraction of AAA heritability. METHODS: We performed a genome-wide association study in the Million Veteran Program testing ≈18 million DNA sequence variants with AAA (7642 cases and 172 172 controls) in veterans of European ancestry with independent replication in up to 4972 cases and 99 858 controls. We then used mendelian randomization to examine the causal effects of blood pressure on AAA. We examined the association of AAA risk variants with aneurysms in the lower extremity, cerebral, and iliac arterial beds, and derived a genome-wide polygenic risk score (PRS) to identify a subset of the population at greater risk for disease. RESULTS: Through a genome-wide association study, we identified 14 novel loci, bringing the total number of known significant AAA loci to 24. In our mendelian randomization analysis, we demonstrate that a genetic increase of 10 mm Hg in diastolic blood pressure (odds ratio, 1.43 [95% CI, 1.24-1.66]; P=1.6×10^-6), as opposed to systolic blood pressure (odds ratio, 1.06 [95% CI, 0.97-1.15]; P=0.2), likely has a causal relationship with AAA development. We observed that 19 of 24 AAA risk variants associate with aneurysms in at least 1 other vascular territory. A 29-variant PRS was strongly associated with AAA (odds ratio_PRS, 1.26 [95% CI, 1.18-1.36]; P_PRS=2.7×10^-11 per SD increase in PRS), independent of family history and smoking risk factors (odds ratio_PRS+family history+smoking, 1.24 [95% CI, 1.14-1.35]; P_PRS=1.27×10^-6). Using this PRS, we identified a subset of the population with AAA prevalence greater than that observed in screening trials informing current guidelines. 
CONCLUSIONS: We identify novel AAA genetic associations with therapeutic implications and identify a subset of the population at significantly increased genetic risk of AAA independent of family history. Our data suggest that extending current screening guidelines to include testing to identify those with high polygenic AAA risk, once the cost of genotyping becomes comparable with that of screening ultrasound, would significantly increase the yield of current screening at reasonable cost.

Subject(s)

Aortic Aneurysm, Abdominal/genetics , Humans , Veterans

15.

Genomewide Association Study of Platelet Reactivity and Cardiovascular Response in Patients Treated With Clopidogrel: A Study by the International Clopidogrel Pharmacogenomics Consortium.

Verma, Shefali Setia; Bergmeijer, Thomas O; Gong, Li; Reny, Jean-Luc; Lewis, Joshua P; Mitchell, Braxton D; Alexopoulos, Dimitrios; Aradi, Daniel; Altman, Russ B; Bliden, Kevin; Bradford, Yuki; Campo, Gianluca; Chang, Kiyuk; Cleator, John H; Déry, Jean-Pierre; Dridi, Nadia P; Fernandez-Cadenas, Israel; Fontana, Pierre; Gawaz, Meinrad; Geisler, Tobias; Gensini, Gian Franco; Giusti, Betti; Gurbel, Paul A; Hochholzer, Willibald; Holmvang, Lene; Kim, Eun-Young; Kim, Ho-Sook; Marcucci, Rossella; Montaner, Joan; Backman, Joshua D; Pakyz, Ruth E; Roden, Dan M; Schaeffeler, Elke; Schwab, Matthias; Shin, Jae Gook; Siller-Matula, Jolanta M; Ten Berg, Jurriën M; Trenk, Dietmar; Valgimigli, Marco; Wallace, John; Wen, Ming-Shien; Kubo, Michiaki; Lee, Ming Ta Michael; Whaley, Ryan; Winter, Stefan; Klein, Teri E; Shuldiner, Alan R; Ritchie, Marylyn D.

Clin Pharmacol Ther ; 108(5): 1067-1077, 2020 11.

Article in English | MEDLINE | ID: mdl-32472697

ABSTRACT

Antiplatelet response to clopidogrel shows wide variation, and poor response is correlated with adverse clinical outcomes. CYP2C19 loss-of-function alleles play an important role in this response, but account for only a small proportion of variability in response to clopidogrel. An aim of the International Clopidogrel Pharmacogenomics Consortium (ICPC) is to identify other genetic determinants of clopidogrel pharmacodynamics and clinical response. A genomewide association study (GWAS) was performed using DNA from 2,750 European ancestry individuals, using adenosine diphosphate-induced platelet reactivity and major cardiovascular and cerebrovascular events as outcome parameters. GWAS for platelet reactivity revealed a strong signal for CYP2C19*2 (P value = 1.67e-33). After correction for CYP2C19*2 no other single-nucleotide polymorphism reached genomewide significance. GWAS for a combined clinical end point of cardiovascular death, myocardial infarction, or stroke (5.0% event rate), or a combined end point of cardiovascular death or myocardial infarction (4.7% event rate) showed no significant results, although in coronary artery disease, percutaneous coronary intervention, and acute coronary syndrome subgroups, mutations in SCOS5P1, CDC42BPA, and CTRAC1 showed genomewide significance (lowest P values: 1.07e-09, 4.53e-08, and 2.60e-10, respectively). CYP2C19*2 is the strongest genetic determinant of on-clopidogrel platelet reactivity. We identified three novel associations in clinical outcome subgroups, suggestive for each of these outcomes.

Subject(s)

Blood Platelets/drug effects , Cardiovascular Diseases/prevention & control , Clopidogrel/therapeutic use , Coronary Artery Disease/therapy , Cytochrome P-450 CYP2C19/genetics , Percutaneous Coronary Intervention , Pharmacogenomic Variants , Platelet Aggregation Inhibitors/therapeutic use , Polymorphism, Single Nucleotide , Aged , Blood Platelets/metabolism , Cardiovascular Diseases/blood , Cardiovascular Diseases/genetics , Cardiovascular Diseases/mortality , Clopidogrel/adverse effects , Coronary Artery Disease/mortality , Cytochrome P-450 CYP2C19/metabolism , Female , Genome-Wide Association Study , Humans , Male , Middle Aged , Percutaneous Coronary Intervention/adverse effects , Percutaneous Coronary Intervention/mortality , Pharmacogenetics , Platelet Aggregation Inhibitors/adverse effects , Risk Assessment , Risk Factors , Treatment Outcome

16.

Heritability and genome-wide association study of benign prostatic hyperplasia (BPH) in the eMERGE network.

Hellwege, Jacklyn N; Stallings, Sarah; Torstenson, Eric S; Carroll, Robert; Borthwick, Kenneth M; Brilliant, Murray H; Crosslin, David; Gordon, Adam; Hripcsak, George; Jarvik, Gail P; Linneman, James G; Devi, Parimala; Peissig, Peggy L; Sleiman, Patrick A M; Hakonarson, Hakon; Ritchie, Marylyn D; Verma, Shefali Setia; Shang, Ning; Denny, Josh C; Roden, Dan M; Velez Edwards, Digna R; Edwards, Todd L.

Sci Rep ; 9(1): 6077, 2019 04 15.

Article in English | MEDLINE | ID: mdl-30988330

ABSTRACT

Benign prostatic hyperplasia (BPH) results in a significant public health burden due to the morbidity caused by the disease and many of the available remedies. As much as 70% of men over 70 will develop BPH. Few studies have been conducted to discover the genetic determinants of BPH risk. Understanding the biological basis for this condition may provide necessary insight for development of novel pharmaceutical therapies or risk prediction. We have evaluated SNP-based heritability of BPH in two cohorts and conducted a genome-wide association study (GWAS) of BPH risk using 2,656 cases and 7,763 controls identified from the Electronic Medical Records and Genomics (eMERGE) network. SNP-based heritability estimates suggest that roughly 60% of the phenotypic variation in BPH is accounted for by genetic factors. We used logistic regression to model BPH risk as a function of principal components of ancestry, age, and imputed genotype data, with meta-analysis performed using METAL. The top result was on chromosome 22 in SYN3 at rs2710383 (p-value = 4.6 × 10⁻⁷; Odds Ratio = 0.69, 95% confidence interval = 0.55-0.83). Other suggestive signals were near genes GLGC, UNC13A, SORCS1 and between BTBD3 and SPTLC3. We also evaluated genetically-predicted gene expression in prostate tissue. The most significant result was with increasing predicted expression of ETV4 (chr17; p-value = 0.0015). Overexpression of this gene has been associated with poor prognosis in prostate cancer. In conclusion, although there were no genome-wide significant variants identified for BPH susceptibility, we present evidence supporting the heritability of this phenotype, have identified suggestive signals, and evaluated the association between BPH and genetically-predicted gene expression in prostate.

Subject(s)

Genetic Predisposition to Disease , Inheritance Patterns , Prostatic Hyperplasia/genetics , Aged , Aged, 80 and over , Biomarkers/metabolism , Case-Control Studies , Electronic Health Records/statistics & numerical data , Gene Expression Profiling , Genome-Wide Association Study , Genotyping Techniques , Humans , Male , Middle Aged , Polymorphism, Single Nucleotide , Prostate/pathology , Prostatic Hyperplasia/epidemiology , Prostatic Hyperplasia/pathology

17.

Session Introduction - Pattern Recognition in Biomedical Data: Challenges in putting big data to work.

Verma, Shefali Setia; Verma, Anurag; Kim, Dokyoon; Darabos, Christian.

Pac Symp Biocomput ; 24: 1-7, 2019.

Article in English | MEDLINE | ID: mdl-30864305

ABSTRACT

The following sections are included: Introduction; References.

18.

Author Correction: Rare variants in drug target genes contributing to complex diseases, phenome-wide.

Verma, Shefali Setia; Josyula, Navya; Verma, Anurag; Zhang, Xinyuan; Veturi, Yogasudha; Dewey, Frederick E; Hartzel, Dustin N; Lavage, Daniel R; Leader, Joe; Ritchie, Marylyn D; Pendergrass, Sarah A.

Sci Rep ; 8(1): 15911, 2018 Oct 23.

Article in English | MEDLINE | ID: mdl-30353015

ABSTRACT

A correction to this article has been published and is linked from the HTML and PDF versions of this paper. The error has not been fixed in the paper.

19.

Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals.

Lee, James J; Wedow, Robbee; Okbay, Aysu; Kong, Edward; Maghzian, Omeed; Zacher, Meghan; Nguyen-Viet, Tuan Anh; Bowers, Peter; Sidorenko, Julia; Karlsson Linnér, Richard; Fontana, Mark Alan; Kundu, Tushar; Lee, Chanwook; Li, Hui; Li, Ruoxi; Royer, Rebecca; Timshel, Pascal N; Walters, Raymond K; Willoughby, Emily A; Yengo, Loïc; Alver, Maris; Bao, Yanchun; Clark, David W; Day, Felix R; Furlotte, Nicholas A; Joshi, Peter K; Kemper, Kathryn E; Kleinman, Aaron; Langenberg, Claudia; Mägi, Reedik; Trampush, Joey W; Verma, Shefali Setia; Wu, Yang; Lam, Max; Zhao, Jing Hua; Zheng, Zhili; Boardman, Jason D; Campbell, Harry; Freese, Jeremy; Harris, Kathleen Mullan; Hayward, Caroline; Herd, Pamela; Kumari, Meena; Lencz, Todd; Luan, Jian'an; Malhotra, Anil K; Metspalu, Andres; Milani, Lili; Ong, Ken K; Perry, John R B.

Nat Genet ; 50(8): 1112-1121, 2018 07 23.

Article in English | MEDLINE | ID: mdl-30038396

ABSTRACT

Here we conducted a large-scale genetic association analysis of educational attainment in a sample of approximately 1.1 million individuals and identify 1,271 independent genome-wide-significant SNPs. For the SNPs taken together, we found evidence of heterogeneous effects across environments. The SNPs implicate genes involved in brain-development processes and neuron-to-neuron communication. In a separate analysis of the X chromosome, we identify 10 independent genome-wide-significant SNPs and estimate a SNP heritability of around 0.3% in both men and women, consistent with partial dosage compensation. A joint (multi-phenotype) analysis of educational attainment and three related cognitive phenotypes generates polygenic scores that explain 11-13% of the variance in educational attainment and 7-10% of the variance in cognitive performance. This prediction accuracy substantially increases the utility of polygenic scores as tools in research.

Subject(s)

Multifactorial Inheritance , Adult , Aged , Aged, 80 and over , Cohort Studies , Educational Status , Female , Genome-Wide Association Study/methods , Humans , Male , Middle Aged , Phenotype , Polymorphism, Single Nucleotide

20.

Rare variants in drug target genes contributing to complex diseases, phenome-wide.

Verma, Shefali Setia; Josyula, Navya; Verma, Anurag; Zhang, Xinyuan; Veturi, Yogasudha; Dewey, Frederick E; Hartzel, Dustin N; Lavage, Daniel R; Leader, Joe; Ritchie, Marylyn D; Pendergrass, Sarah A.

Sci Rep ; 8(1): 4624, 2018 03 15.

Article in English | MEDLINE | ID: mdl-29545597

ABSTRACT

The DrugBank database consists of ~800 genes that are well characterized drug targets. This list of genes is a useful resource for association testing. For example, loss of function (LOF) genetic variation has the potential to mimic the effect of drugs, and high impact variation in these genes can impact downstream traits. Identifying novel associations between genetic variation in these genes and a range of diseases can also uncover new uses for the drugs that target these genes. Phenome Wide Association Studies (PheWAS) have been successful in identifying genetic associations across hundreds to thousands of diseases. We have conducted a novel gene based PheWAS to test the effect of rare variants in DrugBank genes, evaluating associations between these genes and more than 500 quantitative and dichotomous phenotypes. We used whole exome sequencing data from 38,568 samples in Geisinger MyCode Community Health Initiative. We evaluated the results of this study when binning rare variants using various filters based on potential functional impact. We identified multiple novel associations, and the majority of the significant associations were driven by functionally annotated variation. Overall, this study provides a sweeping exploration of rare variant associations within functionally relevant genes across a wide range of diagnoses.

Subject(s)

Biomarkers/analysis , Databases, Pharmaceutical , Disease/genetics , Genome-Wide Association Study/statistics & numerical data , Pharmaceutical Preparations/analysis , Phenotype , Polymorphism, Single Nucleotide , Algorithms , Computational Biology/methods , Genetic Association Studies , Genome, Human , Genotype , Humans , Pharmaceutical Preparations/metabolism
