Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 19 de 19
Filter
1.
medRxiv ; 2023 Jun 04.
Article in English | MEDLINE | ID: mdl-37398155

ABSTRACT

Behaviors and disorders characterized by difficulties with self-regulation, such as problematic substance use, antisocial behavior, and symptoms of attention-deficit/hyperactivity disorder (ADHD), incur high costs for individuals, families, and communities. These externalizing behaviors often appear early in the life course and can have far-reaching consequences. Researchers have long been interested in direct measurements of genetic risk for externalizing behaviors, which can be incorporated alongside other known risk factors to improve efforts at early identification and intervention. In a preregistered analysis drawing on data from the Environmental Risk (E-Risk) Longitudinal Twin Study (N=862 twins) and the Millennium Cohort Study (MCS; N=2,824 parent-child trios), two longitudinal cohorts from the UK, we leveraged molecular genetic data and within-family designs to test for genetic effects on externalizing behavior that are unbiased by the common sources of environmental confounding. Results are consistent with the conclusion that an externalizing polygenic index (PGI) captures causal effects of genetic variants on externalizing problems in children and adolescents, with an effect size that is comparable to those observed for other established risk factors in the research literature on externalizing behavior. Additionally, we find that polygenic associations vary across development (peaking from age 5-10 years), that parental genetics (assortment and parent-specific effects) and family-level covariates affect prediction little, and that sex differences in polygenic prediction are present but only detectable using within-family comparisons. Based on these findings, we believe that the PGI for externalizing behavior is a promising means for studying the development of disruptive behaviors across child development.

2.
Nat Commun ; 14(1): 4473, 2023 07 25.
Article in English | MEDLINE | ID: mdl-37491308

ABSTRACT

Measurement error in polygenic indices (PGIs) attenuates the estimation of their effects in regression models. We analyze and compare two approaches addressing this attenuation bias: Obviously Related Instrumental Variables (ORIV) and the PGI Repository Correction (PGI-RC). Through simulations, we show that the PGI-RC performs slightly better than ORIV, unless the prediction sample is very small (N < 1000) or when there is considerable assortative mating. Within families, ORIV is the best choice since the PGI-RC correction factor is generally not available. We verify the empirical validity of the simulations by predicting educational attainment and height in a sample of siblings from the UK Biobank. We show that applying ORIV between families increases the standardized effect of the PGI by 12% (height) and by 22% (educational attainment) compared to a meta-analysis-based PGI, yet estimates remain slightly below the PGI-RC estimates. Furthermore, within-family ORIV regression provides the tightest lower bound for the direct genetic effect, increasing the lower bound for the standardized direct genetic effect on educational attainment from 0.14 to 0.18 (+29%), and for height from 0.54 to 0.61 (+13%) compared to a meta-analysis-based PGI.


Subject(s)
Educational Status , Humans
3.
PLoS Genet ; 19(2): e1010638, 2023 02.
Article in English | MEDLINE | ID: mdl-36809357

ABSTRACT

Mediation analysis is commonly used to identify mechanisms and intermediate factors between causes and outcomes. Studies drawing on polygenic scores (PGSs) can readily employ traditional regression-based procedures to assess whether trait M mediates the relationship between the genetic component of outcome Y and outcome Y itself. However, this approach suffers from attenuation bias, as PGSs capture only a (small) part of the genetic variance of a given trait. To overcome this limitation, we developed MA-GREML: a method for Mediation Analysis using Genome-based Restricted Maximum Likelihood (GREML) estimation. Using MA-GREML to assess mediation between genetic factors and traits comes with two main advantages. First, we circumvent the limited predictive accuracy of PGSs that regression-based mediation approaches suffer from. Second, compared to methods employing summary statistics from genome-wide association studies, the individual-level data approach of GREML allows to directly control for confounders of the association between M and Y. In addition to typical GREML parameters (e.g., the genetic correlation), MA-GREML estimates (i) the effect of M on Y, (ii) the direct effect (i.e., the genetic variance of Y that is not mediated by M), and (iii) the indirect effect (i.e., the genetic variance of Y that is mediated by M). MA-GREML also provides standard errors of these estimates and assesses the significance of the indirect effect. We use analytical derivations and simulations to show the validity of our approach under two main assumptions, viz., that M precedes Y and that environmental confounders of the association between M and Y are controlled for. We conclude that MA-GREML is an appropriate tool to assess the mediating role of trait M in the relationship between the genetic component of Y and outcome Y. Using data from the US Health and Retirement Study, we provide evidence that genetic effects on Body Mass Index (BMI), cognitive functioning and self-reported health in later life run partially through educational attainment. For mental health, we do not find significant evidence for an indirect effect through educational attainment. Further analyses show that the additive genetic factors of these four outcomes do partially (cognition and mental health) and fully (BMI and self-reported health) run through an earlier realization of these traits.


Subject(s)
Genome-Wide Association Study , Genome , Humans , Likelihood Functions , Phenotype , Multifactorial Inheritance
4.
Cell Genom ; 2(6)2022 Jun 08.
Article in English | MEDLINE | ID: mdl-35812988

ABSTRACT

Understanding which biological pathways are specific versus general across diagnostic categories and levels of symptom severity is critical to improving nosology and treatment of psychopathology. Here, we combine transdiagnostic and dimensional approaches to genetic discovery for the first time, conducting a novel multivariate genome-wide association study of eight psychiatric symptoms and disorders broadly related to mood disturbance and psychosis. We identify two transdiagnostic genetic liabilities that distinguish between common forms of psychopathology versus rarer forms of serious mental illness. Biological annotation revealed divergent genetic architectures that differentially implicated prenatal neurodevelopment and neuronal function and regulation. These findings inform psychiatric nosology and biological models of psychopathology, as they suggest that the severity of mood and psychotic symptoms present in serious mental illness may reflect a difference in kind rather than merely in degree.

5.
BMC Bioinformatics ; 23(1): 305, 2022 Jul 27.
Article in English | MEDLINE | ID: mdl-35896974

ABSTRACT

BACKGROUND: Heritability and genetic correlation can be estimated from genome-wide single-nucleotide polymorphism (SNP) data using various methods. We recently developed multivariate genomic-relatedness-based restricted maximum likelihood (MGREML) for statistically and computationally efficient estimation of SNP-based heritability ([Formula: see text]) and genetic correlation ([Formula: see text]) across many traits in large datasets. Here, we extend MGREML by allowing it to fit and perform tests on user-specified factor models, while preserving the low computational complexity. RESULTS: Using simulations, we show that MGREML yields consistent estimates and valid inferences for such factor models at low computational cost (e.g., for data on 50 traits and 20,000 individuals, a saturated model involving 50 [Formula: see text]'s, 1225 [Formula: see text]'s, and 50 fixed effects is estimated and compared to a restricted model in less than one hour on a single notebook with two 2.7 GHz cores and 16 GB of RAM). Using repeated measures of height and body mass index from the US Health and Retirement Study, we illustrate the ability of MGREML to estimate a factor model and test whether it fits the data better than a nested model. The MGREML tool, the simulation code, and an extensive tutorial are freely available at https://github.com/devlaming/mgreml/ . CONCLUSION: MGREML can now be used to estimate multivariate factor structures and perform inferences on such factor models at low computational cost. This new feature enables simple structural equation modeling using MGREML, allowing researchers to specify, estimate, and compare genetic factor models of their choosing using SNP data.


Subject(s)
Genomics , Multifactorial Inheritance , Genome , Genome-Wide Association Study , Genomics/methods , Humans , Models, Genetic , Phenotype , Polymorphism, Single Nucleotide
6.
Commun Biol ; 4(1): 1180, 2021 10 12.
Article in English | MEDLINE | ID: mdl-34642422

ABSTRACT

Human variation in brain morphology and behavior are related and highly heritable. Yet, it is largely unknown to what extent specific features of brain morphology and behavior are genetically related. Here, we introduce a computationally efficient approach for multivariate genomic-relatedness-based restricted maximum likelihood (MGREML) to estimate the genetic correlation between a large number of phenotypes simultaneously. Using individual-level data (N = 20,190) from the UK Biobank, we provide estimates of the heritability of gray-matter volume in 74 regions of interest (ROIs) in the brain and we map genetic correlations between these ROIs and health-relevant behavioral outcomes, including intelligence. We find four genetically distinct clusters in the brain that are aligned with standard anatomical subdivision in neuroscience. Behavioral traits have distinct genetic correlations with brain morphology which suggests trait-specific relevance of ROIs. These empirical results illustrate how MGREML can be used to estimate internally consistent and high-dimensional genetic correlation matrices in large datasets.


Subject(s)
Behavior , Brain/anatomy & histology , Cerebral Cortex , Female , Genome, Human , Humans , Male , Models, Genetic , Multivariate Analysis
7.
Nat Neurosci ; 24(10): 1367-1376, 2021 10.
Article in English | MEDLINE | ID: mdl-34446935

ABSTRACT

Behaviors and disorders related to self-regulation, such as substance use, antisocial behavior and attention-deficit/hyperactivity disorder, are collectively referred to as externalizing and have shared genetic liability. We applied a multivariate approach that leverages genetic correlations among externalizing traits for genome-wide association analyses. By pooling data from ~1.5 million people, our approach is statistically more powerful than single-trait analyses and identifies more than 500 genetic loci. The loci were enriched for genes expressed in the brain and related to nervous system development. A polygenic score constructed from our results predicts a range of behavioral and medical outcomes that were not part of genome-wide analyses, including traits that until now lacked well-performing polygenic scores, such as opioid use disorder, suicide, HIV infections, criminal convictions and unemployment. Our findings are consistent with the idea that persistent difficulties in self-regulation can be conceptualized as a neurodevelopmental trait with complex and far-reaching social and health correlates.


Subject(s)
Behavior, Addictive/genetics , Genetic Association Studies , Self-Control , Attention Deficit Disorder with Hyperactivity/genetics , Behavior, Addictive/psychology , Behavioral Symptoms/genetics , Behavioral Symptoms/psychology , Computational Biology , Crime/psychology , Genome-Wide Association Study , HIV Infections/genetics , HIV Infections/psychology , Humans , Meta-Analysis as Topic , Multifactorial Inheritance , Multivariate Analysis , Opioid-Related Disorders/genetics , Opioid-Related Disorders/psychology , Reproducibility of Results , Suicide , Unemployment
8.
Mol Psychiatry ; 26(6): 2056-2069, 2021 06.
Article in English | MEDLINE | ID: mdl-32393786

ABSTRACT

We conducted genome-wide association studies (GWAS) of relative intake from the macronutrients fat, protein, carbohydrates, and sugar in over 235,000 individuals of European ancestries. We identified 21 unique, approximately independent lead SNPs. Fourteen lead SNPs are uniquely associated with one macronutrient at genome-wide significance (P < 5 × 10-8), while five of the 21 lead SNPs reach suggestive significance (P < 1 × 10-5) for at least one other macronutrient. While the phenotypes are genetically correlated, each phenotype carries a partially unique genetic architecture. Relative protein intake exhibits the strongest relationships with poor health, including positive genetic associations with obesity, type 2 diabetes, and heart disease (rg ≈ 0.15-0.5). In contrast, relative carbohydrate and sugar intake have negative genetic correlations with waist circumference, waist-hip ratio, and neighborhood deprivation (|rg| ≈ 0.1-0.3) and positive genetic correlations with physical activity (rg ≈ 0.1 and 0.2). Relative fat intake has no consistent pattern of genetic correlations with poor health but has a negative genetic correlation with educational attainment (rg ≈-0.1). Although our analyses do not allow us to draw causal conclusions, we find no evidence of negative health consequences associated with relative carbohydrate, sugar, or fat intake. However, our results are consistent with the hypothesis that relative protein intake plays a role in the etiology of metabolic dysfunction.


Subject(s)
Diabetes Mellitus, Type 2 , Genome-Wide Association Study , Body Mass Index , Diabetes Mellitus, Type 2/genetics , Diet , Genomics , Humans , Life Style
10.
Nat Hum Behav ; 3(5): 513-525, 2019 05.
Article in English | MEDLINE | ID: mdl-30962613

ABSTRACT

Genetic correlations estimated from genome-wide association studies (GWASs) reveal pervasive pleiotropy across a wide variety of phenotypes. We introduce genomic structural equation modelling (genomic SEM): a multivariate method for analysing the joint genetic architecture of complex traits. Genomic SEM synthesizes genetic correlations and single-nucleotide polymorphism heritabilities inferred from GWAS summary statistics of individual traits from samples with varying and unknown degrees of overlap. Genomic SEM can be used to model multivariate genetic associations among phenotypes, identify variants with effects on general dimensions of cross-trait liability, calculate more predictive polygenic scores and identify loci that cause divergence between traits. We demonstrate several applications of genomic SEM, including a joint analysis of summary statistics from five psychiatric traits. We identify 27 independent single-nucleotide polymorphisms not previously identified in the contributing univariate GWASs. Polygenic scores from genomic SEM consistently outperform those from univariate GWASs. Genomic SEM is flexible and open ended, and allows for continuous innovation in multivariate genetic analysis.


Subject(s)
Genome-Wide Association Study/statistics & numerical data , Genomics/methods , Latent Class Analysis , Multifactorial Inheritance/genetics , Factor Analysis, Statistical , Humans , Mental Disorders/genetics , Mental Disorders/physiopathology , Multivariate Analysis , Polymorphism, Single Nucleotide
11.
Nat Genet ; 50(5): 746-753, 2018 05.
Article in English | MEDLINE | ID: mdl-29662166

ABSTRACT

We develop a Bayesian mixed linear model that simultaneously estimates single-nucleotide polymorphism (SNP)-based heritability, polygenicity (proportion of SNPs with nonzero effects), and the relationship between SNP effect size and minor allele frequency for complex traits in conventionally unrelated individuals using genome-wide SNP data. We apply the method to 28 complex traits in the UK Biobank data (N = 126,752) and show that on average, 6% of SNPs have nonzero effects, which in total explain 22% of phenotypic variance. We detect significant (P < 0.05/28) signatures of natural selection in the genetic architecture of 23 traits, including reproductive, cardiovascular, and anthropometric traits, as well as educational attainment. The significant estimates of the relationship between effect size and minor allele frequency in complex traits are consistent with a model of negative (or purifying) selection, as confirmed by forward simulation. We conclude that negative selection acts pervasively on the genetic variants associated with human complex traits.


Subject(s)
Selection, Genetic/genetics , Bayes Theorem , Gene Frequency/genetics , Genome-Wide Association Study/methods , Genotype , Humans , Linear Models , Models, Genetic , Multifactorial Inheritance/genetics , Phenotype , Polymorphism, Single Nucleotide/genetics , Quantitative Trait, Heritable
12.
Nat Hum Behav ; 1(10): 757-765, 2017 Oct.
Article in English | MEDLINE | ID: mdl-29051922

ABSTRACT

Meta-analyses of genome-wide association studies (GWAS), which dominate genetic discovery are based on data from diverse historical time periods and populations. Genetic scores derived from GWAS explain only a fraction of the heritability estimates obtained from whole-genome studies on single populations, known as the 'hidden heritability' puzzle. Using seven sampling populations (N=35,062), we test whether hidden heritability is attributed to heterogeneity across sampling populations and time, showing that estimates are substantially smaller from across compared to within populations. We show that the hidden heritability varies substantially: from zero (height), to 20% for BMI, 37% for education, 40% for age at first birth and up to 75% for number of children. Simulations demonstrate that our results more likely reflect heterogeneity in phenotypic measurement or gene-environment interaction than genetic heterogeneity. These findings have substantial implications for genetic discovery, suggesting that large homogenous datasets are required for behavioural phenotypes and that gene-environment interaction may be a central challenge for genetic discovery.

13.
PLoS Genet ; 13(1): e1006495, 2017 Jan.
Article in English | MEDLINE | ID: mdl-28095416

ABSTRACT

Large-scale genome-wide association results are typically obtained from a fixed-effects meta-analysis of GWAS summary statistics from multiple studies spanning different regions and/or time periods. This approach averages the estimated effects of genetic variants across studies. In case genetic effects are heterogeneous across studies, the statistical power of a GWAS and the predictive accuracy of polygenic scores are attenuated, contributing to the so-called 'missing heritability'. Here, we describe the online Meta-GWAS Accuracy and Power (MetaGAP) calculator (available at www.devlaming.eu) which quantifies this attenuation based on a novel multi-study framework. By means of simulation studies, we show that under a wide range of genetic architectures, the statistical power and predictive accuracy provided by this calculator are accurate. We compare the predictions from the MetaGAP calculator with actual results obtained in the GWAS literature. Specifically, we use genomic-relatedness-matrix restricted maximum likelihood to estimate the SNP heritability and cross-study genetic correlation of height, BMI, years of education, and self-rated health in three large samples. These estimates are used as input parameters for the MetaGAP calculator. Results from the calculator suggest that cross-study heterogeneity has led to attenuation of statistical power and predictive accuracy in recent large-scale GWAS efforts on these traits (e.g., for years of education, we estimate a relative loss of 51-62% in the number of genome-wide significant loci and a relative loss in polygenic score R2 of 36-38%). Hence, cross-study heterogeneity contributes to the missing heritability.


Subject(s)
Data Accuracy , Genome-Wide Association Study/standards , Software , Genome-Wide Association Study/methods , Humans , Meta-Analysis as Topic
15.
Nat Genet ; 48(12): 1462-1472, 2016 12.
Article in English | MEDLINE | ID: mdl-27798627

ABSTRACT

The genetic architecture of human reproductive behavior-age at first birth (AFB) and number of children ever born (NEB)-has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified, and the underlying mechanisms of AFB and NEB are poorly understood. We report a large genome-wide association study of both sexes including 251,151 individuals for AFB and 343,072 individuals for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study and 4 additional loci associated in a gene-based effort. These loci harbor genes that are likely to have a role, either directly or by affecting non-local gene expression, in human reproduction and infertility, thereby increasing understanding of these complex traits.


Subject(s)
Birth Order , Genome-Wide Association Study , Parity/genetics , Quantitative Trait Loci , Reproduction/genetics , Reproductive Behavior/physiology , Female , Fertility/genetics , Humans , Maternal Age , Phenotype , Polymorphism, Single Nucleotide/genetics , Pregnancy
17.
Nature ; 533(7604): 539-42, 2016 05 26.
Article in English | MEDLINE | ID: mdl-27225129

ABSTRACT

Educational attainment is strongly influenced by social and other environmental factors, but genetic factors are estimated to account for at least 20% of the variation across individuals. Here we report the results of a genome-wide association study (GWAS) for educational attainment that extends our earlier discovery sample of 101,069 individuals to 293,723 individuals, and a replication study in an independent sample of 111,349 individuals from the UK Biobank. We identify 74 genome-wide significant loci associated with the number of years of schooling completed. Single-nucleotide polymorphisms associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioural phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because educational attainment is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric diseases.


Subject(s)
Brain/metabolism , Educational Status , Fetus/metabolism , Gene Expression Regulation/genetics , Genome-Wide Association Study , Polymorphism, Single Nucleotide/genetics , Alzheimer Disease/genetics , Bipolar Disorder/genetics , Cognition , Computational Biology , Gene-Environment Interaction , Humans , Molecular Sequence Annotation , Schizophrenia/genetics , United Kingdom
18.
Nat Genet ; 48(6): 624-33, 2016 06.
Article in English | MEDLINE | ID: mdl-27089181

ABSTRACT

Very few genetic variants have been associated with depression and neuroticism, likely because of limitations on sample size in previous studies. Subjective well-being, a phenotype that is genetically correlated with both of these traits, has not yet been studied with genome-wide data. We conducted genome-wide association studies of three phenotypes: subjective well-being (n = 298,420), depressive symptoms (n = 161,460), and neuroticism (n = 170,911). We identify 3 variants associated with subjective well-being, 2 variants associated with depressive symptoms, and 11 variants associated with neuroticism, including 2 inversion polymorphisms. The two loci associated with depressive symptoms replicate in an independent depression sample. Joint analyses that exploit the high genetic correlations between the phenotypes (|ρ^| ≈ 0.8) strengthen the overall credibility of the findings and allow us to identify additional variants. Across our phenotypes, loci regulating expression in central nervous system and adrenal or pancreas tissues are strongly enriched for association.


Subject(s)
Anxiety Disorders/genetics , Depression/genetics , Genome-Wide Association Study , Polymorphism, Single Nucleotide , Bayes Theorem , Humans , Neuroticism , Phenotype
19.
Biomed Res Int ; 2015: 143712, 2015.
Article in English | MEDLINE | ID: mdl-26273586

ABSTRACT

In recent years, there has been a considerable amount of research on the use of regularization methods for inference and prediction in quantitative genetics. Such research mostly focuses on selection of markers and shrinkage of their effects. In this review paper, the use of ridge regression for prediction in quantitative genetics using single-nucleotide polymorphism data is discussed. In particular, we consider (i) the theoretical foundations of ridge regression, (ii) its link to commonly used methods in animal breeding, (iii) the computational feasibility, and (iv) the scope for constructing prediction models with nonlinear effects (e.g., dominance and epistasis). Based on a simulation study we gauge the current and future potential of ridge regression for prediction of human traits using genome-wide SNP data. We conclude that, for outcomes with a relatively simple genetic architecture, given current sample sizes in most cohorts (i.e., N < 10,000) the predictive accuracy of ridge regression is slightly higher than the classical genome-wide association study approach of repeated simple regression (i.e., one regression per SNP). However, both capture only a small proportion of the heritability. Nevertheless, we find evidence that for large-scale initiatives, such as biobanks, sample sizes can be achieved where ridge regression compared to the classical approach improves predictive accuracy substantially.


Subject(s)
DNA Mutational Analysis/trends , Forecasting , Genetics/trends , Genome-Wide Association Study/trends , Polymorphism, Single Nucleotide/genetics , Regression Analysis , Data Interpretation, Statistical
SELECTION OF CITATIONS
SEARCH DETAIL
...