Search | VHL Regional Portal

1.

Multi-trait association mapping for phosphorous efficiency reveals flexible root architectures in sorghum.

Hufnagel, Barbara; Bernardino, Karine C; Malosetti, Marcos; Sousa, Sylvia M; Silva, Lidianne A; Guimaraes, Claudia Teixeira; Coelho, Antônio Marcos; Santos, Thiago Teixeira; Viana, Joao H M; Schaffert, Robert E; Kochian, Leon V; Eeuwijk, Fred A; Magalhaes, Jurandir V.

BMC Plant Biol ; 24(1): 562, 2024 Jun 15.

Article in English | MEDLINE | ID: mdl-38877425

ABSTRACT

BACKGROUND: On tropical regions, phosphorus (P) fixation onto aluminum and iron oxides in soil clays restricts P diffusion from the soil to the root surface, limiting crop yields. While increased root surface area favors P uptake under low-P availability, the relationship between the three-dimensional arrangement of the root system and P efficiency remains elusive. Here, we simultaneously assessed allelic effects of loci associated with a variety of root and P efficiency traits, in addition to grain yield under low-P availability, using multi-trait genome-wide association. We also set out to establish the relationship between root architectural traits assessed in hydroponics and in a low-P soil. Our goal was to better understand the influence of root morphology and architecture in sorghum performance under low-P availability. RESULT: In general, the same alleles of associated SNPs increased root and P efficiency traits including grain yield in a low-P soil. We found that sorghum P efficiency relies on pleiotropic loci affecting root traits, which enhance grain yield under low-P availability. Root systems with enhanced surface area stemming from lateral root proliferation mostly up to 40 cm soil depth are important for sorghum adaptation to low-P soils, indicating that differences in root morphology leading to enhanced P uptake occur exactly in the soil layer where P is found at the highest concentration. CONCLUSION: Integrated QTLs detected in different mapping populations now provide a comprehensive molecular genetic framework for P efficiency studies in sorghum. This indicated extensive conservation of P efficiency QTL across populations and emphasized the terminal portion of chromosome 3 as an important region for P efficiency in sorghum. Increases in root surface area via enhancement of lateral root development is a relevant trait for sorghum low-P soil adaptation, impacting the overall architecture of the sorghum root system. In turn, particularly concerning the critical trait for water and nutrient uptake, root surface area, root system development in deeper soil layers does not occur at the expense of shallow rooting, which may be a key reason leading to the distinctive sorghum adaptation to tropical soils with multiple abiotic stresses including low P availability and drought.

Subject(s)

Genome-Wide Association Study , Phosphorus , Plant Roots , Quantitative Trait Loci , Sorghum , Sorghum/genetics , Sorghum/metabolism , Sorghum/growth & development , Phosphorus/metabolism , Plant Roots/growth & development , Plant Roots/metabolism , Plant Roots/genetics , Plant Roots/anatomy & histology , Chromosome Mapping , Polymorphism, Single Nucleotide , Soil/chemistry , Phenotype

2.

A guided network estimation approach using multi-omic information.

Bartzis, Georgios; Peeters, Carel F W; Ligterink, Wilco; Van Eeuwijk, Fred A.

BMC Bioinformatics ; 25(1): 202, 2024 May 30.

Article in English | MEDLINE | ID: mdl-38816801

ABSTRACT

INTODUCTION: In systems biology, an organism is viewed as a system of interconnected molecular entities. To understand the functioning of organisms it is essential to integrate information about the variations in the concentrations of those molecular entities. This information can be structured as a set of networks with interconnections and with some hierarchical relations between them. Few methods exist for the reconstruction of integrative networks. OBJECTIVE: In this work, we propose an integrative network reconstruction method in which the network organization for a particular type of omics data is guided by the network structure of a related type of omics data upstream in the omic cascade. The structure of these guiding data can be either already known or be estimated from the guiding data themselves. METHODS: The method consists of three steps. First a network structure for the guiding data should be provided. Next, responses in the target set are regressed on the full set of predictors in the guiding data with a Lasso penalty to reduce the number of predictors and an L2 penalty on the differences between coefficients for predictors that share edges in the network for the guiding data. Finally, a network is reconstructed on the fitted target responses as functions of the predictors in the guiding data. This way we condition the target network on the network of the guiding data. CONCLUSIONS: We illustrate our approach on two examples in Arabidopsis. The method detects groups of metabolites that have a similar genetic or transcriptomic basis.

Subject(s)

Arabidopsis , Arabidopsis/genetics , Arabidopsis/metabolism , Systems Biology/methods , Gene Regulatory Networks , Algorithms , Computational Biology/methods , Multiomics

3.

A conceptual framework for the dynamic modeling of time-resolved phenotypes for sets of genotype-environment-management combinations: a model library.

van Voorn, George A K; Boer, Martin P; Truong, Sandra Huynh; Friedenberg, Nicholas A; Gugushvili, Shota; McCormick, Ryan; Bustos Korts, Daniela; Messina, Carlos D; van Eeuwijk, Fred A.

Front Plant Sci ; 14: 1172359, 2023.

Article in English | MEDLINE | ID: mdl-37389290

ABSTRACT

Introduction: Dynamic crop growth models are an important tool to predict complex traits, like crop yield, for modern and future genotypes in their current and evolving environments, as those occurring under climate change. Phenotypic traits are the result of interactions between genetic, environmental, and management factors, and dynamic models are designed to generate the interactions producing phenotypic changes over the growing season. Crop phenotype data are becoming increasingly available at various levels of granularity, both spatially (landscape) and temporally (longitudinal, time-series) from proximal and remote sensing technologies. Methods: Here we propose four phenomenological process models of limited complexity based on differential equations for a coarse description of focal crop traits and environmental conditions during the growing season. Each of these models defines interactions between environmental drivers and crop growth (logistic growth, with implicit growth restriction, or explicit restriction by irradiance, temperature, or water availability) as a minimal set of constraints without resorting to strongly mechanistic interpretations of the parameters. Differences between individual genotypes are conceptualized as differences in crop growth parameter values. Results: We demonstrate the utility of such low-complexity models with few parameters by fitting them to longitudinal datasets from the simulation platform APSIM-Wheat involving in silico biomass development of 199 genotypes and data of environmental variables over the course of the growing season at four Australian locations over 31 years. While each of the four models fits well to particular combinations of genotype and trial, none of them provides the best fit across the full set of genotypes by trials because different environmental drivers will limit crop growth in different trials and genotypes in any specific trial will not necessarily experience the same environmental limitation. Discussion: A combination of low-complexity phenomenological models covering a small set of major limiting environmental factors may be a useful forecasting tool for crop growth under genotypic and environmental variation.

4.

Genetic Mapping of Genotype-by-Ploidy Effects in Arabidopsis thaliana.

Wijnen, Cris L; Becker, Frank F M; Okkersen, Andries A; de Snoo, C Bastiaan; Boer, Martin P; van Eeuwijk, Fred A; Wijnker, Erik; Keurentjes, Joost J B.

Genes (Basel) ; 14(6)2023 05 26.

Article in English | MEDLINE | ID: mdl-37372341

ABSTRACT

Plants can express different phenotypic responses following polyploidization, but ploidy-dependent phenotypic variation has so far not been assigned to specific genetic factors. To map such effects, segregating populations at different ploidy levels are required. The availability of an efficient haploid inducer line in Arabidopsis thaliana allows for the rapid development of large populations of segregating haploid offspring. Because Arabidopsis haploids can be self-fertilised to give rise to homozygous doubled haploids, the same genotypes can be phenotyped at both the haploid and diploid ploidy level. Here, we compared the phenotypes of recombinant haploid and diploid offspring derived from a cross between two late flowering accessions to map genotype × ploidy (G × P) interactions. Ploidy-specific quantitative trait loci (QTLs) were detected at both ploidy levels. This implies that mapping power will increase when phenotypic measurements of monoploids are included in QTL analyses. A multi-trait analysis further revealed pleiotropic effects for a number of the ploidy-specific QTLs as well as opposite effects at different ploidy levels for general QTLs. Taken together, we provide evidence of genetic variation between different Arabidopsis accessions being causal for dissimilarities in phenotypic responses to altered ploidy levels, revealing a G × P effect. Additionally, by investigating a population derived from late flowering accessions, we revealed a major vernalisation-specific QTL for variation in flowering time, countering the historical bias of research in early flowering accessions.

Subject(s)

Arabidopsis , Chromosome Mapping , Genotype , Quantitative Trait Loci/genetics , Haploidy

5.

Pairwise ratio-based differential abundance analysis of infant microbiome 16S sequencing data.

Mildau, Kevin; Te Beest, Dennis E; Engel, Bas; Gort, Gerrit; Lambert, Jolanda; Swinkels, Sophie H N; van Eeuwijk, Fred A.

NAR Genom Bioinform ; 5(1): lqad001, 2023 Mar.

Article in English | MEDLINE | ID: mdl-36685726

ABSTRACT

Differential abundance analysis of infant 16S microbial sequencing data is complicated by challenging data properties, including high sparsity, extreme dispersion and the relative nature of the information contained within the data. In this study, we propose a pairwise ratio analysis that uses the compositional data analysis principle of subcompositional coherence and merges it with a beta-binomial regression model. The resulting method provides a flexible and easily interpretable approach to infant 16S sequencing data differential abundance analysis that does not require zero imputation. We evaluate the proposed method using infant 16S data from clinical trials and demonstrate that the proposed method has the power to detect differences, and demonstrate how its results can be used to gain insights. We further evaluate the method using data-inspired simulations and compare its power against related methods. Our results indicate that power is high for pairwise differential abundance analysis of taxon pairs that have a large abundance. In contrast, results for sparse taxon pairs show a decrease in power and substantial variability in method performance. While our method shows promising performance on well-measured subcompositions, we advise strong filtering steps in order to avoid excessive numbers of underpowered comparisons in practical applications.

6.

statgenMPP: an R package implementing an IBD-based mixed model approach for QTL mapping in a wide range of multi-parent populations.

Li, Wenhao; Boer, Martin P; van Rossum, Bart-Jan; Zheng, Chaozhi; Joosen, Ronny V L; van Eeuwijk, Fred A.

Bioinformatics ; 38(22): 5134-5136, 2022 11 15.

Article in English | MEDLINE | ID: mdl-36193999

ABSTRACT

MOTIVATION: Multi-parent populations (MPPs) are popular for QTL mapping because they combine wide genetic diversity in parents with easy control of population structure, but a limited number of software tools for QTL mapping are specifically developed for general MPP designs. RESULTS: We developed an R package called statgenMPP, adopting a unified identity-by-descent (IBD)-based mixed model approach for QTL analysis in MPPs. The package offers easy-to-use functionalities of IBD calculations, mixed model solutions and visualizations for QTL mapping in a wide range of MPP designs, including diallele, nested-association mapping populations, multi-parent advanced genetic inter-cross populations and other complicated MPPs with known crossing schemes. AVAILABILITY AND IMPLEMENTATION: The R package statgenMPP is open-source and freely available on CRAN at https://CRAN.R-project.org/package=statgenMPP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subject(s)

Software , Chromosome Mapping

7.

Identification of environment types and adaptation zones with self-organizing maps; applications to sunflower multi-environment data in Europe.

Bustos-Korts, Daniela; Boer, Martin P; Layton, Jamie; Gehringer, Anke; Tang, Tom; Wehrens, Ron; Messina, Charlie; de la Vega, Abelardo J; van Eeuwijk, Fred A.

Theor Appl Genet ; 135(6): 2059-2082, 2022 Jun.

Article in English | MEDLINE | ID: mdl-35524815

ABSTRACT

KEY MESSAGE: We evaluate self-organizing maps (SOM) to identify adaptation zones and visualize multi-environment genotypic responses. We apply SOM to multiple traits and crop growth model output of large-scale European sunflower data. Genotype-by-environment interactions (G × E) complicate the selection of well-adapted varieties. A possible solution is to group trial locations into adaptation zones with G × E occurring mainly between zones. By selecting for good performance inside those zones, response to selection is increased. In this paper, we present a two-step procedure to identify adaptation zones that starts from a self-organizing map (SOM). In the SOM, trials across locations and years are assigned to groups, called units, that are organized on a two-dimensional grid. Units that are further apart contain more distinct trials. In an iterative process of reweighting trial contributions to units, the grid configuration is learnt simultaneously with the trial assignment to units. An aggregation of the units in the SOM by hierarchical clustering then produces environment types, i.e. trials with similar growing conditions. Adaptation zones can subsequently be identified by grouping trial locations with similar distributions of environment types across years. For the construction of SOMs, multiple data types can be combined. We compared environment types and adaptation zones obtained for European sunflower from quantitative traits like yield, oil content, phenology and disease scores with those obtained from environmental indices calculated with the crop growth model Sunflo. We also show how results are affected by input data organization and user-defined weights for genotypes and traits. Adaptation zones for European sunflower as identified by our SOM-based strategy captured substantial genotype-by-location interaction and pointed to trials in Spain, Turkey and South Bulgaria as inducing different genotypic responses.

Subject(s)

Helianthus , Adaptation, Physiological , Algorithms , Cluster Analysis , Genotype , Helianthus/genetics

8.

Little heterosis found in diploid hybrid potato: The genetic underpinnings of a new hybrid crop.

Adams, James R; de Vries, Michiel E; Zheng, Chaozhi; van Eeuwijk, Fred A.

G3 (Bethesda) ; 12(6)2022 05 30.

Article in English | MEDLINE | ID: mdl-35460241

ABSTRACT

Hybrid potato breeding has become a novel alternative to conventional potato breeding allowing breeders to overcome intractable barriers (e.g. tetrasomic inheritance, masked deleterious alleles, obligate clonal propagation) with the benefit of seed-based propagule, flexible population design, and the potential of hybrid vigor. Until now, however, no formal inquiry has adequately examined the relevant genetic components for complex traits in hybrid potato populations. In this present study, we use a 2-step multivariate modeling approach to estimate the variance components to assess the magnitude of the general and specific combining abilities in diploid hybrid potato. Specific combining ability effects were identified for all yield components studied here warranting evidence of nonadditive genetic effects in hybrid potato yield. However, the estimated general combining ability effects were on average 2 times larger than their respective specific combining ability quantile across all yield phenotypes. Tuber number general combining abilities and specific combining abilities were found to be highly correlated with total yield's genetic components. Tuber volume was shown to have the largest proportion of additive and nonadditive genetic variation suggesting under-selection of this phenotype in this population. The prominence of additive effects found for all traits presents evidence that the mid-parent value alone is useful for hybrid potato evaluation. Heterotic vigor stands to be useful in bolstering simpler traits but this will be dependent on target phenotypes and market requirements. This study represents the first diallel analysis of its kind in diploid potato using material derived from a commercial hybrid breeding program.

Subject(s)

Hybrid Vigor , Solanum tuberosum , Alleles , Diploidy , Hybrid Vigor/genetics , Plant Breeding , Solanum tuberosum/genetics

9.

A two-stage approach for the spatio-temporal analysis of high-throughput phenotyping data.

Pérez-Valencia, Diana M; Rodríguez-Álvarez, María Xosé; Boer, Martin P; Kronenberg, Lukas; Hund, Andreas; Cabrera-Bosquet, Llorenç; Millet, Emilie J; Eeuwijk, Fred A van.

Sci Rep ; 12(1): 3177, 2022 02 24.

Article in English | MEDLINE | ID: mdl-35210494

ABSTRACT

High throughput phenotyping (HTP) platforms and devices are increasingly used for the characterization of growth and developmental processes for large sets of plant genotypes. Such HTP data require challenging statistical analyses in which longitudinal genetic signals need to be estimated against a background of spatio-temporal noise processes. We propose a two-stage approach for the analysis of such longitudinal HTP data. In a first stage, we correct for design features and spatial trends per time point. In a second stage, we focus on the longitudinal modelling of the spatially corrected data, thereby taking advantage of shared longitudinal features between genotypes and plants within genotypes. We propose a flexible hierarchical three-level P-spline growth curve model, with plants/plots nested in genotypes, and genotypes nested in populations. For selection of genotypes in a plant breeding context, we show how to extract new phenotypes, like growth rates, from the estimated genotypic growth curves and their first-order derivatives. We illustrate our approach on HTP data from the PhenoArch greenhouse platform at INRAE Montpellier and the outdoor Field Phenotyping platform at ETH Zürich.

10.

GWAS in tetraploid potato: identification and validation of SNP markers associated with glycoalkaloid content.

Vos, Peter G; Paulo, M João; Bourke, Peter M; Maliepaard, Chris A; van Eeuwijk, Fred A; Visser, Richard G F; van Eck, Herman J.

Mol Breed ; 42(12): 76, 2022 Dec.

Article in English | MEDLINE | ID: mdl-37313326

ABSTRACT

Genome-wide association studies (GWAS) are a useful tool to unravel the genetic architecture of complex traits, but the results can be difficult to interpret. Population structure, genetic heterogeneity, and rare alleles easily result in false positive or false negative associations. This paper describes the analysis of a GWAS panel combined with three bi-parental mapping populations to validate GWAS results, using phenotypic data for steroidal glycoalkaloid (SGA) accumulation and the ratio (SGR) between the two major glycoalkaloids α-solanine and α-chaconine in potato tubers. SGAs are secondary metabolites in the Solanaceae family, functional as a defence against various pests and pathogens and in high quantities toxic for humans. With GWAS, we identified five quantitative trait loci (QTL) of which Sga1.1, Sgr8.1, and Sga11.1 were validated, but not Sga3.1 and Sgr7.1. In the bi-parental populations, Sga5.1 and Sga7.1 were mapped, but these were not identified with GWAS. The QTLs Sga1.1, Sga7.1, Sgr7.1, and Sgr8.1 co-localize with genes GAME9, GAME 6/GAME 11, SGT1, and SGT2, respectively. For other genes involved in SGA synthesis, no QTLs were identified. The results of this study illustrate a number of pitfalls in GWAS of which population structure seems the most important. We also show that introgression breeding for disease resistance has introduced new haplotypes to the gene pool involved in higher SGA levels in certain pedigrees. Finally, we show that high SGA levels remain unpredictable in potato but that α-solanine/α-chaconine ratio has a predictable outcome with specific SGT1 and SGT2 haplotypes. Supplementary Information: The online version contains supplementary material available at 10.1007/s11032-022-01344-2.

11.

An IBD-based mixed model approach for QTL mapping in multiparental populations.

Li, Wenhao; Boer, Martin P; Zheng, Chaozhi; Joosen, Ronny V L; van Eeuwijk, Fred A.

Theor Appl Genet ; 134(11): 3643-3660, 2021 Nov.

Article in English | MEDLINE | ID: mdl-34342658

ABSTRACT

KEY MESSAGE: The identity-by-descent (IBD)-based mixed model approach introduced in this study can detect quantitative trait loci (QTLs) referring to the parental origin and simultaneously account for multilevel relatedness of individuals within and across families. This unified approach is proved to be a powerful approach for all kinds of multiparental population (MPP) designs. Multiparental populations (MPPs) have become popular for quantitative trait loci (QTL) detection. Tools for QTL mapping in MPPs are mostly developed for specific MPPs and do not generalize well to other MPPs. We present an IBD-based mixed model approach for QTL mapping in all kinds of MPP designs, e.g., diallel, Nested Association Mapping (NAM), and Multiparental Advanced Generation Intercross (MAGIC) designs. The first step is to compute identity-by-descent (IBD) probabilities using a general Hidden Markov model framework, called reconstructing ancestry blocks bit by bit (RABBIT). Next, functions of IBD information are used as design matrices, or genetic predictors, in a mixed model approach to estimate variance components for multiallelic genetic effects associated with parents. Family-specific residual genetic effects are added, and a polygenic effect is structured by kinship relations between individuals. Case studies of simulated diallel, NAM, and MAGIC designs proved that the advanced IBD-based multi-QTL mixed model approach incorporating both kinship relations and family-specific residual variances (IBD.MQMkin_F) is robust across a variety of MPP designs and allele segregation patterns in comparison to a widely used benchmark association mapping method, and in most cases, outperformed or behaved at least as well as other tools developed for specific MPP designs in terms of mapping power and resolution. Successful analyses of real data cases confirmed the wide applicability of our IBD-based mixed model methodology.

Subject(s)

Chromosome Mapping , Models, Genetic , Quantitative Trait Loci , Alleles , Computer Simulation , Linear Models , Markov Chains , Plants/genetics

12.

Improving Genomic Prediction Using High-Dimensional Secondary Phenotypes.

Arouisse, Bader; Theeuwen, Tom P J M; van Eeuwijk, Fred A; Kruijer, Willem.

Front Genet ; 12: 667358, 2021.

Article in English | MEDLINE | ID: mdl-34108993

ABSTRACT

In the past decades, genomic prediction has had a large impact on plant breeding. Given the current advances of high-throughput phenotyping and sequencing technologies, it is increasingly common to observe a large number of traits, in addition to the target trait of interest. This raises the important question whether these additional or "secondary" traits can be used to improve genomic prediction for the target trait. With only a small number of secondary traits, this is known to be the case, given sufficiently high heritabilities and genetic correlations. Here we focus on the more challenging situation with a large number of secondary traits, which is increasingly common since the arrival of high-throughput phenotyping. In this case, secondary traits are usually incorporated through additional relatedness matrices. This approach is however infeasible when secondary traits are not measured on the test set, and cannot distinguish between genetic and non-genetic correlations. An alternative direction is to extend the classical selection indices using penalized regression. So far, penalized selection indices have not been applied in a genomic prediction setting, and require plot-level data in order to reliably estimate genetic correlations. Here we aim to overcome these limitations, using two novel approaches. Our first approach relies on a dimension reduction of the secondary traits, using either penalized regression or random forests (LS-BLUP/RF-BLUP). We then compute the bivariate GBLUP with the dimension reduction as secondary trait. For simulated data (with available plot-level data), we also use bivariate GBLUP with the penalized selection index as secondary trait (SI-BLUP). In our second approach (GM-BLUP), we follow existing multi-kernel methods but replace secondary traits by their genomic predictions, with the advantage that genomic prediction is also possible when secondary traits are only measured on the training set. For most of our simulated data, SI-BLUP was most accurate, often closely followed by RF-BLUP or LS-BLUP. In real datasets, involving metabolites in Arabidopsis and transcriptomics in maize, no method could substantially improve over univariate prediction when secondary traits were only available on the training set. LS-BLUP and RF-BLUP were most accurate when secondary traits were available also for the test set.

13.

Lessons from a GWAS study of a wheat pre-breeding program: pyramiding resistance alleles to Fusarium crown rot.

Malosetti, Marcos; Zwep, Laura B; Forrest, Kerrie; van Eeuwijk, Fred A; Dieters, Mark.

Theor Appl Genet ; 134(3): 897-908, 2021 Mar.

Article in English | MEDLINE | ID: mdl-33367942

ABSTRACT

Much has been published on QTL detection for complex traits using bi-parental and multi-parental crosses (linkage analysis) or diversity panels (GWAS studies). While successful for detection, transferability of results to real applications has proven more difficult. Here, we combined a QTL detection approach using a pre-breeding populations which utilized intensive phenotypic selection for the target trait across multiple plant generations, combined with rapid generation turnover (i.e. "speed breeding") to allow cycling of multiple plant generations each year. The reasoning is that QTL mapping information would complement the selection process by identifying the genome regions under selection within the relevant germplasm. Questions to answer were the location of the genomic regions determining response to selection and the origin of the favourable alleles within the pedigree. We used data from a pre-breeding program that aimed at pyramiding different resistance sources to Fusarium crown rot into elite (but susceptible) wheat backgrounds. The population resulted from a complex backcrossing scheme involving multiple resistance donors and multiple elite backgrounds, akin to a MAGIC population (985 genotypes in total, with founders, and two major offspring layers within the pedigree). A significant increase in the resistance level was observed (i.e. a positive response to selection) after the selection process, and 17 regions significantly associated with that response were identified using a GWAS approach. Those regions included known QTL as well as potentially novel regions contributing resistance to Fusarium crown rot. In addition, we were able to trace back the sources of the favourable alleles for each QTL. We demonstrate that QTL detection using breeding populations under selection for the target trait can identify QTL controlling the target trait and that the frequency of the favourable alleles was increased as a response to selection, thereby validating the QTL detected. This is a valuable opportunistic approach that can provide QTL information that is more easily transferred to breeding applications.

Subject(s)

Disease Resistance/genetics , Fusarium/physiology , Genetic Markers , Plant Breeding , Plant Diseases/genetics , Quantitative Trait Loci , Triticum/genetics , Alleles , Chromosome Mapping/methods , Chromosomes, Plant/genetics , Disease Resistance/immunology , Genetic Linkage , Plant Diseases/microbiology , Triticum/immunology , Triticum/microbiology

14.

Reconstruction of Networks with Direct and Indirect Genetic Effects.

Kruijer, Willem; Behrouzi, Pariya; Bustos-Korts, Daniela; Rodríguez-Álvarez, María Xosé; Mahmoudi, Seyed Mahdi; Yandell, Brian; Wit, Ernst; van Eeuwijk, Fred A.

Genetics ; 214(4): 781-807, 2020 04.

Article in English | MEDLINE | ID: mdl-32015018

ABSTRACT

Genetic variance of a phenotypic trait can originate from direct genetic effects, or from indirect effects, i.e., through genetic effects on other traits, affecting the trait of interest. This distinction is often of great importance, for example, when trying to improve crop yield and simultaneously control plant height. As suggested by Sewall Wright, assessing contributions of direct and indirect effects requires knowledge of (1) the presence or absence of direct genetic effects on each trait, and (2) the functional relationships between the traits. Because experimental validation of such relationships is often unfeasible, it is increasingly common to reconstruct them using causal inference methods. However, most current methods require all genetic variance to be explained by a small number of quantitative trait loci (QTL) with fixed effects. Only a few authors have considered the "missing heritability" case, where contributions of many undetectable QTL are modeled with random effects. Usually, these are treated as nuisance terms that need to be eliminated by taking residuals from a multi-trait mixed model (MTM). But fitting such an MTM is challenging, and it is impossible to infer the presence of direct genetic effects. Here, we propose an alternative strategy, where genetic effects are formally included in the graph. This has important advantages: (1) genetic effects can be directly incorporated in causal inference, implemented via our PCgen algorithm, which can analyze many more traits; and (2) we can test the existence of direct genetic effects, and improve the orientation of edges between traits. Finally, we show that reconstruction is much more accurate if individual plant or plot data are used, instead of genotypic means. We have implemented the PCgen-algorithm in the R-package pcgen.

Subject(s)

Crops, Agricultural/genetics , Models, Genetic , Gene Regulatory Networks , Phenotype , Quantitative Trait Loci , Quantitative Trait, Heritable

15.

Multi-environment analysis of sorghum breeding trials using additive and dominance genomic relationships.

Hunt, Colleen H; Hayes, Ben J; van Eeuwijk, Fred A; Mace, Emma S; Jordan, David R.

Theor Appl Genet ; 133(3): 1009-1018, 2020 Mar.

Article in English | MEDLINE | ID: mdl-31907563

ABSTRACT

KEY MESSAGE: Multi-environment models using marker-based kinship information for both additive and dominance effects can accurately predict hybrid performance in different environments. Sorghum is an important hybrid crop that is grown extensively in many subtropical and tropical regions including Northern NSW and Queensland in Australia. The highly varying weather patterns in the Australian summer months mean that sorghum hybrids exhibit a great deal of variation in yield between locations. To ultimately enable prediction of the outcome of crossing parental lines, both additive effects on yield performance and dominance interaction effects need to be characterised. This paper demonstrates that fitting a linear mixed model that includes both types of effects calculated using genetic markers in relationship matrices improves predictions. Genotype by environment interactions was investigated by comparing FA1 (single-factor analytic) and FA2 (two-factor analytic) structures. The G×E causes a change in hybrid rankings between trials with a difference of up to 25% of the hybrids in the top 10% of each trial. The prediction accuracies increased with the addition of the dominance term (over and above that achieved with an additive effect alone) by an average of 15% and a maximum of 60%. The percentage of dominance of the total genetic variance varied between trials with the trials with higher broad-sense heritability having the greater percentage of dominance. The inclusion of dominance in the factor analytic models improves the accuracy of the additive effects. Breeders selecting high yielding parents for crossing need to be aware of effects due to environment and dominance.

Subject(s)

Plant Breeding , Sorghum/genetics , Australia , Climate , Epistasis, Genetic , Genes, Dominant , Genetic Association Studies , Genetic Markers , Genetic Variation , Genomics , Genotype , Models, Genetic , Pedigree , Phenotype , Polymorphism, Single Nucleotide , Selection, Genetic , Sorghum/growth & development

16.

Dietary Intakes of Vegetable Protein, Folate, and Vitamins B-6 and B-12 Are Partially Correlated with Physical Functioning of Dutch Older Adults Using Copula Graphical Models.

Behrouzi, Pariya; Grootswagers, Pol; Keizer, Paul L C; Smeets, Ellen T H C; Feskens, Edith J M; de Groot, Lisette C P G M; van Eeuwijk, Fred A.

J Nutr ; 150(3): 634-643, 2020 03 01.

Article in English | MEDLINE | ID: mdl-31858107

ABSTRACT

BACKGROUND: In nutritional epidemiology, dealing with confounding and complex internutrient relations are major challenges. An often-used approach is dietary pattern analyses, such as principal component analysis, to deal with internutrient correlations, and to more closely resemble the true way nutrients are consumed. However, despite these improvements, these approaches still require subjective decisions in the preselection of food groups. Moreover, they do not make efficient use of multivariate dietary data, because they detect only marginal associations. We propose the use of copula graphical models (CGMs) to model and make statistical inferences regarding complex associations among variables in multivariate data, where associations between all variables can be learned simultaneously. OBJECTIVE: We aimed to reconstruct nutritional intake and physical functioning networks in Dutch older adults by applying a CGM. METHODS: We addressed this issue by uncovering the pairwise associations between variables while correcting for the effect of remaining variables. More specifically, we used a CGM to infer the precision matrix, which contains all the conditional independence relations between nodes in the graph. The nonzero elements of the precision matrix indicate the presence of a direct association. We applied this method to reconstruct nutrient-physical functioning networks from the combined data of 4 studies (Nu-Age, ProMuscle, ProMO, and V-Fit, total n = 662, mean ± SD age = 75 ± 7 y). The method was implemented in the R package nutriNetwork which is freely available at https://cran.r-project.org/web/packages/nutriNetwork. RESULTS: Greater intakes of vegetable protein and vitamin B-6 were partially correlated with higher scores on the total Short Physical Performance Battery (SPPB) and the chair rise test. Greater intakes of vitamin B-12 and folate were partially correlated with higher scores on the chair rise test and the total SPPB, respectively. CONCLUSIONS: We determined that vegetable protein, vitamin B-6, folate, and vitamin B-12 intakes are partially correlated with improved functional outcome measurements in Dutch older adults.

Subject(s)

Folic Acid/administration & dosage , Models, Theoretical , Physical Functional Performance , Plant Proteins, Dietary/administration & dosage , Vitamin B 12/administration & dosage , Vitamin B 6/administration & dosage , Aged , Aged, 80 and over , Body Mass Index , Frail Elderly , Humans , Netherlands

17.

From QTLs to Adaptation Landscapes: Using Genotype-To-Phenotype Models to Characterize G×E Over Time.

Bustos-Korts, Daniela; Malosetti, Marcos; Chenu, Karine; Chapman, Scott; Boer, Martin P; Zheng, Bangyou; van Eeuwijk, Fred A.

Front Plant Sci ; 10: 1540, 2019.

Article in English | MEDLINE | ID: mdl-31867027

ABSTRACT

Genotype by environment interaction (G×E) for the target trait, e.g. yield, is an emerging property of agricultural systems and results from the interplay between a hierarchy of secondary traits involving the capture and allocation of environmental resources during the growing season. This hierarchy of secondary traits ranges from basic traits that correspond to response mechanisms/sensitivities, to intermediate traits that integrate a larger number of processes over time and therefore show a larger amount of G×E. Traits underlying yield differ in their contribution to adaptation across environmental conditions and have different levels of G×E. Here, we provide a framework to study the performance of genotype to phenotype (G2P) modeling approaches. We generate and analyze response surfaces, or adaptation landscapes, for yield and yield related traits, emphasizing the organization of the traits in a hierarchy and their development and interactions over time. We use the crop growth model APSIM-wheat with genotype-dependent parameters as a tool to simulate non-linear trait responses over time with complex trait dependencies and apply it to wheat crops in Australia. For biological realism, APSIM parameters were given a genetic basis of 300 QTLs sampled from a gamma distribution whose shape and rate parameters were estimated from real wheat data. In the simulations, the hierarchical organization of the traits and their interactions over time cause G×E for yield even when underlying traits do not show G×E. Insight into how G×E arises during growth and development helps to improve the accuracy of phenotype predictions within and across environments and to optimize trial networks. We produced a tangible simulated adaptation landscape for yield that we first investigated for its biological credibility by statistical models for G×E that incorporate genotypic and environmental covariables. Subsequently, the simulated trait data were used to evaluate statistical genotype-to-phenotype models for multiple traits and environments and to characterize relationships between traits over time and across environments, as a way to identify traits that could be useful to select for specific adaptation. Designed appropriately, these types of simulated landscapes might also serve as a basis to train other, more deep learning methodologies in order to transfer such network models to real-world situations.

18.

Combining Crop Growth Modeling and Statistical Genetic Modeling to Evaluate Phenotyping Strategies.

Bustos-Korts, Daniela; Boer, Martin P; Malosetti, Marcos; Chapman, Scott; Chenu, Karine; Zheng, Bangyou; van Eeuwijk, Fred A.

Front Plant Sci ; 10: 1491, 2019.

Article in English | MEDLINE | ID: mdl-31827479

ABSTRACT

Genomic prediction of complex traits, say yield, benefits from including information on correlated component traits. Statistical criteria to decide which yield components to consider in the prediction model include the heritability of the component traits and their genetic correlation with yield. Not all component traits are easy to measure. Therefore, it may be attractive to include proxies to yield components, where these proxies are measured in (high-throughput) phenotyping platforms during the growing season. Using the Agricultural Production Systems Simulator (APSIM)-wheat cropping systems model, we simulated phenotypes for a wheat diversity panel segregating for a set of physiological parameters regulating phenology, biomass partitioning, and the ability to capture environmental resources. The distribution of the additive quantitative trait locus effects regulating the APSIM physiological parameters approximated the same distribution of quantitative trait locus effects on real phenotypic data for yield and heading date. We use the crop growth model APSIM-wheat to simulate phenotypes in three Australian environments with contrasting water deficit patterns. The APSIM output contained the dynamics of biomass and canopy cover, plus yield at the end of the growing season. Each water deficit pattern triggered different adaptive mechanisms and the impact of component traits differed between drought scenarios. We evaluated multiple phenotyping schedules by adding plot and measurement error to the dynamics of biomass and canopy cover. We used these trait dynamics to fit parametric models and P-splines to extract parameters with a larger heritability than the phenotypes at individual time points. We used those parameters in multi-trait prediction models for final yield. The combined use of crop growth models and multi-trait genomic prediction models provides a procedure to assess the efficiency of phenotyping strategies and compare methods to model trait dynamics. It also allows us to quantify the impact of yield components on yield prediction accuracy even in different environment types. In scenarios with mild or no water stress, yield prediction accuracy benefitted from including biomass and green canopy cover parameters. The advantage of the multi-trait model was smaller for the early-drought scenario, due to the reduced correlation between the secondary and the target trait. Therefore, multi-trait genomic prediction models for yield require scenario-specific correlated traits.

19.

Genomic Prediction of Grain Yield and Drought-Adaptation Capacity in Sorghum Is Enhanced by Multi-Trait Analysis.

Velazco, Julio G; Jordan, David R; Mace, Emma S; Hunt, Colleen H; Malosetti, Marcos; van Eeuwijk, Fred A.

Front Plant Sci ; 10: 997, 2019.

Article in English | MEDLINE | ID: mdl-31417601

ABSTRACT

Grain yield and stay-green drought adaptation trait are important targets of selection in grain sorghum breeding for broad adaptation to a range of environments. Genomic prediction for these traits may be enhanced by joint multi-trait analysis. The objectives of this study were to assess the capacity of multi-trait models to improve genomic prediction of parental breeding values for grain yield and stay-green in sorghum by using information from correlated auxiliary traits, and to determine the combinations of traits that optimize predictive results in specific scenarios. The dataset included phenotypic performance of 2645 testcross hybrids across 26 environments as well as genomic and pedigree information on their female parental lines. The traits considered were grain yield (GY), stay-green (SG), plant height (PH), and flowering time (FT). We evaluated the improvement in predictive performance of multi-trait G-BLUP models relative to single-trait G-BLUP. The use of a blended kinship matrix exploiting pedigree and genomic information was also explored to optimize multi-trait predictions. Predictive ability for GY increased up to 16% when PH information on the training population was exploited through multi-trait genomic analysis. For SG prediction, full advantage from multi-trait G-BLUP was obtained only when GY information was also available on the predicted lines per se, with predictive ability improvements of up to 19%. Predictive ability, unbiasedness and accuracy of predictions from conventional multi-trait G-BLUP were further optimized by using a combined pedigree-genomic relationship matrix. Results of this study suggest that multi-trait genomic evaluation combining routinely measured traits may be used to improve prediction of crop productivity and drought adaptability in grain sorghum.

20.

Construction of Genetic Linkage Maps in Multiparental Populations.

Zheng, Chaozhi; Boer, Martin P; van Eeuwijk, Fred A.

Genetics ; 212(4): 1031-1044, 2019 08.

Article in English | MEDLINE | ID: mdl-31182487

ABSTRACT

Construction of genetic linkage maps has become a routine step for mapping quantitative trait loci (QTL), particularly in animal and plant breeding populations. Many multiparental populations have recently been produced to increase genetic diversity and QTL mapping resolution. However, few software packages are available for map construction in these populations. In this paper, we build a general framework for the construction of genetic linkage maps from genotypic data in diploid populations, including bi- and multiparental populations, cross-pollinated (CP) populations, and breeding pedigrees. The framework is implemented as an automatic pipeline called magicMap, where the maximum multilocus likelihood approach utilizes genotypic information efficiently. We evaluate magicMap by extensive simulations and eight real datasets: one biparental, one CP, four multiparent advanced generation intercross (MAGIC), and two nested association mapping (NAM) populations, the number of markers ranging from a few hundred to tens of thousands. Not only is magicMap the only software capable of accommodating all of these designs, it is more accurate and robust to missing genotypes and genotyping errors than commonly used packages.

Subject(s)

Chromosome Mapping/methods , Genetics, Population , Software , Algorithms , Animals , Arabidopsis/genetics , Breeding , Diploidy , Genetic Markers , Genotyping Techniques/methods

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL