Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
2.
Plant Methods ; 20(1): 42, 2024 Mar 16.
Article in English | MEDLINE | ID: mdl-38493115

ABSTRACT

Genomic selection (GS) has become an increasingly popular tool in plant breeding programs, propelled by declining genotyping costs, an increase in computational power, and rediscovery of the best linear unbiased prediction methodology over the past two decades. This development has led to an accumulation of extensive historical datasets with genotypic and phenotypic information, triggering the question of how to best utilize these datasets. Here, we investigate whether all available data or a subset should be used to calibrate GS models for across-year predictions in a 7-year dataset of a commercial hybrid sunflower breeding program. We employed a multi-objective optimization approach to determine the ideal years to include in the training set (TRS). Next, for a given combination of TRS years, we further optimized the TRS size and its genetic composition. We developed the Min_GRM size optimization method which consistently found the optimal TRS size, reducing dimensionality by 20% with an approximately 1% loss in predictive ability. Additionally, the Tails_GEGVs algorithm displayed potential, outperforming the use of all data by using just 60% of it for grain yield, a high-complexity, low-heritability trait. Moreover, maximizing the genetic diversity of the TRS resulted in a consistent predictive ability across the entire range of genotypic values in the test set. Interestingly, the Tails_GEGVs algorithm, due to its ability to leverage heterogeneity, enhanced predictive performance for key hybrids with extreme genotypic values. Our study provides new insights into the optimal utilization of historical data in plant breeding programs, resulting in improved GS model predictive ability.

3.
Mol Plant ; 17(4): 552-578, 2024 Apr 01.
Article in English | MEDLINE | ID: mdl-38475993

ABSTRACT

Genomic selection, the application of genomic prediction (GP) models to select candidate individuals, has significantly advanced in the past two decades, effectively accelerating genetic gains in plant breeding. This article provides a holistic overview of key factors that have influenced GP in plant breeding during this period. We delved into the pivotal roles of training population size and genetic diversity, and their relationship with the breeding population, in determining GP accuracy. Special emphasis was placed on optimizing training population size. We explored its benefits and the associated diminishing returns beyond an optimum size. This was done while considering the balance between resource allocation and maximizing prediction accuracy through current optimization algorithms. The density and distribution of single-nucleotide polymorphisms, level of linkage disequilibrium, genetic complexity, trait heritability, statistical machine-learning methods, and non-additive effects are the other vital factors. Using wheat, maize, and potato as examples, we summarize the effect of these factors on the accuracy of GP for various traits. The search for high accuracy in GP-theoretically reaching one when using the Pearson's correlation as a metric-is an active research area as yet far from optimal for various traits. We hypothesize that with ultra-high sizes of genotypic and phenotypic datasets, effective training population optimization methods and support from other omics approaches (transcriptomics, metabolomics and proteomics) coupled with deep-learning algorithms could overcome the boundaries of current limitations to achieve the highest possible prediction accuracy, making genomic selection an effective tool in plant breeding.


Subject(s)
Genome, Plant , Plant Breeding , Humans , Genome, Plant/genetics , Selection, Genetic , Genomics , Phenotype , Genotype , Plants , Polymorphism, Single Nucleotide/genetics
4.
JACS Au ; 3(10): 2631-2639, 2023 Oct 23.
Article in English | MEDLINE | ID: mdl-37885586

ABSTRACT

The development of emerging decarbonization technologies requires advanced tools for decision-making that incorporate the environmental perspective from the early design. Today, Life Cycle Assessment (LCA) is the preferred tool to promote sustainability in the technology development, identifying environmental challenges and opportunities and defining the final implementation pathways. So far, most environmental studies related to decarbonization emerging solutions are still limited to midpoint metrics, mainly the carbon footprint, with global sustainability implications being relatively unexplored. In this sense, the Planetary Boundaries (PBs) have been recently proposed to identify the distance to the ideal reference state. Hence, PB-LCA methodology can be currently applied to transform the resource use and emissions to changes in the values of PB control variables. This study shows a complete picture of the LCA's role in developing emerging technologies. For this purpose, a case study based on the electrochemical conversion of CO2 to formic acid is used to show the possibilities of LCA approaches highlighting the potential pitfalls when going beyond greenhouse gas emission reduction and obtaining the absolute sustainability level in terms of four PBs.

5.
Theor Appl Genet ; 136(3): 30, 2023 Mar 09.
Article in English | MEDLINE | ID: mdl-36892603

ABSTRACT

KEY MESSAGE: Maximizing CDmean and Avg_GRM_self were the best criteria for training set optimization. A training set size of 50-55% (targeted) or 65-85% (untargeted) is needed to obtain 95% of the accuracy.  With the advent of genomic selection (GS) as a widespread breeding tool, mechanisms to efficiently design an optimal training set for GS models became more relevant, since they allow maximizing the accuracy while minimizing the phenotyping costs. The literature described many training set optimization methods, but there is a lack of a comprehensive comparison among them. This work aimed to provide an extensive benchmark among optimization methods and optimal training set size by testing a wide range of them in seven datasets, six different species, different genetic architectures, population structure, heritabilities, and with several GS models to provide some guidelines about their application in breeding programs. Our results showed that targeted optimization (uses information from the test set) performed better than untargeted (does not use test set data), especially when heritability was low. The mean coefficient of determination was the best targeted method, although it was computationally intensive. Minimizing the average relationship within the training set was the best strategy for untargeted optimization. Regarding the optimal training set size, maximum accuracy was obtained when the training set was the entire candidate set. Nevertheless, a 50-55% of the candidate set was enough to reach 95-100% of the maximum accuracy in the targeted scenario, while we needed a 65-85% for untargeted optimization. Our results also suggested that a diverse training set makes GS robust against population structure, while including clustering information was less effective. The choice of the GS model did not have a significant influence on the prediction accuracies.


Subject(s)
Models, Genetic , Selection, Genetic , Phenotype , Genome , Genomics/methods , Genotype
SELECTION OF CITATIONS
SEARCH DETAIL
...