Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 30
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
G3 (Bethesda) ; 13(10)2023 09 30.
Artigo em Inglês | MEDLINE | ID: mdl-37523773

RESUMO

In maize, the community-standard transformant line B104 is a useful model for dissecting features of transfer DNA (T-DNA) integration due to its compatibility with Agrobacterium-mediated transformation and the availability of its genome sequence. Knowledge of transgene integration sites permits the analysis of the genomic environment that governs the strength of gene expression and phenotypic effects due to the disruption of an endogenous gene or regulatory element. In this study, we optimized a fusion primer and nested integrated PCR (FPNI-PCR) technique for T-DNA detection in maize to characterize the integration sites of 89 T-DNA insertions in 81 transformant lines. T-DNA insertions preferentially occurred in gene-rich regions and regions distant from centromeres. Integration junctions with and without microhomologous sequences as well as junctions with de novo sequences were detected. Sequence analysis of integration junctions indicated that T-DNA was incorporated via the error-prone repair pathways of nonhomologous (predominantly) and microhomology-mediated (minor) end-joining. This report provides a quantitative assessment of Agrobacterium-mediated T-DNA integration in maize with respect to insertion site features, the genomic distribution of T-DNA incorporation, and the mechanisms of integration. It also demonstrates the utility of the FPNI-PCR technique, which can be adapted to any species of interest.


Assuntos
Agrobacterium , Zea mays , Agrobacterium/genética , Zea mays/genética , Transformação Genética , DNA Bacteriano/genética , DNA de Plantas/genética , Plantas Geneticamente Modificadas/genética
2.
G3 (Bethesda) ; 13(9)2023 08 30.
Artigo em Inglês | MEDLINE | ID: mdl-37368984

RESUMO

Tropical maize can be used to diversify the genetic base of temperate germplasm and help create climate-adapted cultivars. However, tropical maize is unadapted to temperate environments, in which sensitivities to long photoperiods and cooler temperatures result in severely delayed flowering times, developmental defects, and little to no yield. Overcoming this maladaptive syndrome can require a decade of phenotypic selection in a targeted, temperate environment. To accelerate the incorporation of tropical diversity in temperate breeding pools, we tested if an additional generation of genomic selection can be used in an off-season nursery where phenotypic selection is not very effective. Prediction models were trained using flowering time recorded on random individuals in separate lineages of a heterogenous population grown at two northern U.S. latitudes. Direct phenotypic selection and genomic prediction model training was performed within each target environment and lineage, followed by genomic prediction of random intermated progenies in the off-season nursery. Performance of genomic prediction models was evaluated on self-fertilized progenies of prediction candidates grown in both target locations in the following summer season. Prediction abilities ranged from 0.30 to 0.40 among populations and evaluation environments. Prediction models with varying marker effect distributions or spatial field effects had similar accuracies. Our results suggest that genomic selection in a single off-season generation could increase genetic gains for flowering time by more than 50% compared to direct selection in summer seasons only, reducing the time required to change the population mean to an acceptably adapted flowering time by about one-third to one-half.


Assuntos
Melhoramento Vegetal , Zea mays , Humanos , Zea mays/genética , Meio Ambiente , Adaptação Fisiológica/genética , Genômica , Seleção Genética
3.
BMC Genom Data ; 24(1): 29, 2023 05 25.
Artigo em Inglês | MEDLINE | ID: mdl-37231352

RESUMO

OBJECTIVES: This report provides information about the public release of the 2018-2019 Maize G X E project of the Genomes to Fields (G2F) Initiative datasets. G2F is an umbrella initiative that evaluates maize hybrids and inbred lines across multiple environments and makes available phenotypic, genotypic, environmental, and metadata information. The initiative understands the necessity to characterize and deploy public sources of genetic diversity to face the challenges for more sustainable agriculture in the context of variable environmental conditions. DATA DESCRIPTION: Datasets include phenotypic, climatic, and soil measurements, metadata information, and inbred genotypic information for each combination of location and year. Collaborators in the G2F initiative collected data for each location and year; members of the group responsible for coordination and data processing combined all the collected information and removed obvious erroneous data. The collaborators received the data before the DOI release to verify and declare that the data generated in their own locations was accurate. ReadMe and description files are available for each dataset. Previous years of evaluation are already publicly available, with common hybrids present to connect across all locations and years evaluated since this project's inception.


Assuntos
Genoma de Planta , Zea mays , Fenótipo , Zea mays/genética , Estações do Ano , Genótipo , Genoma de Planta/genética
4.
New Phytol ; 238(2): 737-749, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36683443

RESUMO

Crop genetic diversity for climate adaptations is globally partitioned. We performed experimental evolution in maize to understand the response to selection and how plant germplasm can be moved across geographical zones. Initialized with a common population of tropical origin, artificial selection on flowering time was performed for two generations at eight field sites spanning 25° latitude, a 2800 km transect. We then jointly tested all selection lineages across the original sites of selection, for the target trait and 23 other traits. Modeling intergenerational shifts in a physiological reaction norm revealed separate components for flowering-time plasticity. Generalized and local modes of selection altered the plasticity of each lineage, leading to a latitudinal pattern in the responses to selection that were strongly driven by photoperiod. This transformation led to widespread changes in developmental, architectural, and yield traits, expressed collectively in an environment-dependent manner. Furthermore, selection for flowering time alone alleviated a maladaptive syndrome and improved yields for tropical maize in the temperate zone. Our findings show how phenotypic selection can rapidly shift the flowering phenology and plasticity of maize. They also demonstrate that selecting crops to local conditions can accelerate adaptation to climate change.


Assuntos
Flores , Zea mays , Flores/genética , Zea mays/genética , Fenótipo , Fotoperíodo
5.
Nat Commun ; 13(1): 3225, 2022 06 09.
Artigo em Inglês | MEDLINE | ID: mdl-35680899

RESUMO

Combined phenomic and genomic approaches are required to evaluate the margin of progress of breeding strategies. Here, we analyze 65 years of genetic progress in maize yield, which was similar (101 kg ha-1 year-1) across most frequent environmental scenarios in the European growing area. Yield gains were linked to physiologically simple traits (plant phenology and architecture) which indirectly affected reproductive development and light interception in all studied environments, marked by significant genomic signatures of selection. Conversely, studied physiological processes involved in stress adaptation remained phenotypically unchanged (e.g. stomatal conductance and growth sensitivity to drought) and showed no signatures of selection. By selecting for yield, breeders indirectly selected traits with stable effects on yield, but not physiological traits whose effects on yield can be positive or negative depending on environmental conditions. Because yield stability under climate change is desirable, novel breeding strategies may be needed for exploiting alleles governing physiological adaptive traits.


Assuntos
Melhoramento Vegetal , Zea mays , Alelos , Secas , Fenótipo , Zea mays/genética
6.
G3 (Bethesda) ; 11(11)2021 10 19.
Artigo em Inglês | MEDLINE | ID: mdl-34542584

RESUMO

Lima bean, Phaseolus lunatus, is closely related to common bean and is high in fiber and protein, with a low glycemic index. Lima bean is widely grown in the state of Delaware, where late summer and early fall weather are conducive to pod production. The same weather conditions also promote diseases such as pod rot and downy mildew, the latter of which has caused previous epidemics. A better understanding of the genes underlying resistance to this and other pathogens is needed to keep this industry thriving in the region. Our current study sought to sequence, assemble, and annotate a commercially available cultivar called Bridgeton, which could then serve as a reference genome, a basis of comparison to other Phaseolus taxa, and a resource for the identification of potential resistance genes. Combined efforts of sequencing, linkage, and comparative analysis resulted in a 623 Mb annotated assembly for lima bean, as well as a better understanding of an evolutionarily dynamic resistance locus in legumes.


Assuntos
Phaseolus , Ligação Genética , Phaseolus/genética
7.
G3 (Bethesda) ; 11(2)2021 02 09.
Artigo em Inglês | MEDLINE | ID: mdl-33585867

RESUMO

High-dimensional and high-throughput genomic, field performance, and environmental data are becoming increasingly available to crop breeding programs, and their integration can facilitate genomic prediction within and across environments and provide insights into the genetic architecture of complex traits and the nature of genotype-by-environment interactions. To partition trait variation into additive and dominance (main effect) genetic and corresponding genetic-by-environment variances, and to identify specific environmental factors that influence genotype-by-environment interactions, we curated and analyzed genotypic and phenotypic data on 1918 maize (Zea mays L.) hybrids and environmental data from 65 testing environments. For grain yield, dominance variance was similar in magnitude to additive variance, and genetic-by-environment variances were more important than genetic main effect variances. Models involving both additive and dominance relationships best fit the data and modeling unique genetic covariances among all environments provided the best characterization of the genotype-by-environment interaction patterns. Similarity of relative hybrid performance among environments was modeled as a function of underlying weather variables, permitting identification of weather covariates driving correlations of genetic effects across environments. The resulting models can be used for genomic prediction of mean hybrid performance across populations of environments tested or for environment-specific predictions. These results can also guide efforts to incorporate high-throughput environmental data into genomic prediction models and predict values in new environments characterized with the same environmental characteristics.


Assuntos
Interação Gene-Ambiente , Zea mays , Genótipo , Modelos Genéticos , Fenótipo , Melhoramento Vegetal
8.
Bioinformatics ; 37(6): 868-870, 2021 05 05.
Artigo em Inglês | MEDLINE | ID: mdl-32840564

RESUMO

MOTIVATION: Ancestral haplotype maps provide useful information about genomic variation and insights into biological processes. Reconstructing the descendent haplotype structure of homologous chromosomes, particularly for large numbers of individuals, can help with characterizing the recombination landscape, elucidating genotype-to-phenotype relationships, improving genomic predictions and more. Inferring haplotype maps from sparse genotype data is an efficient approach to whole-genome haplotyping, but this is a non-trivial problem. A standardized approach is needed to validate whether haplotype reconstruction software, conceived population designs and existing data for a given population provides accurate haplotype information for further inference. RESULTS: We introduce SPEARS, a pipeline for the simulation-based appraisal of genome-wide haplotype maps constructed from sparse genotype data. Using a specified pedigree, the pipeline generates virtual genotypes (known data) with genotyping errors and missing data structure. It then proceeds to mimic analysis in practice, capturing sources of error due to genotyping, imputation and haplotype inference. Standard metrics allow researchers to assess different population designs and which features of haplotype structure or regions of the genome are sufficiently accurate for analysis. Haplotype maps for 1000 outcross progeny from a multi-parent population of maize are used to demonstrate SPEARS. AVAILABILITYAND IMPLEMENTATION: SPEARS, the protocol and suite of scripts, are publicly available under an MIT license at GitHub (https://github.com/maizeatlas/spears). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Genoma , Software , Simulação por Computador , Genótipo , Haplótipos/genética , Humanos , Polimorfismo de Nucleotídeo Único/genética
9.
Front Genet ; 11: 592769, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33763106

RESUMO

Genomic prediction provides an efficient alternative to conventional phenotypic selection for developing improved cultivars with desirable characteristics. New and improved methods to genomic prediction are continually being developed that attempt to deal with the integration of data types beyond genomic information. Modern automated weather systems offer the opportunity to capture continuous data on a range of environmental parameters at specific field locations. In principle, this information could characterize training and target environments and enhance predictive ability by incorporating weather characteristics as part of the genotype-by-environment (G×E) interaction component in prediction models. We assessed the usefulness of including weather data variables in genomic prediction models using a naïve environmental kinship model across 30 environments comprising the Genomes to Fields (G2F) initiative in 2014 and 2015. Specifically four different prediction scenarios were evaluated (i) tested genotypes in observed environments; (ii) untested genotypes in observed environments; (iii) tested genotypes in unobserved environments; and (iv) untested genotypes in unobserved environments. A set of 1,481 unique hybrids were evaluated for grain yield. Evaluations were conducted using five different models including main effect of environments; general combining ability (GCA) effects of the maternal and paternal parents modeled using the genomic relationship matrix; specific combining ability (SCA) effects between maternal and paternal parents; interactions between genetic (GCA and SCA) effects and environmental effects; and finally interactions between the genetics effects and environmental covariates. Incorporation of the genotype-by-environment interaction term improved predictive ability across all scenarios. However, predictive ability was not improved through inclusion of naive environmental covariates in G×E models. More research should be conducted to link the observed weather conditions with important physiological aspects in plant development to improve predictive ability through the inclusion of weather data.

10.
Genetics ; 213(4): 1479-1494, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31615843

RESUMO

Understanding the evolutionary capacity of populations to adapt to novel environments is one of the major pursuits in genetics. Moreover, for plant breeding, maladaptation is the foremost barrier to capitalizing on intraspecific variation in order to develop new breeds for future climate scenarios in agriculture. Using a unique study design, we simultaneously dissected the population and quantitative genomic basis of short-term evolution in a tropical landrace of maize that was translocated to a temperate environment and phenotypically selected for adaptation in flowering time phenology. Underlying 10 generations of directional selection, which resulted in a 26-day mean decrease in female-flowering time, [Formula: see text] of the heritable variation mapped to [Formula: see text] of the genome, where, overall, alleles shifted in frequency beyond the boundaries of genetic drift in the expected direction given their flowering time effects. However, clustering these non-neutral alleles based on their profiles of frequency change revealed transient shifts underpinning a transition in genotype-phenotype relationships across generations. This was distinguished by initial reductions in the frequencies of few relatively large positive effect alleles and subsequent enrichment of many rare negative effect alleles, some of which appear to represent allelic series. With these genomic shifts, the population reached an adapted state while retaining [Formula: see text] of the standing molecular marker variation in the founding population. Robust selection and association mapping tests highlighted several key genes driving the phenotypic response to selection. Our results reveal the evolutionary dynamics of a finite polygenic architecture conditioning a capacity for rapid environmental adaptation in maize.


Assuntos
Adaptação Fisiológica/genética , Meio Ambiente , Genoma de Planta , Genômica , Zea mays/genética , Zea mays/fisiologia , Mapeamento Cromossômico , Cromossomos de Plantas/genética , Flores/genética , Efeito Fundador , Frequência do Gene/genética , Genes de Plantas , Variação Genética , Genética Populacional , Haplótipos/genética , Fenômica , Fenótipo , Seleção Genética , Fatores de Tempo
11.
G3 (Bethesda) ; 9(9): 2905-2912, 2019 09 04.
Artigo em Inglês | MEDLINE | ID: mdl-31300480

RESUMO

Southern Leaf Blight, Northern Leaf Blight, and Gray Leaf Spot, caused by ascomycete fungi, are among the most important foliar diseases of maize worldwide. Previously, disease resistance quantitative trait loci (QTL) for all three diseases were identified in a connected set of chromosome segment substitution line (CSSL) populations designed for the identification of disease resistance QTL. Some QTL for different diseases co-localized, indicating the presence of multiple disease resistance (MDR) QTL. The goal of this study was to perform an independent test of several of the MDR QTL identified to confirm their existence and derive a more precise estimate of allele additive and dominance effects. Twelve F2:3 family populations were produced, in which selected QTL were segregating in an otherwise uniform genetic background. The populations were assessed for each of the three diseases in replicated trials and genotyped with markers previously associated with disease resistance. Pairwise phenotypic correlations across all the populations for resistance to the three diseases ranged from 0.2 to 0.3 and were all significant at the alpha level of 0.01. Of the 44 QTL tested, 16 were validated (identified at the same genomic location for the same disease or diseases) and several novel QTL/disease associations were found. Two MDR QTL were associated with resistance to all three diseases. This study identifies several potentially important MDR QTL and demonstrates the importance of independently evaluating QTL effects following their initial identification.


Assuntos
Resistência à Doença/genética , Doenças das Plantas/genética , Locos de Características Quantitativas , Zea mays/genética , Ascomicetos/patogenicidade , Marcadores Genéticos , Doenças das Plantas/microbiologia , Zea mays/microbiologia
12.
Mol Plant ; 12(3): 390-401, 2019 03 04.
Artigo em Inglês | MEDLINE | ID: mdl-30625380

RESUMO

Improved capacity of genomics and biotechnology has greatly enhanced genetic studies in different areas. Genomic selection exploits the genotype-to-phenotype relationship at the whole-genome level and is being implemented in many crops. Here we show that design-thinking and data-mining techniques can be leveraged to optimize genomic prediction of hybrid performance. We phenotyped a set of 276 maize hybrids generated by crossing founder inbreds of nested association mapping populations for flowering time, ear height, and grain yield. With 10 296 310 SNPs available from the parental inbreds, we explored the patterns of genomic relationships and phenotypic variation to establish training samples based on clustering, graphic network analysis, and genetic mating scheme. Our analysis showed that training set designs outperformed random sampling and earlier methods that either minimize the mean of prediction error variance or maximize the mean of generalized coefficient of determination. Additional analyses of 2556 wheat hybrids from an early-stage hybrid breeding system and 1439 rice hybrids from an established hybrid breeding system validated the approaches. Together, we demonstrated that effective genomic prediction models can be established with a training set 2%-13% of the size of the whole set, enabling an efficient exploration of enormous inference space of genetic combinations.


Assuntos
Genômica/métodos , Oryza/genética , Triticum/genética , Zea mays/genética , Produtos Agrícolas/genética , Produtos Agrícolas/crescimento & desenvolvimento , Genótipo , Hibridização Genética , Endogamia , Oryza/crescimento & desenvolvimento , Fenótipo , Melhoramento Vegetal , Polimorfismo de Nucleotídeo Único , Triticum/crescimento & desenvolvimento , Zea mays/crescimento & desenvolvimento
13.
BMC Bioinformatics ; 19(1): 302, 2018 Aug 20.
Artigo em Inglês | MEDLINE | ID: mdl-30126356

RESUMO

BACKGROUND: Targeted resequencing with high-throughput sequencing (HTS) platforms can be used to efficiently interrogate the genomes of large numbers of individuals. A critical issue for research and applications using HTS data, especially from long-read platforms, is error in base calling arising from technological limits and bioinformatic algorithms. We found that the community standard long amplicon analysis (LAA) module from Pacific Biosciences is prone to substantial bioinformatic errors that raise concerns about findings based on this pipeline, prompting the need for a new method. RESULTS: A single molecule real-time (SMRT) sequencing-error correction and assembly pipeline, C3S-LAA, was developed for libraries of pooled amplicons. By uniquely leveraging the structure of SMRT sequence data (comprised of multiple low quality subreads from which higher quality circular consensus sequences are formed) to cluster raw reads, C3S-LAA produced accurate consensus sequences and assemblies of overlapping amplicons from single sample and multiplexed libraries. In contrast, despite read depths in excess of 100X per amplicon, the standard long amplicon analysis module from Pacific Biosciences generated unexpected numbers of amplicon sequences with substantial inaccuracies in the consensus sequences. A bootstrap analysis showed that the C3S-LAA pipeline per se was effective at removing bioinformatic sources of error, but in rare cases a read depth of nearly 400X was not sufficient to overcome minor but systematic errors inherent to amplification or sequencing. CONCLUSIONS: C3S-LAA uses a divide and conquer processing algorithm for SMRT amplicon-sequence data that generates accurate consensus sequences and local sequence assemblies. Solving the confounding bioinformatic source of error in LAA allowed for the identification of limited instances of errors due to DNA amplification or sequencing of homopolymeric nucleotide tracts. For research and development in genomics, C3S-LAA allows meaningful conclusions and biological inferences to be made from accurately polished sequence output.


Assuntos
Testes Genéticos/métodos , Genômica/métodos , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos
14.
BMC Res Notes ; 11(1): 452, 2018 Jul 09.
Artigo em Inglês | MEDLINE | ID: mdl-29986751

RESUMO

OBJECTIVES: Crop improvement relies on analysis of phenotypic, genotypic, and environmental data. Given large, well-integrated, multi-year datasets, diverse queries can be made: Which lines perform best in hot, dry environments? Which alleles of specific genes are required for optimal performance in each environment? Such datasets also can be leveraged to predict cultivar performance, even in uncharacterized environments. The maize Genomes to Fields (G2F) Initiative is a multi-institutional organization of scientists working to generate and analyze such datasets from existing, publicly available inbred lines and hybrids. G2F's genotype by environment project has released 2014 and 2015 datasets to the public, with 2016 and 2017 collected and soon to be made available. DATA DESCRIPTION: Datasets include DNA sequences; traditional phenotype descriptions, as well as detailed ear, cob, and kernel phenotypes quantified by image analysis; weather station measurements; and soil characterizations by site. Data are released as comma separated value spreadsheets accompanied by extensive README text descriptions. For genotypic and phenotypic data, both raw data and a version with outliers removed are reported. For weather data, two versions are reported: a full dataset calibrated against nearby National Weather Service sites and a second calibrated set with outliers and apparent artifacts removed.


Assuntos
Conjuntos de Dados como Assunto , Genótipo , Fenótipo , Zea mays/genética , Meio Ambiente , Genoma de Planta , Endogamia , Melhoramento Vegetal , Estações do Ano , Análise de Sequência de DNA
15.
Microsc Res Tech ; 81(2): 141-152, 2018 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-27342138

RESUMO

The study of phenotypic variation in plant pathogenesis provides fundamental information about the nature of disease resistance. Cellular mechanisms that alter pathogenesis can be elucidated with confocal microscopy; however, systematic phenotyping platforms-from sample processing to image analysis-to investigate this do not exist. We have developed a platform for 3D phenotyping of cellular features underlying variation in disease development by fluorescence-specific resolution of host and pathogen interactions across time (4D). A confocal microscopy phenotyping platform compatible with different maize-fungal pathosystems (fungi: Setosphaeria turcica, Cochliobolus heterostrophus, and Cercospora zeae-maydis) was developed. Protocols and techniques were standardized for sample fixation, optical clearing, species-specific combinatorial fluorescence staining, multisample imaging, and image processing for investigation at the macroscale. The sample preparation methods presented here overcome challenges to fluorescence imaging such as specimen thickness and topography as well as physiological characteristics of the samples such as tissue autofluorescence and presence of cuticle. The resulting imaging techniques provide interesting qualitative and quantitative information not possible with conventional light or electron 2D imaging. Microsc. Res. Tech., 81:141-152, 2018. © 2016 Wiley Periodicals, Inc.


Assuntos
Fungos/patogenicidade , Processamento de Imagem Assistida por Computador/métodos , Microscopia Confocal/métodos , Zea mays/microbiologia , Automação , Imagem Óptica/métodos , Doenças das Plantas/microbiologia , Manejo de Espécimes/métodos , Coloração e Rotulagem/métodos
16.
Nat Commun ; 8(1): 1348, 2017 11 07.
Artigo em Inglês | MEDLINE | ID: mdl-29116144

RESUMO

Remarkable productivity has been achieved in crop species through artificial selection and adaptation to modern agronomic practices. Whether intensive selection has changed the ability of improved cultivars to maintain high productivity across variable environments is unknown. Understanding the genetic control of phenotypic plasticity and genotype by environment (G × E) interaction will enhance crop performance predictions across diverse environments. Here we use data generated from the Genomes to Fields (G2F) Maize G × E project to assess the effect of selection on G × E variation and characterize polymorphisms associated with plasticity. Genomic regions putatively selected during modern temperate maize breeding explain less variability for yield G × E than unselected regions, indicating that improvement by breeding may have reduced G × E of modern temperate cultivars. Trends in genomic position of variants associated with stability reveal fewer genic associations and enrichment of variants 0-5000 base pairs upstream of genes, hypothetically due to control of plasticity by short-range regulatory elements.


Assuntos
Genoma de Planta , Polimorfismo de Nucleotídeo Único , Zea mays/fisiologia , Quimera , Frequência do Gene , Variação Genética , Fenótipo , Melhoramento Vegetal , Seleção Genética , Clima Tropical , Zea mays/genética
17.
G3 (Bethesda) ; 7(7): 2161-2170, 2017 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-28526729

RESUMO

High-throughput sequencing (HTS) of reduced representation genomic libraries has ushered in an era of genotyping-by-sequencing (GBS), where genome-wide genotype data can be obtained for nearly any species. However, there remains a need for imputation-free GBS methods for genotyping large samples taken from heterogeneous populations of heterozygous individuals. This requires that a number of issues encountered with GBS be considered, including the sequencing of nonoverlapping sets of loci across multiple GBS libraries, a common missing data problem that results in low call rates for markers per individual, and a tendency for applicability only in inbred line samples with sufficient linkage disequilibrium for accurate imputation. We addressed these issues while developing and validating a new, comprehensive platform for GBS. This study supports the notion that GBS can be tailored to particular aims, and using Zea mays our results indicate that large samples of unknown pedigree can be genotyped to obtain complete and accurate GBS data. Optimizing size selection to sequence a high proportion of shared loci among individuals in different libraries and using simple in silico filters, a GBS procedure was established that produces high call rates per marker (>85%) with accuracy exceeding 99.4%. Furthermore, by capitalizing on the sequence-read structure of GBS data (stacks of reads), a new tool for resolving local haplotypes and scoring phased genotypes was developed, a feature that is not available in many GBS pipelines. Using local haplotypes reduces the marker dimensionality of the genotype matrix while increasing the informativeness of the data. Phased GBS in maize also revealed the existence of reproducibly inaccurate (apparent accuracy) genotypes that were due to divergent copy number variants (CNVs) unobservable in the underlying single nucleotide polymorphism (SNP) data.


Assuntos
Dosagem de Genes , Loci Gênicos , Variação Genética , Desequilíbrio de Ligação , Zea mays/genética , Estudo de Associação Genômica Ampla
18.
Sci Rep ; 7: 44437, 2017 03 16.
Artigo em Inglês | MEDLINE | ID: mdl-28300202

RESUMO

Isolating and sequencing specific regions in a genome is a cornerstone of molecular biology. This has been facilitated by computationally encoding the thermodynamics of DNA hybridization for automated design of hybridization and priming oligonucleotides. However, the repetitive composition of genomes challenges the identification of target-specific oligonucleotides, which limits genetics and genomics research on many species. Here, a tool called ThermoAlign was developed that ensures the design of target-specific primer pairs for DNA amplification. This is achieved by evaluating the thermodynamics of hybridization for full-length oligonucleotide-template alignments - thermoalignments - across the genome to identify primers predicted to bind specifically to the target site. For amplification-based resequencing of regions that cannot be amplified by a single primer pair, a directed graph analysis method is used to identify minimum amplicon tiling paths. Laboratory validation by standard and long-range polymerase chain reaction and amplicon resequencing with maize, one of the most repetitive genomes sequenced to date (≈85% repeat content), demonstrated the specificity-by-design functionality of ThermoAlign. ThermoAlign is released under an open source license and bundled in a dependency-free container for wide distribution. It is anticipated that this tool will facilitate multiple applications in genetics and genomics and be useful in the workflow of high-throughput targeted resequencing studies.


Assuntos
Primers do DNA/metabolismo , Genoma de Planta , Hibridização de Ácido Nucleico/métodos , Reação em Cadeia da Polimerase/métodos , Análise de Sequência de DNA/métodos , Zea mays/genética , Sequência de Bases , Primers do DNA/síntese química , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único , Alinhamento de Sequência , Software , Termodinâmica
19.
PLoS One ; 12(1): e0168910, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28045987

RESUMO

Despite the reduction in the price of sequencing, it remains expensive to sequence and assemble whole, complex genomes of multiple samples for population studies, particularly for large genomes like those of many crop species. Enrichment of target genome regions coupled with next generation sequencing is a cost-effective strategy to obtain sequence information for loci of interest across many individuals, providing a less expensive approach to evaluating sequence variation at the population scale. Here we evaluate amplicon-based enrichment coupled with semiconductor sequencing on a validation set consisting of three maize inbred lines, two hybrids and 19 landrace accessions. We report the use of a multiplexed panel of 319 PCR assays that target 20 candidate loci associated with photoperiod sensitivity in maize while requiring 25 ng or less of starting DNA per sample. Enriched regions had an average on-target sequence read depth of 105 with 98% of the sequence data mapping to the maize 'B73' reference and 80% of the reads mapping to the target interval. Sequence reads were aligned to B73 and 1,486 and 1,244 variants were called using SAMtools and GATK, respectively. Of the variants called by both SAMtools and GATK, 30% were not previously reported in maize. Due to the high sequence read depth, heterozygote genotypes could be called with at least 92.5% accuracy in hybrid materials using GATK. The genetic data are congruent with previous reports of high total genetic diversity and substantial population differentiation among maize landraces. In conclusion, semiconductor sequencing of highly multiplexed PCR reactions is a cost-effective strategy for resequencing targeted genomic loci in diverse maize materials.


Assuntos
Flores/fisiologia , Genes de Plantas , Sequenciamento de Nucleotídeos em Larga Escala , Zea mays/genética , Algoritmos , Mapeamento Cromossômico , Biologia Computacional , Genômica , Genótipo , Heterozigoto , Reação em Cadeia da Polimerase , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA , Zea mays/fisiologia
20.
Plant Physiol ; 172(3): 1787-1803, 2016 11.
Artigo em Inglês | MEDLINE | ID: mdl-27670817

RESUMO

Physiological leaf spotting, or flecking, is a mild-lesion phenotype observed on the leaves of several commonly used maize (Zea mays) inbred lines and has been anecdotally linked to enhanced broad-spectrum disease resistance. Flecking was assessed in the maize nested association mapping (NAM) population, comprising 4,998 recombinant inbred lines from 25 biparental families, and in an association population, comprising 279 diverse maize inbreds. Joint family linkage analysis was conducted with 7,386 markers in the NAM population. Genome-wide association tests were performed with 26.5 million single-nucleotide polymorphisms (SNPs) in the NAM population and with 246,497 SNPs in the association population, resulting in the identification of 18 and three loci associated with variation in flecking, respectively. Many of the candidate genes colocalizing with associated SNPs are similar to genes that function in plant defense response via cell wall modification, salicylic acid- and jasmonic acid-dependent pathways, redox homeostasis, stress response, and vesicle trafficking/remodeling. Significant positive correlations were found between increased flecking, stronger defense response, increased disease resistance, and increased pest resistance. A nonlinear relationship with total kernel weight also was observed whereby lines with relatively high levels of flecking had, on average, lower total kernel weight. We present evidence suggesting that mild flecking could be used as a selection criterion for breeding programs trying to incorporate broad-spectrum disease resistance.


Assuntos
Resistência à Doença/genética , Doenças das Plantas/genética , Doenças das Plantas/imunologia , Folhas de Planta/genética , Zea mays/genética , Alelos , Mapeamento Cromossômico , Genética Populacional , Estudo de Associação Genômica Ampla , Endogamia , Luz , Fenótipo , Folhas de Planta/efeitos da radiação , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Espécies Reativas de Oxigênio/metabolismo , Sementes/genética , Zea mays/efeitos da radiação
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...