Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters










Database
Language
Publication year range
1.
Genome Biol ; 15(3): R59, 2014 Mar 04.
Article in English | MEDLINE | ID: mdl-24647006

ABSTRACT

BACKGROUND: The size and complexity of conifer genomes has, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination. RESULTS: We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole genome shotgun approach relying primarily on next generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly was used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp in length. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome. CONCLUSIONS: In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrates a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied.


Subject(s)
Contig Mapping/methods , Genome, Plant , Pinus taeda/genetics , Sequence Analysis, DNA/methods , DNA, Plant/genetics , Haploidy
2.
Genetics ; 196(3): 891-909, 2014 Mar.
Article in English | MEDLINE | ID: mdl-24653211

ABSTRACT

The largest genus in the conifer family Pinaceae is Pinus, with over 100 species. The size and complexity of their genomes (∼20-40 Gb, 2n = 24) have delayed the arrival of a well-annotated reference sequence. In this study, we present the annotation of the first whole-genome shotgun assembly of loblolly pine (Pinus taeda L.), which comprises 20.1 Gb of sequence. The MAKER-P annotation pipeline combined evidence-based alignments and ab initio predictions to generate 50,172 gene models, of which 15,653 are classified as high confidence. Clustering these gene models with 13 other plant species resulted in 20,646 gene families, of which 1554 are predicted to be unique to conifers. Among the conifer gene families, 159 are composed exclusively of loblolly pine members. The gene models for loblolly pine have the highest median and mean intron lengths of 24 fully sequenced plant genomes. Conifer genomes are full of repetitive DNA, with the most significant contributions from long-terminal-repeat retrotransposons. In depth analysis of the tandem and interspersed repetitive content yielded a combined estimate of 82%.


Subject(s)
Genome, Plant , Molecular Sequence Annotation/methods , Pinus taeda/genetics , DNA, Plant/analysis , Evolution, Molecular , Genes, Plant , Multigene Family , Phylogeny , Sequence Alignment
3.
Genetics ; 195(4): 1353-72, 2013 Dec.
Article in English | MEDLINE | ID: mdl-24121773

ABSTRACT

A primary goal of evolutionary genetics is to discover and explain the genetic basis of fitness-related traits and how this genetic basis evolves within natural populations. Unprecedented technological advances have fueled the discovery of genetic variants associated with ecologically relevant phenotypes in many different life forms, as well as the ability to scan genomes for deviations from selectively neutral models of evolution. Theoretically, the degree of overlap between lists of genomic regions identified using each approach is related to the genetic architecture of fitness-related traits and the strength and type of natural selection molding variation at these traits within natural populations. Here we address for the first time in a plant the degree of overlap between these lists, using patterns of nucleotide diversity and divergence for >7000 unique amplicons described from the extensive expressed sequence tag libraries generated for loblolly pine (Pinus taeda L.) in combination with the >1000 published genetic associations. We show that loci associated with phenotypic traits are distinct with regard to neutral expectations. Phenotypes measured at the whole plant level (e.g., disease resistance) exhibit an approximately twofold increase in the proportion of adaptive nonsynonymous substitutions over the genome-wide average. As expected for polygenic traits, these signals were apparent only when loci were considered at the level of functional sets. The ramifications of this result are discussed in light of the continued efforts to dissect the genetic basis of quantitative traits.


Subject(s)
Evolution, Molecular , Genes, Plant , Multifactorial Inheritance , Phenotype , Pinus taeda/genetics , Expressed Sequence Tags , Genetic Loci
4.
Evol Appl ; 5(7): 762-75, 2012 Nov.
Article in English | MEDLINE | ID: mdl-23144662

ABSTRACT

A candidate gene approach was used to identify levels of nucleotide diversity and to identify genes departing from neutral expectations in coniferous species of the Alpine European forest. Twelve samples were collected from four species that dominate montane and subalpine forests throughout Europe: Abies alba Mill, Larix decidua Mill, Pinus cembra L., and Pinus mugo Turra. A total of 800 genes, originally resequenced in Pinus taeda L., were resequenced across 12 independent trees for each of the four species. Genes were assigned to two categories, candidate and control, defined through homology-based searches to Arabidopsis. Estimates of nucleotide diversity per site varied greatly between polymorphic candidate genes (range: 0.0004-0.1295) and among species (range: 0.0024-0.0082), but were within the previously established ranges for conifers. Tests of neutrality using stringent significance thresholds, performed under the standard neutral model, revealed one to seven outlier loci for each species. Some of these outliers encode proteins that are involved with plant stress responses and form the basis for further evolutionary enquiries.

5.
Mol Ecol Resour ; 10(3): 542-5, 2010 May.
Article in English | MEDLINE | ID: mdl-21565054

ABSTRACT

Patterns of DNA sequence polymorphisms can be used to understand the processes of demography and adaptation within natural populations. High-throughput generation of DNA sequence data has historically been the bottleneck with respect to data processing and experimental inference. Advances in marker technologies have largely solved this problem. Currently, the limiting step is computational, with most molecular population genetic software allowing a gene-by-gene analysis through a graphical user interface. An easy-to-use analysis program that allows both high-throughput processing of multiple sequence alignments along with the flexibility to simulate data under complex demographic scenarios is currently lacking. We introduce a new program, named DnaSAM, which allows high-throughput estimation of DNA sequence diversity and neutrality statistics from experimental data along with the ability to test those statistics via Monte Carlo coalescent simulations. These simulations are conducted using the ms program, which is able to incorporate several genetic parameters (e.g. recombination) and demographic scenarios (e.g. population bottlenecks). The output is a set of diversity and neutrality statistics with associated probability values under a user-specified null model that are stored in easy to manipulate text file.

6.
Genetics ; 183(1): 289-98, 2009 Sep.
Article in English | MEDLINE | ID: mdl-19596906

ABSTRACT

Forest trees exhibit remarkable adaptations to their environments. The genetic basis for phenotypic adaptation to climatic gradients has been established through a long history of common garden, provenance, and genecological studies. The identities of genes underlying these traits, however, have remained elusive and thus so have the patterns of adaptive molecular diversity in forest tree genomes. Here, we report an analysis of diversity and divergence for a set of 121 cold-hardiness candidate genes in coastal Douglas fir (Pseudotsuga menziesii var. menziesii). Application of several different tests for neutrality, including those that incorporated demographic models, revealed signatures of selection consistent with selective sweeps at three to eight loci, depending upon the severity of a bottleneck event and the method used to detect selection. Given the high levels of recombination, these candidate genes are likely to be closely linked to the target of selection if not the genes themselves. Putative homologs in Arabidopsis act primarily to stabilize the plasma membrane and protect against denaturation of proteins at freezing temperatures. These results indicate that surveys of nucleotide diversity and divergence, when framed within the context of further association mapping experiments, will come full circle with respect to their utility in the dissection of complex phenotypic traits into their genetic components.


Subject(s)
Acclimatization/genetics , Cold Temperature , Genetic Loci , Polymorphism, Single Nucleotide/genetics , Pseudotsuga/genetics , Selection, Genetic , Chromosome Mapping , Genes, Plant , Genetic Drift , Genetic Loci/physiology , Genome, Plant , Geography , Linkage Disequilibrium , Polymorphism, Restriction Fragment Length , Pseudotsuga/physiology , Selection, Genetic/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...