Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 28
Filter
Add more filters










Publication year range
1.
Am J Bot ; : e16350, 2024 Jun 02.
Article in English | MEDLINE | ID: mdl-38825760

ABSTRACT

PREMISE: The Caryophyllaceae (the carnation family) have undergone multiple transitions into colder climates and convergence on cushion plant adaptation, indicating that they may provide a natural system for cold adaptation research. Previous research has suggested that putative ancient whole-genome duplications (WGDs) are correlated with niche shifts into colder climates across the Caryophyllales. Here, we explored the genomic changes potentially involved in one of these discovered shifts in the Caryophyllaceae. METHODS: We constructed a data set combining 26 newly generated transcriptomes with 45 published transcriptomes, including 11 cushion plant species across seven genera. With this data set, we inferred a dated phylogeny for the Caryophyllaceae and mapped ancient WGDs and gene duplications onto the phylogeny. We also examined functional groups enriched for gene duplications related to the climatic shift. RESULTS: The ASTRAL topology was mostly congruent with the current consensus of relationships within the family. We inferred 15 putative ancient WGDs in the family, including eight that have not been previously published. The oldest ancient WGD (ca. 64.4-56.7 million years ago), WGD1, was found to be associated with a shift into colder climates by previous research. Gene regions associated with ubiquitination were overrepresented in gene duplications retained after WGD1 and those convergently retained by cushion plants in Colobanthus and Eremogone, along with other functional annotations. CONCLUSIONS: Gene family expansions induced by ancient WGDs may have contributed to the shifts to cold climatic niches in the Caryophyllaceae. Transcriptomic data are crucial resources that help unravel heterogeneity in deep-time evolutionary patterns in plants.

2.
Plant Cell ; 36(7): 2491-2511, 2024 Jul 02.
Article in English | MEDLINE | ID: mdl-38598645

ABSTRACT

Nucleotide-binding domain and leucine-rich repeat (NLR) proteins are a prominent class of intracellular immune receptors in plants. However, our understanding of plant NLR structure and function is limited to the evolutionarily young flowering plant clade. Here, we describe an extended spectrum of NLR diversity across divergent plant lineages and demonstrate the structural and functional similarities of N-terminal domains that trigger immune responses. We show that the broadly distributed coiled-coil (CC) and toll/interleukin-1 receptor (TIR) domain families of nonflowering plants retain immune-related functions through translineage activation of cell death in the angiosperm Nicotiana benthamiana. We further examined a CC subfamily specific to nonflowering lineages and uncovered an essential N-terminal MAEPL motif that is functionally comparable with motifs in resistosome-forming CC-NLRs. Consistent with a conserved role in immunity, the ectopic activation of CCMAEPL in the nonflowering liverwort Marchantia polymorpha led to profound growth inhibition, defense gene activation, and signatures of cell death. Moreover, comparative transcriptomic analyses of CCMAEPL activity delineated a common CC-mediated immune program shared across evolutionarily divergent nonflowering and flowering plants. Collectively, our findings highlight the ancestral nature of NLR-mediated immunity during plant evolution that dates its origin to at least ∼500 million years ago.


Subject(s)
Marchantia , NLR Proteins , Nicotiana , Plant Proteins , NLR Proteins/genetics , NLR Proteins/metabolism , Nicotiana/genetics , Nicotiana/immunology , Nicotiana/metabolism , Plant Proteins/genetics , Plant Proteins/metabolism , Marchantia/genetics , Marchantia/immunology , Marchantia/metabolism , Protein Domains , Phylogeny , Plant Immunity/genetics , Receptors, Immunologic/genetics , Receptors, Immunologic/metabolism , Gene Expression Regulation, Plant
3.
Ann Bot ; 133(3): 459-472, 2024 Apr 10.
Article in English | MEDLINE | ID: mdl-38181407

ABSTRACT

BACKGROUND AND AIMS: Transcriptome sequencing is a cost-effective approach that allows researchers to study a broad range of questions. However, to preserve RNA for transcriptome sequencing, tissue is often kept in special conditions, such as immediate ultracold freezing. Here, we demonstrate that RNA can be obtained from 6-month-old, field-collected samples stored in silica gel at room temperature. Using these transcriptomes, we explore the evolutionary relationships of the genus Pitcairnia (Bromeliaceae) in the Dominican Republic and infer barriers to gene flow. METHODS: We extracted RNA from silica-dried leaf tissue from 19 Pitcairnia individuals collected across the Dominican Republic. We used a series of macro- and micro-evolutionary approaches to examine the relationships and patterns of gene flow among individuals. KEY RESULTS: We produced high-quality transcriptomes from silica-dried material and demonstrated that evolutionary relationships on the island match geography more closely than species delimitation methods. A population genetic examination indicates that a combination of ecological and geographical features presents barriers to gene flow in Pitcairnia. CONCLUSIONS: Transcriptomes can be obtained from silica-preserved tissue. The genetic diversity among Pitcairnia populations does not warrant classification as separate species, but the Dominican Republic contains several barriers to gene flow, notably the Cordillera Central mountain range.


Subject(s)
Gene Flow , Transcriptome , Humans , Transcriptome/genetics , Caribbean Region , Plant Leaves/genetics , RNA
4.
Front Plant Sci ; 14: 1125107, 2023.
Article in English | MEDLINE | ID: mdl-37063179

ABSTRACT

Chloroplasts and mitochondria each contain their own genomes, which have historically been and continue to be important sources of information for inferring the phylogenetic relationships among land plants. The organelles are predominantly inherited from the same parent, and therefore should exhibit phylogenetic concordance. In this study, we examine the mitochondrion and chloroplast genomes of 226 land plants to infer the degree of similarity between the organelles' evolutionary histories. Our results show largely concordant topologies are inferred between the organelles, aside from four well-supported conflicting relationships that warrant further investigation. Despite broad patterns of topological concordance, our findings suggest that the chloroplast and mitochondrial genomes evolved with significant differences in molecular evolution. The differences result in the genes from the chloroplast and the mitochondrion preferentially clustering with other genes from their respective organelles by a program that automates selection of evolutionary model partitions for sequence alignments. Further investigation showed that changes in compositional heterogeneity are not always uniform across divergences in the land plant tree of life. These results indicate that although the chloroplast and mitochondrial genomes have coexisted for over 1 billion years, phylogenetically, they are still evolving sufficiently independently to warrant separate models of evolution. As genome sequencing becomes more accessible, research into these organelles' evolution will continue revealing insight into the ancient cellular events that shaped not only their history, but the history of plants as a whole.

5.
Curr Biol ; 32(24): 5323-5334.e6, 2022 12 19.
Article in English | MEDLINE | ID: mdl-36423640

ABSTRACT

Plants combine both chemical and structural means to appear colorful. We now have an extensive understanding of the metabolic pathways used by flowering plants to synthesize pigments, but the mechanisms remain obscure whereby cells produce microscopic structures sufficiently regular to interfere with light and create an optical effect. Here, we combine transgenic approaches in a novel model system, Hibiscus trionum, with chemical analyses of the cuticle, both in transgenic lines and in different species of Hibiscus, to investigate the formation of a semi-ordered diffraction grating on the petal surface. We show that regulating both cuticle production and epidermal cell growth is insufficient to determine the type of cuticular pattern produced. Instead, the chemical composition of the cuticle plays a crucial role in restricting the formation of diffraction gratings to the pigmented region of the petal. This suggests that buckling, driven by spatiotemporal regulation of cuticle chemistry, could pattern the petal surface at the nanoscale.


Subject(s)
Flowers , Hibiscus , Flowers/physiology , Hibiscus/physiology , Models, Biological
6.
Essays Biochem ; 66(6): 753-768, 2022 Dec 08.
Article in English | MEDLINE | ID: mdl-36205404

ABSTRACT

Colourful spots, stripes and rings decorate the corolla of most flowering plants and fulfil important biotic and abiotic functions. Spatial differences in the pigmentation of epidermal cells can create these patterns. The last few years have yielded new data that have started to illuminate the mechanisms controlling the function, formation and evolution of petal patterns. These advances have broad impacts beyond the immediate field as pigmentation patterns are wonderful systems to explore multiscale biological problems: from understanding how cells make decisions at the microscale to examining the roots of biodiversity at the macroscale. These new results also reveal there is more to petal patterning than meets the eye, opening up a brand new area of investigation. In this mini-review, we summarise our current knowledge on the Eco-Evo-Devo of petal pigmentation patterns and discuss some of the most exciting yet unanswered questions that represent avenues for future research.

7.
Mol Biol Evol ; 39(3)2022 03 02.
Article in English | MEDLINE | ID: mdl-35212724

ABSTRACT

Dissecting the relationship between gene function and substitution rates is key to understanding genome-wide patterns of molecular evolution. Biochemical pathways provide powerful systems for investigating this relationship because the functional role of each gene is often well characterized. Here, we investigate the evolution of the flavonoid pigment pathway in the colorful Petunieae clade of the tomato family (Solanaceae). This pathway is broadly conserved in plants, both in terms of its structural elements and its MYB, basic helix-loop-helix, and WD40 transcriptional regulators, and its function has been extensively studied, particularly in model species of petunia. We built a phylotranscriptomic data set for 69 species of Petunieae to infer patterns of molecular evolution across pathway genes and across lineages. We found that transcription factors exhibit faster rates of molecular evolution (dN/dS) than their targets, with the highly specialized MYB genes evolving fastest. Using the largest comparative data set to date, we recovered little support for the hypothesis that upstream enzymes evolve slower than those occupying more downstream positions, although expression levels do predict molecular evolutionary rates. Although shifts in floral pigmentation were only weakly related to changes affecting coding regions, we found a strong relationship with the presence/absence patterns of MYB transcripts. Intensely pigmented species express all three main MYB anthocyanin activators in petals, whereas pale or white species express few or none. Our findings reinforce the notion that pathway regulators have a dynamic history, involving higher rates of molecular evolution than structural components, along with frequent changes in expression during color transitions.


Subject(s)
Flowers , Transcription Factors , Anthocyanins , Flavonoids/genetics , Flavonoids/metabolism , Flowers/genetics , Gene Expression Regulation, Plant , Pigmentation/genetics , Plant Proteins/genetics , Plant Proteins/metabolism , Plants, Genetically Modified/metabolism , Transcription Factors/metabolism
8.
Syst Biol ; 71(4): 943-958, 2022 06 16.
Article in English | MEDLINE | ID: mdl-34240209

ABSTRACT

Gene tree conflict is common and finding methods to analyze and alleviate the negative effects that conflict has on species tree analysis is a crucial part of phylogenomics. This study aims to expand the discussion of inferring species trees and molecular branch lengths when conflict is present. Conflict is typically examined in two ways: inferring its prevalence and inferring the influence of the individual genes (how strongly one gene supports any given topology compared to an alternative topology). Here, we examine a procedure for incorporating both conflict and the influence of genes in order to infer evolutionary relationships. All supported relationships in the gene trees are analyzed and the likelihood of the genes constrained to these relationships is summed to provide a likelihood for the relationship. Consensus tree assembly is conducted based on the sum of likelihoods for a given relationship and choosing relationships based on the most likely relationship assuming it does not conflict with a relationship that has a higher likelihood score. If it is not possible for all most likely relationships to be combined into a single bifurcating tree then multiple trees are produced and a consensus tree with a polytomy is created. This procedure allows for more influential genes to have a greater influence on an inferred relationship, does not assume conflict has arisen from any one source and does not force the data set to produce a single bifurcating tree. Using this approach, on three empirical data sets, we examine and discuss the relationship between influence and prevalence of gene tree conflict. We find that in one of the data sets, assembling a bifurcating consensus tree solely composed of the most likely relationships is impossible. To account for conflict in molecular rate analysis we also introduce a concordance-based approach to the summary and estimation of branch lengths suitable for downstream comparative analyses. We demonstrate through simulation that even under high levels of stochastic conflict, the mean and median of the concordant rates recapitulate the true molecular rate better than using a supermatrix approach. Using a large phylogenomic data set, we examine rate heterogeneity across concordant genes with a focus on the branch subtending crown angiosperms. Notably, we find highly variable rates of evolution along the branch subtending crown angiosperms. The approaches outlined here have several limitations, but they also represent some alternative methods for harnessing the complexity of phylogenomic data sets and enrich our inferences of both species relationships and evolutionary processes.[Branch length estimation; consensus tree; gene tree conflict; gene tree filtering; phylogenetics; phylogenomics.].


Subject(s)
Magnoliopsida , Phylogeny
9.
Nature ; 602(7896): 280-286, 2022 02.
Article in English | MEDLINE | ID: mdl-34937943

ABSTRACT

Grafting is possible in both animals and plants. Although in animals the process requires surgery and is often associated with rejection of non-self, in plants grafting is widespread, and has been used since antiquity for crop improvement1. However, in the monocotyledons, which represent the second largest group of terrestrial plants and include many staple crops, the absence of vascular cambium is thought to preclude grafting2. Here we show that the embryonic hypocotyl allows intra- and inter-specific grafting in all three monocotyledon groups: the commelinids, lilioids and alismatids. We show functional graft unions through histology, application of exogenous fluorescent dyes, complementation assays for movement of endogenous hormones, and growth of plants to maturity. Expression profiling identifies genes that unify the molecular response associated with grafting in monocotyledons and dicotyledons, but also gene families that have not previously been associated with tissue union. Fusion of susceptible wheat scions to oat rootstocks confers resistance to the soil-borne pathogen Gaeumannomyces graminis. Collectively, these data overturn the consensus that monocotyledons cannot form graft unions, and identify the hypocotyl (mesocotyl in grasses) as a meristematic tissue that allows this process. We conclude that graft compatibility is a shared ability among seed-bearing plants.


Subject(s)
Avena , Plant Roots , Plant Shoots , Transplants , Triticum , Ascomycota/pathogenicity , Avena/embryology , Avena/microbiology , Hypocotyl , Meristem , Plant Roots/embryology , Plant Roots/microbiology , Plant Shoots/embryology , Plant Shoots/microbiology , Triticum/embryology , Triticum/microbiology
10.
Mol Biol Evol ; 37(11): 3380-3388, 2020 11 01.
Article in English | MEDLINE | ID: mdl-32658966

ABSTRACT

Most phylogenetic analyses assume that a single evolutionary history underlies one gene. However, both biological processes and errors can cause intragenic conflict. The extent to which this conflict is present in empirical data sets is not well documented, but if common, could have far-reaching implications for phylogenetic analyses. We examined several large phylogenomic data sets from diverse taxa using a fast and simple method to identify well-supported intragenic conflict. We found conflict to be highly variable between data sets, from 1% to >92% of genes investigated. We analyzed four exemplar genes in detail and analyzed simulated data under several scenarios. Our results suggest that alignment error may be one major source of conflict, but other conflicts remain unexplained and may represent biological signal or other errors. Whether as part of data analysis pipelines or to explore biologically processes, analyses of within-gene phylogenetic signal should become common.


Subject(s)
Datasets as Topic , Models, Genetic , Phylogeny , Animals , Computer Simulation , Insecta/genetics , Mammals/genetics
11.
Am J Bot ; 107(5): 773-789, 2020 05.
Article in English | MEDLINE | ID: mdl-32350864

ABSTRACT

PREMISE: Large genomic data sets offer the promise of resolving historically recalcitrant species relationships. However, different methodologies can yield conflicting results, especially when clades have experienced ancient, rapid diversification. Here, we analyzed the ancient radiation of Ericales and explored sources of uncertainty related to species tree inference, conflicting gene tree signal, and the inferred placement of gene and genome duplications. METHODS: We used a hierarchical clustering approach, with tree-based homology and orthology detection, to generate six filtered phylogenomic matrices consisting of data from 97 transcriptomes and genomes. Support for species relationships was inferred from multiple lines of evidence including shared gene duplications, gene tree conflict, gene-wise edge-based analyses, concatenation, and coalescent-based methods, and is summarized in a consensus framework. RESULTS: Our consensus approach supported a topology largely concordant with previous studies, but suggests that the data are not capable of resolving several ancient relationships because of lack of informative characters, sensitivity to methodology, and extensive gene tree conflict correlated with paleopolyploidy. We found evidence of a whole-genome duplication before the radiation of all or most ericalean families, and demonstrate that tree topology and heterogeneous evolutionary rates affect the inferred placement of genome duplications. CONCLUSIONS: We provide several hypotheses regarding the history of Ericales, and confidently resolve most nodes, but demonstrate that a series of ancient divergences are unresolvable with these data. Whether paleopolyploidy is a major source of the observed phylogenetic conflict warrants further investigation.


Subject(s)
Ericales , Biological Evolution , Consensus , Genome , Phylogeny
12.
Syst Biol ; 69(3): 579-592, 2020 05 01.
Article in English | MEDLINE | ID: mdl-31747023

ABSTRACT

Studies have demonstrated that pervasive gene tree conflict underlies several important phylogenetic relationships where different species tree methods produce conflicting results. Here, we present a means of dissecting the phylogenetic signal for alternative resolutions within a data set in order to resolve recalcitrant relationships and, importantly, identify what the data set is unable to resolve. These procedures extend upon methods for isolating conflict and concordance involving specific candidate relationships and can be used to identify systematic error and disambiguate sources of conflict among species tree inference methods. We demonstrate these on a large phylogenomic plant data set. Our results support the placement of Amborella as sister to the remaining extant angiosperms, Gnetales as sister to pines, and the monophyly of extant gymnosperms. Several other contentious relationships, including the resolution of relationships within the bryophytes and the eudicots, remain uncertain given the low number of supporting gene trees. To address whether concatenation of filtered genes amplified phylogenetic signal for relationships, we implemented a combinatorial heuristic to test combinability of genes. We found that nested conflicts limited the ability of data filtering methods to fully ameliorate conflicting signal amongst gene trees. These analyses confirmed that the underlying conflicting signal does not support broad concatenation of genes. Our approach provides a means of dissecting a specific data set to address deep phylogenetic relationships while also identifying the inferential boundaries of the data set. [Angiosperms; coalescent; gene-tree conflict; genomics; phylogenetics; phylogenomics.].


Subject(s)
Classification/methods , Phylogeny , Plants/classification , Genes, Plant/genetics , Plants/genetics
13.
PeerJ ; 7: e7747, 2019.
Article in English | MEDLINE | ID: mdl-31579615

ABSTRACT

Evolutionary relationships among plants have been inferred primarily using chloroplast data. To date, no study has comprehensively examined the plastome for gene tree conflict. Using a broad sampling of angiosperm plastomes, we characterize gene tree conflict among plastid genes at various time scales and explore correlates to conflict (e.g., evolutionary rate, gene length, molecule type). We uncover notable gene tree conflict against a backdrop of largely uninformative genes. We find alignment length and tree length are strong predictors of concordance, and that nucleotides outperform amino acids. Of the most commonly used markers, matK, greatly outperforms rbcL; however, the rarely used gene rpoC2 is the top-performing gene in every analysis. We find that rpoC2 reconstructs angiosperm phylogeny as well as the entire concatenated set of protein-coding chloroplast genes. Our results suggest that longer genes are superior for phylogeny reconstruction. The alleviation of some conflict through the use of nucleotides suggests that stochastic and systematic error is likely the root of most of the observed conflict, but further research on biological conflict within plastome is warranted given documented cases of heteroplasmic recombination. We suggest that researchers should filter genes for topological concordance when performing downstream comparative analyses on phylogenetic data, even when using chloroplast genomes.

14.
Am Nat ; 193(4): 530-544, 2019 04.
Article in English | MEDLINE | ID: mdl-30912965

ABSTRACT

Performance curves are valuable tools for quantifying the fundamental niches of organisms and testing hypotheses about evolution, life-history trade-offs, and the drivers of variation in species' distribution patterns. Here, we present a novel Bayesian method for characterizing performance curves that facilitates comparisons among species. We then use this model to quantify and compare the hydrological performance curves of 14 different taxa in the genus Lasthenia, an ecologically diverse clade of plants that collectively occupy a variety of habitats with unique hydrological features, including seasonally flooded wetlands called vernal pools. We conducted a growth chamber experiment to measure each taxon's fitness across five hydrological treatments that ranged from severe drought to extended flooding, and we identified differences in hydrological performance curves that explain their associations with vernal pool and terrestrial habitats. Our analysis revealed that the distribution of vernal pool taxa in the field does not reflect their optimal hydrological environments: all taxa, regardless of habitat affinity, have highest fitness under similar hydrological conditions of saturated soil without submergence. We also found that a taxon's relative position across flood gradients within vernal pools is best predicted by the height of its performance curve. These results demonstrate the utility of our approach for generating insights into when and how performance curves evolve among taxa as they diversify into distinct environments. To facilitate its use, the modeling framework has been developed into an R package.


Subject(s)
Asteraceae/physiology , Ecosystem , Genetic Fitness , Models, Biological , Water/physiology , Bayes Theorem , Biological Evolution , Droughts , Floods , Software
15.
Mol Biol Evol ; 36(1): 112-126, 2019 01 01.
Article in English | MEDLINE | ID: mdl-30371871

ABSTRACT

Several plant lineages have evolved adaptations that allow survival in extreme and harsh environments including many families within the plant clade Portulacineae (Caryophyllales) such as the Cactaceae, Didiereaceae, and Montiaceae. Here, using newly generated transcriptomic data, we reconstructed the phylogeny of Portulacineae and examined potential correlates between molecular evolution and adaptation to harsh environments. Our phylogenetic results were largely congruent with previous analyses, but we identified several early diverging nodes characterized by extensive gene tree conflict. For particularly contentious nodes, we present detailed information about the phylogenetic signal for alternative relationships. We also analyzed the frequency of gene duplications, confirmed previously identified whole genome duplications (WGD), and proposed a previously unidentified WGD event within the Didiereaceae. We found that the WGD events were typically associated with shifts in climatic niche but did not find a direct association with WGDs and diversification rate shifts. Diversification shifts occurred within the Portulacaceae, Cactaceae, and Anacampserotaceae, and whereas these did not experience WGDs, the Cactaceae experienced extensive gene duplications. We examined gene family expansion and molecular evolutionary patterns with a focus on genes associated with environmental stress responses and found evidence for significant gene family expansion in genes with stress adaptation and clades found in extreme environments. These results provide important directions for further and deeper examination of the potential links between molecular evolutionary patterns and adaptation to harsh environments.


Subject(s)
Adaptation, Biological , Biological Evolution , Caryophyllales/genetics , Cold Temperature , Droughts , Multigene Family , Polyploidy
16.
Syst Biol ; 67(5): 916-924, 2018 09 01.
Article in English | MEDLINE | ID: mdl-29893968

ABSTRACT

Recent studies have demonstrated that conflict is common among gene trees in phylogenomic studies, and that less than one percent of genes may ultimately drive species tree inference in supermatrix analyses. Herein, we examined two data sets where supermatrix and coalescent-based species trees conflict. We identified two highly influential "outlier" genes in each data set. When removed from each data set, the inferred supermatrix trees matched the topologies obtained from coalescent analyses. We also demonstrate that, while the outlier genes in the vertebrate data set have been shown in a previous study to be the result of errors in orthology detection, the outlier genes from a plant data set did not exhibit any obvious systematic error, and therefore, may be the result of some biological process yet to be determined. While topological comparisons among a small set of alternate topologies can be helpful in discovering outlier genes, they can be limited in several ways, such as assuming all genes share the same topology. Coalescent species tree methods relax this assumption but do not explicitly facilitate the examination of specific edges. Coalescent methods often also assume that conflict is the result of incomplete lineage sorting. Herein, we explored a framework that allows for quickly examining alternative edges and support for large phylogenomic data sets that does not assume a single topology for all genes. For both data sets, these analyses provided detailed results confirming the support for coalescent-based topologies. This framework suggests that we can improve our understanding of the underlying signal in phylogenomic data sets by asking more targeted edge-based questions.


Subject(s)
Caryophyllales/classification , Genomics , Phylogeny , Vertebrates/classification , Animals , Caryophyllales/genetics , Models, Genetic , Vertebrates/genetics
17.
PLoS One ; 13(5): e0197433, 2018.
Article in English | MEDLINE | ID: mdl-29772020

ABSTRACT

Phylogenomic datasets have been successfully used to address questions involving evolutionary relationships, patterns of genome structure, signatures of selection, and gene and genome duplications. However, despite the recent explosion in genomic and transcriptomic data, the utility of these data sources for efficient divergence-time inference remains unexamined. Phylogenomic datasets pose two distinct problems for divergence-time estimation: (i) the volume of data makes inference of the entire dataset intractable, and (ii) the extent of underlying topological and rate heterogeneity across genes makes model mis-specification a real concern. "Gene shopping", wherein a phylogenomic dataset is winnowed to a set of genes with desirable properties, represents an alternative approach that holds promise in alleviating these issues. We implemented an approach for phylogenomic datasets (available in SortaDate) that filters genes by three criteria: (i) clock-likeness, (ii) reasonable tree length (i.e., discernible information content), and (iii) least topological conflict with a focal species tree (presumed to have already been inferred). Such a winnowing procedure ensures that errors associated with model (both clock and topology) mis-specification are minimized, therefore reducing error in divergence-time estimation. We demonstrated the efficacy of this approach through simulation and applied it to published animal (Aves, Diplopoda, and Hymenoptera) and plant (carnivorous Caryophyllales, broad Caryophyllales, and Vitales) phylogenomic datasets. By quantifying rate heterogeneity across both genes and lineages we found that every empirical dataset examined included genes with clock-like, or nearly clock-like, behavior. Moreover, many datasets had genes that were clock-like, exhibited reasonable evolutionary rates, and were mostly compatible with the species tree. We identified overlap in age estimates when analyzing these filtered genes under strict clock and uncorrelated lognormal (UCLN) models. However, this overlap was often due to imprecise estimates from the UCLN model. We find that "gene shopping" can be an efficient approach to divergence-time inference for phylogenomic datasets that may otherwise be characterized by extensive gene tree heterogeneity.


Subject(s)
Genomics/methods , Animals , Evolution, Molecular , Humans , Models, Genetic , Phylogeny
18.
Am J Bot ; 105(3): 446-462, 2018 03.
Article in English | MEDLINE | ID: mdl-29738076

ABSTRACT

PREMISE OF THE STUDY: The Caryophyllales contain ~12,500 species and are known for their cosmopolitan distribution, convergence of trait evolution, and extreme adaptations. Some relationships within the Caryophyllales, like those of many large plant clades, remain unclear, and phylogenetic studies often recover alternative hypotheses. We explore the utility of broad and dense transcriptome sampling across the order for resolving evolutionary relationships in Caryophyllales. METHODS: We generated 84 transcriptomes and combined these with 224 publicly available transcriptomes to perform a phylogenomic analysis of Caryophyllales. To overcome the computational challenge of ortholog detection in such a large data set, we developed an approach for clustering gene families that allowed us to analyze >300 transcriptomes and genomes. We then inferred the species relationships using multiple methods and performed gene-tree conflict analyses. KEY RESULTS: Our phylogenetic analyses resolved many clades with strong support, but also showed significant gene-tree discordance. This discordance is not only a common feature of phylogenomic studies, but also represents an opportunity to understand processes that have structured phylogenies. We also found taxon sampling influences species-tree inference, highlighting the importance of more focused studies with additional taxon sampling. CONCLUSIONS: Transcriptomes are useful both for species-tree inference and for uncovering evolutionary complexity within lineages. Through analyses of gene-tree conflict and multiple methods of species-tree inference, we demonstrate that phylogenomic data can provide unparalleled insight into the evolutionary history of Caryophyllales. We also discuss a method for overcoming computational challenges associated with homolog clustering in large data sets.


Subject(s)
Biological Evolution , Caryophyllales/genetics , Genes, Plant , Genomics/methods , Models, Genetic , Phylogeny , Transcriptome , Cactaceae/genetics , Carnivory , Cluster Analysis , Evolution, Molecular , Genome, Plant , Sequence Analysis, DNA , Sequence Homology , Species Specificity
19.
Am J Bot ; 105(3): 385-403, 2018 03.
Article in English | MEDLINE | ID: mdl-29746719

ABSTRACT

PREMISE OF THE STUDY: Phylogenetic support has been difficult to evaluate within the green plant tree of life partly due to a lack of specificity between conflicted versus poorly informed branches. As data sets continue to expand in both breadth and depth, new support measures are needed that are more efficient and informative. METHODS: We describe the Quartet Sampling (QS) method, a quartet-based evaluation system that synthesizes several phylogenetic and genomic analytical approaches. QS characterizes discordance in large-sparse and genome-wide data sets, overcoming issues of alignment sparsity and distinguishing strong conflict from weak support. We tested QS with simulations and recent plant phylogenies inferred from variously sized data sets. KEY RESULTS: QS scores demonstrated convergence with increasing replicates and were not strongly affected by branch depth. Patterns of QS support from different phylogenies led to a coherent understanding of ancestral branches defining key disagreements, including the relationships of Ginkgo to cycads, magnoliids to monocots and eudicots, and mosses to liverworts. The relationships of ANA-grade angiosperms (Amborella, Nymphaeales, Austrobaileyales), major monocot groups, bryophytes, and fern families are likely highly discordant in their evolutionary histories, rather than poorly informed. QS can also detect discordance due to introgression in phylogenomic data. CONCLUSIONS: Quartet Sampling is an efficient synthesis of phylogenetic tests that offers more comprehensive and specific information on branch support than conventional measures. The QS method corroborates growing evidence that phylogenomic investigations that incorporate discordance testing are warranted when reconstructing complex evolutionary histories, in particular those surrounding ANA-grade, monocots, and nonvascular plants.


Subject(s)
Biological Evolution , DNA, Plant/analysis , Genome, Plant , Genomics/methods , Phylogeny , Viridiplantae/genetics , Bryophyta/genetics , Computer Simulation , Cycadopsida/genetics , Ferns/genetics , Ginkgo biloba/genetics , Hepatophyta/genetics , Magnoliopsida/genetics , Reproducibility of Results
20.
New Phytol ; 217(2): 855-870, 2018 01.
Article in English | MEDLINE | ID: mdl-28944472

ABSTRACT

Studies of the macroevolutionary legacy of polyploidy are limited by an incomplete sampling of these events across the tree of life. To better locate and understand these events, we need comprehensive taxonomic sampling as well as homology inference methods that accurately reconstruct the frequency and location of gene duplications. We assembled a data set of transcriptomes and genomes from 168 species in Caryophyllales, of which 43 transcriptomes were newly generated for this study, representing one of the most densely sampled genomic-scale data sets available. We carried out phylogenomic analyses using a modified phylome strategy to reconstruct the species tree. We mapped the phylogenetic distribution of polyploidy events by both tree-based and distance-based methods, and explicitly tested scenarios for allopolyploidy. We identified 26 ancient and more recent polyploidy events distributed throughout Caryophyllales. Two of these events were inferred to be allopolyploidy. Through dense phylogenomic sampling, we show the propensity of polyploidy throughout the evolutionary history of Caryophyllales. We also provide a framework for utilizing transcriptome data to detect allopolyploidy, which is important as it may have different macroevolutionary implications compared with autopolyploidy.


Subject(s)
Caryophyllales/genetics , Polyploidy , Transcriptome/genetics , Ecosystem , Likelihood Functions , Phylogeny , Species Specificity
SELECTION OF CITATIONS
SEARCH DETAIL
...