Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 12 de 12
Filter
Add more filters










Publication year range
1.
Mol Biol Evol ; 31(1): 177-83, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24136916

ABSTRACT

The phylogenetic branching order of the green algal groups that gave rise to land plants remains uncertain despite its fundamental importance to understanding plant evolution. Previous studies have demonstrated that land plants evolved from streptophyte algae, but different lineages of streptophytes have been suggested to be the sister group of land plants. To better understand the evolutionary history of land plants and to determine the potential effects of "long-branch attraction" in phylogenetic reconstruction, we analyzed a chloroplast genome data set including three new chloroplast genomes from streptophyte algae: Coleochaetae orbicularis (Coleochaetales), Nitella hookeri (Charales), and Spirogyra communis (Zygnematales). We further applied a site pattern sorting method together with site- and time-heterogeneous models to investigate the branching order among streptophytes and land plants. Our chloroplast phylogenomic analyses support previous hypotheses based on nuclear data in placing Zygnematales alone, or a clade consisting of Coleochaetales plus Zygnematales, as the closest living relatives of land plants.


Subject(s)
Chlorophyta/genetics , Embryophyta/genetics , Genome, Chloroplast , Biological Evolution , Chlorophyta/classification , DNA, Algal/genetics , DNA, Chloroplast/genetics , Embryophyta/classification , Phylogeny , Sequence Analysis, DNA
2.
Genome Biol Evol ; 5(1): 31-44, 2013.
Article in English | MEDLINE | ID: mdl-23221676

ABSTRACT

Cyanobacteria forged two major evolutionary transitions with the invention of oxygenic photosynthesis and the bestowal of photosynthetic lifestyle upon eukaryotes through endosymbiosis. Information germane to understanding those transitions is imprinted in cyanobacterial genomes, but deciphering it is complicated by lateral gene transfer (LGT). Here, we report genome sequences for the morphologically most complex true-branching cyanobacteria, and for Scytonema hofmanni PCC 7110, which with 12,356 proteins is the most gene-rich prokaryote currently known. We investigated components of cyanobacterial evolution that have been vertically inherited, horizontally transferred, and donated to eukaryotes at plastid origin. The vertical component indicates a freshwater origin for water-splitting photosynthesis. Networks of the horizontal component reveal that 60% of cyanobacterial gene families have been affected by LGT. Plant nuclear genes acquired from cyanobacteria define a lower bound frequency of 611 multigene families that, in turn, specify diazotrophic cyanobacterial lineages as having a gene collection most similar to that possessed by the plastid ancestor.


Subject(s)
Cyanobacteria/genetics , Evolution, Molecular , Genome, Bacterial , Genome, Plastid , Photosynthesis/genetics , Cyanobacteria/metabolism , Ecosystem , Fresh Water , Gene Transfer, Horizontal , Multigene Family , Phylogeny , Plants/genetics , Symbiosis/genetics , Water/metabolism
3.
Syst Biol ; 62(1): 50-61, 2013 Jan 01.
Article in English | MEDLINE | ID: mdl-22851550

ABSTRACT

Correct rooting of the angiosperm radiation is both challenging and necessary for understanding the origins and evolution of physiological and phenotypic traits in flowering plants. The problem is known to be difficult due to the large genetic distance separating flowering plants from other seed plants and the sparse taxon sampling among basal angiosperms. Here, we provide further evidence for concern over substitution model misspecification in analyses of chloroplast DNA sequences. We show that support for Amborella as the sole representative of the most basal angiosperm lineage is founded on sequence site patterns poorly described by time-reversible substitution models. Improving the fit between sequence data and substitution model identifies Trithuria, Nymphaeaceae, and Amborella as surviving relatives of the most basal lineage of flowering plants. This finding indicates that aquatic and herbaceous species dominate the earliest extant lineage of flowering plants. [; ; ; ; ; .].


Subject(s)
Magnoliopsida/classification , Magnoliopsida/genetics , Phylogeny , DNA, Chloroplast/genetics , Genetic Heterogeneity , Models, Genetic , Sequence Alignment , Tracheophyta/classification , Tracheophyta/genetics
4.
Plant J ; 71(4): 615-26, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22469001

ABSTRACT

Mitochondrial genomes of spermatophytes are the largest of all organellar genomes. Their large size has been attributed to various factors; however, the relative contribution of these factors to mitochondrial DNA (mtDNA) expansion remains undetermined. We estimated their relative contribution in Malus domestica (apple). The mitochondrial genome of apple has a size of 396 947 bp and a one to nine ratio of coding to non-coding DNA, close to the corresponding average values for angiosperms. We determined that 71.5% of the apple mtDNA sequence was highly similar to sequences of its nuclear DNA. Using nuclear gene exons, nuclear transposable elements and chloroplast DNA as markers of promiscuous DNA content in mtDNA, we estimated that approximately 20% of the apple mtDNA consisted of DNA sequences imported from other cell compartments, mostly from the nucleus. Similar marker-based estimates of promiscuous DNA content in the mitochondrial genomes of other species ranged between 21.2 and 25.3% of the total mtDNA length for grape, between 23.1 and 38.6% for rice, and between 47.1 and 78.4% for maize. All these estimates are conservative, because they underestimate the import of non-functional DNA. We propose that the import of promiscuous DNA is a core mechanism for mtDNA size expansion in seed plants. In apple, maize and grape this mechanism contributed far more to genome expansion than did homologous recombination. In rice the estimated contribution of both mechanisms was found to be similar.


Subject(s)
Genome, Mitochondrial , Genome, Plant , Malus/genetics , Cell Nucleus/genetics , DNA, Chloroplast , DNA, Mitochondrial , Evolution, Molecular , Exons , Magnoliopsida/genetics , Molecular Sequence Data , Plant Viruses/genetics , Protein Biosynthesis , Repetitive Sequences, Nucleic Acid , Seeds/genetics
5.
Genome Biol Evol ; 3: 1340-8, 2011.
Article in English | MEDLINE | ID: mdl-22016337

ABSTRACT

Resolving the closest relatives of Gnetales has been an enigmatic problem in seed plant phylogeny. The problem is known to be difficult because of the extent of divergence between this diverse group of gymnosperms and their closest phylogenetic relatives. Here, we investigate the evolutionary properties of conifer chloroplast DNA sequences. To improve taxon sampling of Cupressophyta (non-Pinaceae conifers), we report sequences from three new chloroplast (cp) genomes of Southern Hemisphere conifers. We have applied a site pattern sorting criterion to study compositional heterogeneity, heterotachy, and the fit of conifer chloroplast genome sequences to a general time reversible + G substitution model. We show that non-time reversible properties of aligned sequence positions in the chloroplast genomes of Gnetales mislead phylogenetic reconstruction of these seed plants. When 2,250 of the most varied sites in our concatenated alignment are excluded, phylogenetic analyses favor a close evolutionary relationship between the Gnetales and Pinaceae-the Gnepine hypothesis. Our analytical protocol provides a useful approach for evaluating the robustness of phylogenomic inferences. Our findings highlight the importance of goodness of fit between substitution model and data for understanding seed plant phylogeny.


Subject(s)
Genome, Chloroplast , Gnetophyta/classification , Phylogeny , Seeds/genetics , Tracheophyta/classification , DNA, Chloroplast/genetics , Gnetophyta/genetics , Models, Genetic , Tracheophyta/genetics
6.
J Mol Evol ; 71(5-6): 319-31, 2010 Dec.
Article in English | MEDLINE | ID: mdl-20976444

ABSTRACT

Noisy data, especially in combination with misalignment and model misspecification can have an adverse effect on phylogeny reconstruction; however, effective methods to identify such data are few. One particularly important class of noisy data is saturated positions. To avoid potential errors related to saturation in phylogenomic analyses, we present an automated procedure involving the step-wise removal of the most variable positions in a given data set coupled with a stopping criterion derived from correlation analyses of pairwise ML distances calculated from the deleted (saturated) and the remaining (conserved) subsets of the alignment. Through a comparison with existing methods, we demonstrate both the effectiveness of our proposed procedure for identifying noisy data and the effect of the removal of such data using a well-publicized case study involving placental mammals. At the least, our procedure will identify data sets requiring greater data exploration, and we recommend its use to investigate the effect on phylogenetic analyses of removing subsets of variable positions exhibiting weak or no correlation to the rest of the alignment. However, we would argue that this procedure, by identifying and removing noisy data, facilitates the construction of more accurate phylogenies by, for example, ameliorating potential long-branch attraction artefacts.


Subject(s)
Artifacts , Databases, Genetic , Genomics/methods , Phylogeny , Animals , Automation , Conserved Sequence/genetics , Likelihood Functions , Mammals , Models, Genetic
7.
J Mol Evol ; 68(3): 197-204, 2009 Mar.
Article in English | MEDLINE | ID: mdl-19247564

ABSTRACT

It is widely appreciated that noisy, highly variable data can impede phylogeney reconstruction. Researchers have for a long time omitted problematic data from phylogenetic analyses, such as the third-codon positions and variable regions. In the analyses of the phylogenetic relations of the angiosperms; however, inclusion of complete gene sequences into genomic-scale alignments has become a common practice. Here we demonstrate that this practice can be misleading. We show that support of the basal-most position of Amborella trichopoda among the angiosperms in the chloroplast genomic data is based only on a tiny subset (< 1% of the total alignment length) of the most variable positions in alignment, exhibiting mean maximum likelihood (ML) distance among the angiosperm operational taxonomic units (OTUs) approximately 36 substitutions/site. Exclusion of these positions leads to disappearance of the basal Amborella branch. Likewise, the recently reported sister-group relationship of Ceratophyllum to the eudicots is based on the presence of 2% of the most variable positions in the genomic alignment, exhibiting, on average, 20 substitutions/site in comparison among the angiosperm OTUs. These observations highlight a need for excluding a certain proportion of saturated positions in alignment from phylogenomic analyses.


Subject(s)
Chloroplasts/genetics , Genome, Plastid , Magnoliopsida/genetics , Phylogeny , Data Interpretation, Statistical , Databases, Genetic , Evolution, Molecular , Genes, rRNA , Models, Genetic , Plant Proteins/genetics , RNA, Plant/genetics , RNA, Transfer/genetics , Sequence Alignment
8.
Mol Biol Evol ; 26(1): 99-110, 2009 Jan.
Article in English | MEDLINE | ID: mdl-18922764

ABSTRACT

The mitochondrial genome of grape (Vitis vinifera), the largest organelle genome sequenced so far, is presented. The genome is 773,279 nt long and has the highest coding capacity among known angiosperm mitochondrial DNAs (mtDNAs). The proportion of promiscuous DNA of plastid origin in the genome is also the largest ever reported for an angiosperm mtDNA, both in absolute and relative terms. In all, 42.4% of chloroplast genome of Vitis has been incorporated into its mitochondrial genome. In order to test if horizontal gene transfer (HGT) has also contributed to the gene content of the grape mtDNA, we built phylogenetic trees with the coding sequences of mitochondrial genes of grape and their homologs from plant mitochondrial genomes. Many incongruent gene tree topologies were obtained. However, the extent of incongruence between these gene trees is not significantly greater than that observed among optimal trees for chloroplast genes, the common ancestry of which has never been in doubt. In both cases, we attribute this incongruence to artifacts of tree reconstruction, insufficient numbers of characters, and gene paralogy. This finding leads us to question the recent phylogenetic interpretation of Bergthorsson et al. (2003, 2004) and Richardson and Palmer (2007) that rampant HGT into the mtDNA of Amborella best explains phylogenetic incongruence between mitochondrial gene trees for angiosperms. The only evidence for HGT into the Vitis mtDNA found involves fragments of two coding sequences stemming from two closteroviruses that cause the leaf roll disease of this plant. We also report that analysis of sequences shared by both chloroplast and mitochondrial genomes provides evidence for a previously unknown gene transfer route from the mitochondrion to the chloroplast.


Subject(s)
DNA, Mitochondrial/genetics , Evolution, Molecular , Vitis/genetics , Chloroplasts/genetics , DNA, Chloroplast/genetics , Gene Transfer, Horizontal , Mitochondria/genetics , Mutagenesis, Insertional , Protein Biosynthesis , Vitis/cytology
9.
Gene ; 381: 81-91, 2006 Oct 15.
Article in English | MEDLINE | ID: mdl-16959440

ABSTRACT

We readdress the issue of phylogeny of the basal extant angiosperms employing a source previously not systematically investigated, specifically, the non-coding sequences of cpDNA. Comparison of trees with and without grasses or the outgroup (Pinus) in our analyses revealed no rearrangements in tree topology that might be expected if LBA were distorting the position of the magnoliids. For each model applied, irrespective of whether monocots or ANITA members appeared basally divergent, the orchid Phalaenopsis assumed the same position on the trees with the reduced taxon set as did the branch bearing the orchid plus the grasses in the full alignment. However, our new test of model fitness revealed a different flaw influencing the placement of monocots, which is related to model mis-specification. This flaw similarly affects the full alignment and the alignment with grasses removed. In both cases the models favouring a relatively derived position for the monocots and basal placement of the branch of Amborella plus Nymphaea provide better overall prediction of the observed data structure. In the view of apparent unsuitability of the bootstrap method for large data sets, our novel test provides a new means of exploring conflicts caused by systematic errors in phylogenetic analyses.


Subject(s)
Magnoliopsida/genetics , Models, Biological , Phylogeny , Cell Nucleus/genetics , Evolution, Molecular , Magnoliopsida/classification , Plastids/genetics
10.
Mol Biol Evol ; 22(9): 1813-22, 2005 Sep.
Article in English | MEDLINE | ID: mdl-15930156

ABSTRACT

Determining the phylogenetic relationships among the major lines of angiosperms is a long-standing problem, yet the uncertainty as to the phylogenetic affinity of these lines persists. While a number of studies have suggested that the ANITA (Amborella-Nymphaeales-Illiciales-Trimeniales-Aristolochiales) grade is basal within angiosperms, studies of complete chloroplast genome sequences also suggested an alternative tree, wherein the line leading to the grasses branches first among the angiosperms. To improve taxon sampling in the existing chloroplast genome data, we sequenced the chloroplast genome of the monocot Acorus calamus. We generated a concatenated alignment (89,436 positions for 15 taxa), encompassing almost all sequences usable for phylogeny reconstruction within spermatophytes. The data still contain support for both the ANITA-basal and grasses-basal hypotheses. Using simulations we can show that were the ANITA-basal hypothesis true, parsimony (and distance-based methods with many models) would be expected to fail to recover it. The self-evident explanation for this failure appears to be a long-branch attraction (LBA) between the clade of grasses and the out-group. However, this LBA cannot explain the discrepancies observed between tree topology recovered using the maximum likelihood (ML) method and the topologies recovered using the parsimony and distance-based methods when grasses are deleted. Furthermore, the fact that neither maximum parsimony nor distance methods consistently recover the ML tree, when according to the simulations they would be expected to, when the out-group (Pinus) is deleted, suggests that either the generating tree is not correct or the best symmetric model is misspecified (or both). We demonstrate that the tree recovered under ML is extremely sensitive to model specification and that the best symmetric model is misspecified. Hence, we remain agnostic regarding phylogenetic relationships among basal angiosperm lineages.


Subject(s)
Acorus/genetics , DNA, Chloroplast/genetics , Genome, Plant , Phylogeny , Acorus/classification , Base Sequence , Genetic Variation , Nucleic Acid Amplification Techniques , Sequence Alignment
11.
Mol Biol Evol ; 21(7): 1445-54, 2004 Jul.
Article in English | MEDLINE | ID: mdl-15084683

ABSTRACT

Angiosperms (flowering plants) dominate contemporary terrestrial flora with roughly 250,000 species, but their origin and early evolution are still poorly understood. In recent years, molecular evidence has accumulated suggesting a dicotyledonous origin of monocots. Phylogenetic reconstructions have suggested that several dicotyledonous groups that include taxa such as Amborella, Austrobaileya, and Nymphaea branch off as the most basal among angiosperms. This has led to the concept of monocots, "eudicots," "basal dicots," and "ANITA" groupings. Here, we present the sequence and phylogenetic analyses of the chloroplast DNA of Nymphaea alba. Phylogenetic analyses of our 14-species data set, consisting of 29,991 aligned nucleotide positions per chloroplast genome, revealed consistent support for Nymphaea being a divergent member of a monophyletic dicot assemblage. Three distinct angiosperm lineages were supported in the majority of our phylogenetic analyses-eudicots, Magnoliopsida, and monocots. However, the monocot lineage leading to the grasses was the deepest branching. Although analyses of only one individual gene alignment (out of 61) is consistent with some recently proposed hypotheses for the paraphyly of dicots, we also report observations that nine genes do not support paraphyly of dicots. Instead, they support the basal monocot-dicot split. Consistent with this finding, we also report observations suggesting that the monocot lineage leading to the grasses has the strongest phylogenetic affinity to gymnosperms. Our findings have general implications for studies of substitution model specification and analyses of concatenated genome data.


Subject(s)
Chloroplasts/genetics , Genome, Plant , Nymphaea/classification , Nymphaea/genetics , Phylogeny , Base Sequence , Genomics , Magnoliopsida/classification , Magnoliopsida/genetics , Molecular Sequence Data , Sequence Alignment
12.
Mol Biol Evol ; 20(9): 1499-505, 2003 Sep.
Article in English | MEDLINE | ID: mdl-12832641

ABSTRACT

Phylogenetic analyses based on comparison of a limited number of genes recently suggested that Amborella trichopoda is the most ancient angiosperm. Here we present the complete sequence of the chloroplast genome of this plant. It does not display any of the genes characteristic of chloroplast DNA of the gymnosperm Pinus thunbergii (chlB, chlL, chlN, psaM, and ycf12). The majority of phylogenetic analyses of protein-coding genes of this chloroplast DNA suggests that Amborella is not the basal angiosperm and not even the most basal among dicots.


Subject(s)
Chloroplasts/genetics , Evolution, Molecular , Genes, Plant , Genome, Plant , Magnoliopsida/genetics , Trees/genetics , Biological Evolution , Magnoliopsida/classification , Phylogeny
SELECTION OF CITATIONS
SEARCH DETAIL
...