Search | VHL Regional Portal

1.

Autopolyploidy, Allopolyploidy, and Phylogenetic Networks with Horizontal Arcs.

Huber, K T; Maher, L J.

Bull Math Biol ; 85(5): 40, 2023 04 06.

Article in English | MEDLINE | ID: mdl-37022524

ABSTRACT

Polyploidization is an evolutionary process by which a species acquires multiple copies of its complete set of chromosomes. The reticulate nature of the signal left behind by it means that phylogenetic networks offer themselves as a framework to reconstruct the evolutionary past of species affected by it. The main strategy for doing this is to first construct a so-called multiple-labelled tree and to then somehow derive such a network from it. The following question therefore arises: How much can be said about that past if such a tree is not readily available? By viewing a polyploid dataset as a certain vector which we call a ploidy (level) profile, we show that among other results, there always exists a phylogenetic network in the form of a beaded phylogenetic tree with additional arcs that realizes a given ploidy profile. Intriguingly, the two end vertices of almost all of these additional arcs can be interpreted as having co-existed in time thereby adding biological realism to our network, a feature that is, in general, not enjoyed by phylogenetic networks. In addition, we show that our network may be viewed as a generator of ploidy profile space, a novel concept similar to phylogenetic tree space that we introduce to be able to compare phylogenetic networks that realize one and the same ploidy profile. We illustrate our findings in terms of a publicly available Viola dataset.

Subject(s)

Mathematical Concepts , Models, Biological , Phylogeny , Biological Evolution , Time , Models, Genetic , Algorithms

2.

Forest-Based Networks.

Huber, K T; Moulton, V; Scholz, G E.

Bull Math Biol ; 84(10): 119, 2022 09 15.

Article in English | MEDLINE | ID: mdl-36107279

ABSTRACT

In evolutionary studies, it is common to use phylogenetic trees to represent the evolutionary history of a set of species. However, in case the transfer of genes or other genetic information between the species or their ancestors has occurred in the past, a tree may not provide a complete picture of their history. In such cases, tree-based phylogenetic networks can provide a useful, more refined representation of the species' evolution. Such a network is essentially a phylogenetic tree with some arcs added between the tree's edges so as to represent reticulate events such as gene transfer, hybridization and recombination. Even so, this model does not permit the direct representation of evolutionary scenarios where reticulate events have taken place between different subfamilies or lineages of species. To represent such scenarios, in this paper we introduce the notion of a forest-based network, that is, a collection of leaf-disjoint phylogenetic trees on a set of species with arcs added between the edges of distinct trees within the collection. Forest-based networks include the recently introduced class of overlaid species forests which can be used to model introgression. As we shall see, even though the definition of forest-based networks is closely related to that of tree-based networks, they lead to new mathematical theory which complements that of tree-based networks. As well as studying the relationship of forest-based networks with other classes of phylogenetic networks, such as tree-child networks and universal tree-based networks, we present some characterizations of some special classes of forest-based networks. We expect that our results will be useful for developing new models and algorithms to understand reticulate evolution, such as introgression and gene transfer between species.

Subject(s)

Mathematical Concepts , Models, Genetic , Humans , Forests , Models, Biological , Phylogeny

3.

The hybrid number of a ploidy profile.

Huber, K T; Maher, L J.

J Math Biol ; 85(3): 30, 2022 09 16.

Article in English | MEDLINE | ID: mdl-36114394

ABSTRACT

Polyploidization, whereby an organism inherits multiple copies of the genome of their parents, is an important evolutionary event that has been observed in plants and animals. One way to study such events is in terms of the ploidy number of the species that make up a dataset of interest. It is therefore natural to ask: How much information about the evolutionary past of the set of species that form a dataset can be gleaned from the ploidy numbers of the species? To help answer this question, we introduce and study the novel concept of a ploidy profile which allows us to formalize it in terms of a multiplicity vector indexed by the species the dataset is comprised of. Using the framework of a phylogenetic network, we present a closed formula for computing the hybrid number (i.e. the minimal number of polyploidization events required to explain a ploidy profile) of a large class of ploidy profiles. This formula relies on the construction of a certain phylogenetic network from the simplification sequence of a ploidy profile and the hybrid number of the ploidy profile with which this construction is initialized. Both of them can be computed easily in case the ploidy numbers that make up the ploidy profile are not too large. To help illustrate the applicability of our approach, we apply it to a simplified version of a publicly available Viola dataset.

Subject(s)

Genome , Ploidies , Animals , Phylogeny

4.

Correction to: Tree-Based Unrooted Phylogenetic Networks.

Francis, A; Huber, K T; Moulton, V.

Bull Math Biol ; 81(3): 936-937, 2019 Mar.

Article in English | MEDLINE | ID: mdl-30446916

ABSTRACT

The level-5 example of a network presented in Fig. 4 of Francis et al. (2018) is tree-based even though it states in the caption and in the text that this is not the case.

5.

Tree-Based Unrooted Phylogenetic Networks.

Francis, A; Huber, K T; Moulton, V.

Bull Math Biol ; 80(2): 404-416, 2018 02.

Article in English | MEDLINE | ID: mdl-29238909

ABSTRACT

Phylogenetic networks are a generalization of phylogenetic trees that are used to represent non-tree-like evolutionary histories that arise in organisms such as plants and bacteria, or uncertainty in evolutionary histories. An unrooted phylogenetic network on a non-empty, finite set X of taxa, or network, is a connected, simple graph in which every vertex has degree 1 or 3 and whose leaf set is X. It is called a phylogenetic tree if the underlying graph is a tree. In this paper we consider properties of tree-based networks, that is, networks that can be constructed by adding edges into a phylogenetic tree. We show that although they have some properties in common with their rooted analogues which have recently drawn much attention in the literature, they have some striking differences in terms of both their structural and computational properties. We expect that our results could eventually have applications to, for example, detecting horizontal gene transfer or hybridization which are important factors in the evolution of many organisms.

Subject(s)

Models, Genetic , Phylogeny , Bacteria/genetics , Evolution, Molecular , Gene Transfer, Horizontal , Hybridization, Genetic , Mathematical Concepts , Plants/genetics

6.

Uprooted Phylogenetic Networks.

Gambette, P; Huber, K T; Scholz, G E.

Bull Math Biol ; 79(9): 2022-2048, 2017 Sep.

Article in English | MEDLINE | ID: mdl-28762018

ABSTRACT

The need for structures capable of accommodating complex evolutionary signals such as those found in, for example, wheat has fueled research into phylogenetic networks. Such structures generalize the standard model of a phylogenetic tree by also allowing for cycles and have been introduced in rooted and unrooted form. In contrast to phylogenetic trees or their unrooted versions, rooted phylogenetic networks are notoriously difficult to understand. To help alleviate this, recent work on them has also centered on their "uprooted" versions. By focusing on such graphs and the combinatorial concept of a split system which underpins an unrooted phylogenetic network, we show that not only can a so-called (uprooted) 1-nested network N be obtained from the Buneman graph (sometimes also called a median network) associated with the split system [Formula: see text] induced on the set of leaves of N but also that that graph is, in a well-defined sense, optimal. Along the way, we establish the 1-nested analogue of the fundamental "splits equivalence theorem" for phylogenetic trees and characterize maximal circular split systems.

Subject(s)

Models, Genetic , Phylogeny , Algorithms , Biological Evolution , Mathematical Concepts , Triticum/classification , Triticum/genetics

7.

Minimum triplet covers of binary phylogenetic X-trees.

Huber, K T; Moulton, V; Steel, M.

J Math Biol ; 75(6-7): 1827-1840, 2017 12.

Article in English | MEDLINE | ID: mdl-28608005

ABSTRACT

Trees with labelled leaves and with all other vertices of degree three play an important role in systematic biology and other areas of classification. A classical combinatorial result ensures that such trees can be uniquely reconstructed from the distances between the leaves (when the edges are given any strictly positive lengths). Moreover, a linear number of these pairwise distance values suffices to determine both the tree and its edge lengths. A natural set of pairs of leaves is provided by any 'triplet cover' of the tree (based on the fact that each non-leaf vertex is the median vertex of three leaves). In this paper we describe a number of new results concerning triplet covers of minimum size. In particular, we characterize such covers in terms of an associated graph being a 2-tree. Also, we show that minimum triplet covers are 'shellable' and thereby provide a set of pairs for which the inter-leaf distance values will uniquely determine the underlying tree and its associated branch lengths.

Subject(s)

Models, Genetic , Phylogeny , Mathematical Concepts , Systems Biology

8.

On the challenge of reconstructing level-1 phylogenetic networks from triplets and clusters.

Gambette, Philippe; Huber, K T; Kelk, S.

J Math Biol ; 74(7): 1729-1751, 2017 06.

Article in English | MEDLINE | ID: mdl-27800561

ABSTRACT

Phylogenetic networks have gained prominence over the years due to their ability to represent complex non-treelike evolutionary events such as recombination or hybridization. Popular combinatorial objects used to construct them are triplet systems and cluster systems, the motivation being that any network N induces a triplet system [Formula: see text] and a softwired cluster system [Formula: see text]. Since in real-world studies it cannot be guaranteed that all triplets/softwired clusters induced by a network are available, it is of particular interest to understand whether subsets of [Formula: see text] or [Formula: see text] allow one to uniquely reconstruct the underlying network N. Here we show that even within the highly restricted yet biologically interesting space of level-1 phylogenetic networks it is not always possible to uniquely reconstruct a level-1 network N, even when all triplets in [Formula: see text] or all clusters in [Formula: see text] are available. On the positive side, we introduce a reasonably large subclass of level-1 networks the members of which are uniquely determined by their induced triplet/softwired cluster systems. Along the way, we also establish various enumerative results, both positive and negative, including results which show that certain special subclasses of level-1 networks N can be uniquely reconstructed from proper subsets of [Formula: see text] and [Formula: see text]. We anticipate these results to be of use in the design of algorithms for phylogenetic network inference.

Subject(s)

Classification/methods , Models, Biological , Phylogeny , Algorithms

9.

Two novel closure rules for constructing phylogenetic super-networks.

Grünewald, S; Huber, K T; Wu, Q.

Bull Math Biol ; 70(7): 1906-24, 2008 Oct.

Article in English | MEDLINE | ID: mdl-18665426

ABSTRACT

A contemporary and fundamental problem faced by many evolutionary biologists is how to puzzle together a collection Weierstrass p of partial trees (leaf-labeled trees whose leaves are bijectively labeled by species or, more generally, taxa, each supported by, e.g., a gene) into an overall parental structure that displays all trees in Weierstrass p. This already difficult problem is complicated by the fact that the trees in Weierstrass p regularly support conflicting phylogenetic relationships and are not on the same but only overlapping taxa sets. A desirable requirement on the sought after parental structure, therefore, is that it can accommodate the observed conflicts. Phylogenetic networks are a popular tool capable of doing precisely this. However, not much is known about how to construct such networks from partial trees, a notable exception being the Z-closure super-network approach, which is based on the Z-closure rule, and the Q-imputation approach. Although attractive approaches, they both suffer from the fact that the generated networks tend to be multidimensional making it necessary to apply some kind of filter to reduce their complexity.To avoid having to resort to a filter, we follow a different line of attack in this paper and develop closure rules for generating circular phylogenetic networks which have the attractive property that they can be represented in the plane. In particular, we introduce the novel Y-(closure) rule and show that this rule on its own or in combination with one of Meacham's closure rules (which we call the M-rule) has some very desirable theoretical properties. In addition, we present a case study based on Rivera et al. "ring of life" to explore the reconstructive power of the M- and Y-rule and also reanalyze an Arabidopsis thaliana data set.

Subject(s)

Arabidopsis/genetics , Models, Genetic , Phylogeny , Algorithms , Arabidopsis/classification , Archaea/classification , Archaea/genetics , Bacteria/classification , Bacteria/genetics , Evolution, Molecular , Yeasts/classification , Yeasts/genetics

10.

Phylogenetic networks from multi-labelled trees.

Huber, K T; Moulton, V.

J Math Biol ; 52(5): 613-32, 2006 May.

Article in English | MEDLINE | ID: mdl-16463187

ABSTRACT

It is now quite well accepted that the evolutionary past of certain species is better represented by phylogenetic networks as opposed to trees. For example, polyploids are typically thought to have resulted through hybridization and duplication, processes that are probably not best represented as bifurcating speciation events. Based on the knowledge of a multi-labelled tree relating collection of polyploids, we present a canonical construction of a phylogenetic network that exhibits the tree. In addition, we prove that the resulting network is in some well-defined sense a minimal network having this property.

Subject(s)

Models, Genetic , Phylogeny , Trees/genetics , Algorithms , Evolution, Molecular , Genetic Speciation , Polyploidy , Trees/classification

11.

The MinMax Squeeze: guaranteeing a minimal tree for population data.

Holland, B R; Huber, K T; Penny, D; Moulton, V.

Mol Biol Evol ; 22(2): 235-42, 2005 Feb.

Article in English | MEDLINE | ID: mdl-15483326

ABSTRACT

We report that for population data, where sequences are very similar to one another, it is often possible to use a two-pronged (MinMax Squeeze) approach to prove that a tree is the shortest possible under the parsimony criterion. Such population data can be in a range where parsimony is a maximum likelihood estimator. This is in sharp contrast to the case with species data, where sequences are much further apart and the problem of guaranteeing an optimal phylogenetic tree is known to be computationally prohibitive for realistic numbers of species, irrespective of whether likelihood or parsimony is the optimality criterion. The Squeeze uses both an upper bound (the length of the shortest tree known) and a lower bound derived from partitions of the columns (the length of the shortest tree possible). If the two bounds meet, the shortest known tree is thus proven to be a shortest possible tree. The implementation is first tested on simulated data sets and then applied to 53 complete human mitochondrial genomes. The shortest possible trees for those data have several significant improvements from the published tree. Namely, a pair of Australian lineages comes deeper in the tree (in agreement with archaeological data), and the non-African part of the tree shows greater agreement with the geographical distribution of lineages.

Subject(s)

Computational Biology/methods , DNA, Mitochondrial/genetics , Genetics, Population/methods , Phylogeny , Humans

12.

Delta plots: a tool for analyzing phylogenetic distance data.

Holland, B R; Huber, K T; Dress, A; Moulton, V.

Mol Biol Evol ; 19(12): 2051-9, 2002 Dec.

Article in English | MEDLINE | ID: mdl-12446797

ABSTRACT

A method is described that allows the assessment of treelikeness of phylogenetic distance data before tree estimation. This method is related to statistical geometry as introduced by Eigen, Winkler-Oswatitsch, and Dress (1988 [Proc. Natl. Acad. Sci. USA. 85:5913-5917]), and in essence, displays a measure for treelikeness of quartets in terms of a histogram that we call a delta plot. This allows identification of nontreelike data and analysis of noisy data sets arising from processes such as, for example, parallel evolution, recombination, or lateral gene transfer. In addition to an overall assessment of treelikeness, individual taxa can be ranked by reference to the treelikeness of the quartets to which they belong. Removal of taxa on the basis of this ranking results in an increase in accuracy of tree estimation. Recombinant data sets are simulated, and the method is shown to be capable of identifying single recombinant taxa on the basis of distance information alone, provided the parents of the recombinant sequence are sufficiently divergent and the mixture of tree histories is not strongly skewed toward a single tree. delta Plots and taxon rankings are applied to three biological data sets using distances derived from sequence alignment, gene order, and fragment length polymorphism.

Subject(s)

Models, Genetic , Phylogeny , Polymorphism, Restriction Fragment Length , Recombination, Genetic

13.

Pruned median networks: a technique for reducing the complexity of median networks.

Huber, K T; Moulton, V; Lockhart, P; Dress, A.

Mol Phylogenet Evol ; 19(2): 302-10, 2001 May.

Article in English | MEDLINE | ID: mdl-11341811

ABSTRACT

Observations from molecular marker studies on recently diverged species indicate that substitution patterns in DNA sequences can often be complex and poorly described by tree-like bifurcating evolutionary models. These observations might result from processes of species diversification and/or processes of sequence evolution that are not tree-like. In these cases, bifurcating tree representations provide poor visualization of phylogenetic signals in sequence data. In this paper, we use median networks to study DNA sequence substitution patterns in plant nuclear and chloroplast markers. We describe how to prune median networks to obtain so called pruned median networks. These simpler networks may help to provide a useful framework for investigating the phylogenetic complexity of recently diverged taxa with hybrid origins.

Subject(s)

DNA, Plant/genetics , Phylogeny , Base Sequence , Cell Nucleus/genetics , Chloroplasts/genetics , Evolution, Molecular , Models, Genetic , Molecular Sequence Data , Plants/genetics , Sequence Homology, Nucleic Acid

14.

An algorithm for constructing local regions in a phylogenetic network.

Huber, K T; Watson, E E; Hendy, M D.

Mol Phylogenet Evol ; 19(1): 1-8, 2001 Apr.

Article in English | MEDLINE | ID: mdl-11286486

ABSTRACT

The groupings of taxa in a phylogenetic tree cannot represent all the conflicting signals that usually occur among site patterns in aligned homologous genetic sequences. Hence a tree-building program must compromise by reporting a subset of the patterns, using some discriminatory criterion. Thus, in the worst case, out of possibly a large number of equally good trees, only an arbitrarily chosen tree might be reported by the tree-building program as "The Tree." This tree might then be used as a basis for phylogenetic conclusions. One strategy to represent conflicting patterns in the data is to construct a network. The Buneman graph is a theoretically very attractive example of such a network. In particular, a characterization for when this network will be a tree is known. Also the Buneman graph contains each of the most parsimonious trees indicated by the data. In this paper we describe a new method for constructing the Buneman graph that can be used for a generalization of Hadamard conjugation to networks. This new method differs from previous methods by allowing us to focus on local regions of the graph without having to first construct the full graph. The construction is illustrated by an example.

Subject(s)

Algorithms , Phylogeny , DNA, Mitochondrial/genetics , Humans

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL