Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 22
Filter
Add more filters










Publication year range
1.
Genomics Proteomics Bioinformatics ; 19(4): 662-667, 2021 08.
Article in English | MEDLINE | ID: mdl-34119695

ABSTRACT

Composition Vector Tree (CVTree) is an alignment-free algorithm to infer phylogenetic relationships from genome sequences. It has been successfully applied to study phylogeny and taxonomy of viruses, prokaryotes, and fungi based on the whole genomes, as well as chloroplast genomes, mitochondrial genomes, and metagenomes. Here we presented the standalone software for the CVTree algorithm. In the software, an extensible parallel workflow for the CVTree algorithm was designed. Based on the workflow, new alignment-free methods were also implemented. And by examining the phylogeny and taxonomy of 13,903 prokaryotes based on 16S rRNA sequences, we showed that CVTree software is an efficient and effective tool for studying phylogeny and taxonomy based on genome sequences. The code of CVTree software can be available at https://github.com/ghzuo/cvtree.


Subject(s)
Genome , Software , Algorithms , Phylogeny , RNA, Ribosomal, 16S/genetics
2.
Front Mol Biosci ; 7: 589718, 2020.
Article in English | MEDLINE | ID: mdl-33195438

ABSTRACT

The transition network provides a key to reveal the thermodynamic and kinetic properties of biomolecular systems. In this paper, we introduce a new method, named effective energy rescaling space trajectory mapping (EspcTM), to detect metastable states and construct transition networks based on the simulation trajectories of the complex biomolecular system. It mapped simulation trajectories into an orthogonal function space, whose bases were rescaled by effective energy, and clustered the interrelation between these trajectories to locate metastable states. By using the EspcTM method, we identified the metastable states and elucidated interstate transition kinetics of a Brownian particle and a dodecapeptide. It was found that the scaling parameters of effective energy also provided a clue to the dominating factors in dynamics. We believe that the EspcTM method is a useful tool for the studies of dynamics of the complex system and may provide new insight into the understanding of thermodynamics and kinetics of biomolecular systems.

3.
Genomics Proteomics Bioinformatics ; 16(5): 310-319, 2018 10.
Article in English | MEDLINE | ID: mdl-30550857

ABSTRACT

We report an important but long-overlooked manifestation of low-resolution power of 16S rRNA sequence analysis at the species level, namely, in 16S rRNA-based phylogenetic trees polyphyletic placements of closely-related species are abundant compared to those in genome-based phylogeny. This phenomenon makes the demarcation of genera within many families ambiguous in the 16S rRNA-based taxonomy. In this study, we reconstructed phylogenetic relationship for more than ten thousand prokaryote genomes using the CVTree method, which is based on whole-genome information. And many such genera, which are polyphyletic in 16S rRNA-based trees, are well resolved as monophyletic clusters by CVTree. We believe that with genome sequencing of prokaryotes becoming a commonplace, genome-based phylogeny is doomed to play a definitive role in the construction of a natural and objective taxonomy.


Subject(s)
Phylogeny , RNA, Ribosomal, 16S/genetics , Genome , Genomics , Sequence Analysis, DNA
4.
Synth Syst Biotechnol ; 2(3): 226-235, 2017 Sep.
Article in English | MEDLINE | ID: mdl-29318203

ABSTRACT

A monospecific genus contains a single species ever since it was proposed. Though formally more than half of the known prokaryotic genera are monospecific, we pick up those which actually raise taxonomic problems by violating monophyly of the taxon within which it resides. Taking monophyly as a guiding principle, our arguments are based on simultaneous support from 16S rRNA sequence analysis and whole-genome phylogeny of prokaryotes, as provided by the LVTree Viewer and CVTree Web Server, respectively. The main purpose of this study consists in calling attention to this specific way of global taxonomic analysis. Therefore, we refrain from making formal emendations for the time being.

6.
Genomics Proteomics Bioinformatics ; 14(2): 94-102, 2016 04.
Article in English | MEDLINE | ID: mdl-27018315

ABSTRACT

We describe an interactive viewer for the All-Species Living Tree (LVTree). The viewer incorporates treeing and lineage information from the ARB-SILVA website. It allows collapsing the tree branches at different taxonomic ranks and expanding the collapsed branches as well, keeping the overall topology of the tree unchanged. It also enables the user to observe the consequence of trial lineage modifications by re-collapsing the tree. The system reports taxon statistics at all ranks automatically after each collapsing and re-collapsing. These features greatly facilitate the comparison of the 16S rRNA sequence phylogeny with prokaryotic taxonomy in a taxon by taxon manner. In view of the fact that the present prokaryotic systematics is largely based on 16S rRNA sequence analysis, the current viewer may help reveal discrepancies between phylogeny and taxonomy. As an application, we show that in the latest release of LVTree, based on 11,939 rRNA sequences, as few as 24 lineage modifications are enough to bring all but two phyla (Proteobacteria and Firmicutes) to monophyletic clusters.


Subject(s)
Bacteria/genetics , User-Computer Interface , Bacteria/classification , Bacteroidetes/classification , Bacteroidetes/genetics , Internet , Phylogeny , RNA, Ribosomal, 16S/genetics , RNA, Ribosomal, 16S/metabolism
7.
Genomics Proteomics Bioinformatics ; 13(5): 321-31, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26563468

ABSTRACT

A faithful phylogeny and an objective taxonomy for prokaryotes should agree with each other and ultimately follow the genome data. With the number of sequenced genomes reaching tens of thousands, both tree inference and detailed comparison with taxonomy are great challenges. We now provide one solution in the latest Release 3.0 of the alignment-free and whole-genome-based web server CVTree3. The server resides in a cluster of 64 cores and is equipped with an interactive, collapsible, and expandable tree display. It is capable of comparing the tree branching order with prokaryotic classification at all taxonomic ranks from domains down to species and strains. CVTree3 allows for inquiry by taxon names and trial on lineage modifications. In addition, it reports a summary of monophyletic and non-monophyletic taxa at all ranks as well as produces print-quality subtree figures. After giving an overview of retrospective verification of the CVTree approach, the power of the new server is described for the mega-classification of prokaryotes and determination of taxonomic placement of some newly-sequenced genomes. A few discrepancies between CVTree and 16S rRNA analyses are also summarized with regard to possible taxonomic revisions. CVTree3 is freely accessible to all users at http://tlife.fudan.edu.cn/cvtree3/ without login requirements.


Subject(s)
Archaea/classification , Bacteria/classification , Genome, Archaeal/genetics , Genome, Bacterial/genetics , Internet , Archaea/genetics , Bacteria/genetics , Phylogeny , RNA, Ribosomal, 16S/genetics , Retrospective Studies , Sequence Alignment/methods
8.
Life (Basel) ; 5(1): 949-68, 2015 Mar 17.
Article in English | MEDLINE | ID: mdl-25789552

ABSTRACT

A tripartite comparison of Archaea phylogeny and taxonomy at and above the rank order is reported: (1) the whole-genome-based and alignment-free CVTree using 179 genomes; (2) the 16S rRNA analysis exemplified by the All-Species Living Tree with 366 archaeal sequences; and (3) the Second Edition of Bergey's Manual of Systematic Bacteriology complemented by some current literature. A high degree of agreement is reached at these ranks. From the newly proposed archaeal phyla, Korarchaeota, Thaumarchaeota, Nanoarchaeota and Aigarchaeota, to the recent suggestion to divide the class Halobacteria into three orders, all gain substantial support from CVTree. In addition, the CVTree helped to determine the taxonomic position of some newly sequenced genomes without proper lineage information. A few discrepancies between the CVTree and the 16S rRNA approaches call for further investigation.

9.
Comput Biol Chem ; 53 Pt A: 166-73, 2014 Dec.
Article in English | MEDLINE | ID: mdl-25205031

ABSTRACT

Using an enlarged alphabet of K-tuples is the way to carry out alignment-free comparison of genomes in the composition vector (CV) approach to prokaryotic phylogeny. We summarize the known aspects concerning the choice of K and examine the results of using CVs with subtraction of a statistical background for K=3-9 and using raw CVs without subtraction for K=1-12. The criterion for evaluation consists in direct comparison with taxonomy. For prokaryotes the best performances are obtained for K=5 and 6 with subtraction and for K=11, 12 or even more without subtraction. In general, CVs with subtractions are slightly better and less CPU consuming, but CVs without subtraction may provide complementary information.


Subject(s)
Algorithms , Archaea/classification , Bacteria/classification , Genome, Archaeal , Genome, Bacterial , Phylogeny , Archaea/genetics , Archaeal Proteins/chemistry , Archaeal Proteins/genetics , Bacteria/genetics , Bacterial Proteins/chemistry , Bacterial Proteins/genetics , Peptides/chemistry , Peptides/genetics , Sequence Analysis, DNA , Sequence Analysis, Protein
10.
Antonie Van Leeuwenhoek ; 105(2): 431-5, 2014 Feb.
Article in English | MEDLINE | ID: mdl-24301254

ABSTRACT

Ten well-annotated genomes of "Sulfolobus islandicus" strains from different geographic locations have been released at the NCBI database. Whole genome based composition vector trees indicate that these strains show the same branching patterns as originally reported by multi-locus sequence analysis. To determine whether the ten strains meet the criteria for separate species, DNA-DNA hybridization (DDH) was performed in silico. DDH values of strains from the same geographic location, i.e., Iceland, Kamchatka and North America, ranged from 82.4 to 95.4 %, clearly qualifying them as members of the same species. The lowest DDH values found between locations ranged from 75.5 to 76.6 %, which exceed the 70 % DDH threshold for a species thereby indicating they are all members of the same species based on the currently accepted definition. The clear divergences of strains from the different geographic locations are sufficiently great to consider them as separate geovars. "S. islandicus" has not yet been validly named and a type strain has not been deposited in culture collections. We urgently recommend that those who study the organism fulfill the criteria of the International Code of Nomenclature of Bacteria in order to designate a type strain and to identify and deposit related strains of this species to make them available to the broader scientific community.


Subject(s)
Genetic Variation , Phylogeography , Sulfolobus/genetics , Computational Biology/methods , Computer Simulation , Iceland , North America , Nucleic Acid Hybridization , Russia
11.
Sci Rep ; 3: 1660, 2013.
Article in English | MEDLINE | ID: mdl-23588597

ABSTRACT

The biological toxicity of high levels of breathing gases has been known for centuries, but the mechanism remains elusive. Earlier work mainly focused on the influences of dispersed gas molecules dissolved in water on biomolecules. However, recent studies confirmed the existence of aggregated gas molecules at the water-solid interface. In this paper, we have investigated the binding preference of aggregated gas molecules on proteins with molecular dynamics simulations, using nitrogen (N2) gas and the Src-homology 3 (SH3) domain as the model system. Aggregated N2 molecules were strongly bound by the active sites of the SH3 domain, which could impair the activity of the protein. In contrast, dispersed N2 molecules did not specifically interact with the SH3 domain. These observations extend our understanding of the possible toxicity of aggregates of gas molecules in the function of proteins.


Subject(s)
Gases/chemistry , Models, Chemical , Models, Molecular , Nitrogen/chemistry , src Homology Domains , src-Family Kinases/chemistry , src-Family Kinases/ultrastructure , Binding Sites , Computer Simulation , Protein Binding , Protein Conformation , Protein Denaturation
12.
J Phys Chem B ; 117(13): 3541-7, 2013 Apr 04.
Article in English | MEDLINE | ID: mdl-23477344

ABSTRACT

With the widespread applications of nanomaterials such as carbon nanotubes, there is a growing concern on the biosafety of these engineered nanoparticles, in particular their interactions with proteins. In molecular simulations of nanoparticle-protein interactions, the choice of empirical parameters (force fields) plays a decisive role, and thus is of great importance and should be examined carefully before wider applications. Here we compare three commonly used force fields, CHARMM, OPLSAA, and AMBER in study of the competitive binding of a single wall carbon nanotube (SWCNT) with a native proline-rich motif (PRM) ligand on its target protein SH3 domain, a ubiquitous protein-protein interaction mediator involved in signaling and regulatory pathways. We find that the SWCNT displays a general preference over the PRM in binding with SH3 domain in all the three force fields examined, although the degree of preference can be somewhat different, with the AMBER force field showing the highest preference. The SWCNT prevents the ligand from reaching its native binding pocket by (i) occupying the binding pocket directly, and (ii) binding with the ligand itself and then being trapped together onto some off-sites. The π-π stacking interactions between the SWCNT and aromatic residues are found to play a significant role in its binding to the SH3 domain in all the three force fields. Further analyses show that even the SWCNT-ligand binding can also be relatively more stable than the native ligand-protein binding, indicating a serious potential disruption to the protein SH3 function.


Subject(s)
Nanotubes, Carbon/chemistry , Proline/chemistry , Ubiquitins/chemistry , Binding Sites , Ligands , Models, Molecular , src Homology Domains
13.
Genomics Proteomics Bioinformatics ; 11(1): 61-5, 2013 Feb.
Article in English | MEDLINE | ID: mdl-23395177

ABSTRACT

Shigella species and Escherichia coli are closely related organisms. Early phenotyping experiments and several recent molecular studies put Shigella within the species E. coli. However, the whole-genome-based, alignment-free and parameter-free CVTree approach shows convincingly that four established Shigella species, Shigella boydii, Shigella sonnei, Shigella felxneri and Shigella dysenteriae, are distinct from E. coli strains, and form sister species to E. coli within the genus Escherichia. In view of the overall success and high resolution power of the CVTree approach, this result should be taken seriously. We hope that the present report may promote further in-depth study of the Shigella-E. coli relationship.


Subject(s)
Escherichia/classification , Genome, Bacterial , Shigella/classification , Escherichia/genetics , Escherichia coli/classification , Escherichia coli/genetics , Phylogeny , Shigella/genetics , Shigella boydii/classification , Shigella boydii/genetics , Shigella dysenteriae/classification , Shigella dysenteriae/genetics , Shigella sonnei/classification , Shigella sonnei/genetics
14.
Small ; 9(9-10): 1546-56, 2013 May 27.
Article in English | MEDLINE | ID: mdl-23038664

ABSTRACT

The widespread application of nanomaterials has spurred an interest in the study of interactions between nanoparticles and proteins due to the biosafety concerns of these nanomaterials. In this review, a summary is presented of some of the recent studies on this important subject, especially on the interactions of proteins with carbon nanotubes (CNTs) and metallofullerenols. Two potential molecular mechanisms have been proposed for CNTs' inhibition of protein functions. The driving forces of CNTs' adsorption onto proteins are found to be mainly hydrophobic interactions and the so-called π-π stacking between CNTs' carbon rings and proteins' aromatic residues. However, there is also recent evidence showing that endohedral metallofullerenol Gd@C82 (OH)22 can be used to inhibit tumor growth, thus acting as a potential nanomedicine. These recent findings have provided a better understanding of nanotoxicity at the molecular level and also suggested therapeutic potential by using nanoparticles' cytotoxicity against cancer cells.


Subject(s)
Nanotubes, Carbon/toxicity , Proteins/chemistry , Humans , Microscopy, Electron , Nanotubes, Carbon/chemistry
15.
ACS Nano ; 4(12): 7508-14, 2010 Dec 28.
Article in English | MEDLINE | ID: mdl-21080666

ABSTRACT

Nanoscale particles have become promising materials in many fields, such as cancer therapeutics, diagnosis, imaging, drug delivery, catalysis, as well as biosensors. In order to stimulate and facilitate these applications, there is an urgent need for the understanding of the nanoparticle toxicity and other risks involved with these nanoparticles to human health. In this study, we use large-scale molecular dynamics simulations to study the interaction between several proteins (WW domains) and carbon nanotubes (one form of hydrophobic nanoparticles). We have found that the carbon nanotube can plug into the hydrophobic core of proteins to form stable complexes. This plugging of nanotubes disrupts and blocks the active sites of WW domains from binding to the corresponding ligands, thus leading to the loss of the original function of the proteins. The key to this observation is the hydrophobic interaction between the nanoparticle and the hydrophobic residues, particularly tryptophans, in the core of the domain. We believe that these findings might provide a novel route to the nanoparticle toxicity on the molecular level for the hydrophobic nanoparticles.


Subject(s)
Hydrophobic and Hydrophilic Interactions , Molecular Dynamics Simulation , Nanotubes, Carbon/chemistry , Nanotubes, Carbon/toxicity , Proteins/chemistry , Proteins/metabolism , Humans , Protein Structure, Secondary , Protein Structure, Tertiary , Thermodynamics
16.
J Phys Chem B ; 114(17): 5835-9, 2010 May 06.
Article in English | MEDLINE | ID: mdl-20392088

ABSTRACT

The folding of a small RNA tetraloop hairpin is studied based on intensive molecular dynamics simulation, aiming to understand the folding mechanism of this small and fast RNA folder. Our results showed that this RNA hairpin has very complicated folding behavior in spite of its small size. It is found that the folding transition has low cooperativity. Instead of a two-state folding, four major states are observed, including the native state, the intermediate, the unfolded state, and the misfolded state. The misfolded state is mainly stabilized by the non-native hydrogen bonds, and is more compact. Two potential folding pathways, in which two basepairs formed with different order, are observed, and the pathway with the inboard basepair formed before the terminal one is much more favorable, and dominates the folding of the RNA hairpin.


Subject(s)
Molecular Dynamics Simulation , RNA/chemistry , Hydrogen Bonding , Models, Molecular , Nucleic Acid Conformation , Nucleic Acid Denaturation
17.
Phys Rev E Stat Nonlin Soft Matter Phys ; 81(3 Pt 1): 031917, 2010 Mar.
Article in English | MEDLINE | ID: mdl-20365780

ABSTRACT

The effect of solvation-related interaction on the low-temperature dynamics of proteins is studied by taking into account the desolvation barriers in the interactions of native contacts. It is found out that about the folding transition temperature, the protein folds in a cooperative manner, and the water molecules are expelled from the hydrophobic core at the final stage in the folding process. At low temperature, however, the protein would generally be trapped in many metastable conformations with some water molecules frozen inside the protein. The desolvation takes an important role in these processes. The number of frozen water molecules and that of frozen states of proteins are further analyzed with the methods based on principal component analysis (PCA) and the clustering of conformations. It is found out that both the numbers of frozen water molecules and the frozen states of the protein increase quickly below a certain temperature. Especially, the number of frozen states of the protein increases exponentially following the decrease in the temperature, which resembles the basic features of glassy dynamics. Interestingly, it is observed that the freezing of water molecules and that of protein conformations happen at almost the same temperature. This suggests that the solvation-related interaction performs an important role for the low-temperature dynamics of the model protein.


Subject(s)
Models, Chemical , Models, Molecular , Proteins/chemistry , Proteins/ultrastructure , Solvents/chemistry , Cold Temperature , Computer Simulation , Protein Conformation , Protein Denaturation , Protein Folding
18.
Genomics Proteomics Bioinformatics ; 8(4): 262-7, 2010 Dec.
Article in English | MEDLINE | ID: mdl-21382595

ABSTRACT

Composition vector trees (CVTrees) are inferred from whole-genome data by an alignment-free and parameter-free method. The agreement of these trees with the corresponding taxonomy provides an objective justification of the inferred phylogeny In this work, we show the stability and self-consistency of CVTrees by performing bootstrap and jackknife re-sampling tests adapted to this alignment-free approach. Our ultimate goal is to advocate the viewpoint that time-consuming statistical re-sampling tests can be avoided at all in using this alignment-free approach. Agreement with taxonomy should be taken as a major criterion to estimate prokaryotic phylogenetic trees.


Subject(s)
Computational Biology/methods , Genomics/methods , Phylogeny , Algorithms , Data Interpretation, Statistical
19.
Phys Rev Lett ; 103(13): 137801, 2009 Sep 25.
Article in English | MEDLINE | ID: mdl-19905541

ABSTRACT

Using molecular dynamics simulation, we show direct evidence of the unexpected phenomenon of "water that does not wet a water monolayer" at room temperature. This phenomenon is attributed to the structure of the water beneath the water droplet, which exhibits an ordered water monolayer. Remarkably, there remains a considerable number of dangling OH bonds in this room temperature water monolayer, in contrast with the absence of dangling OH bonds at cryogenic temperature.


Subject(s)
Models, Chemical , Water/chemistry , Computer Simulation , Hydrogen Bonding , Models, Molecular , Wettability
20.
IUBMB Life ; 61(6): 627-43, 2009 Jun.
Article in English | MEDLINE | ID: mdl-19472192

ABSTRACT

Protein folding is an important and challenging problem in molecular biology. During the last two decades, molecular dynamics (MD) simulation has proved to be a paramount tool and was widely used to study protein structures, folding kinetics and thermodynamics, and structure-stability-function relationship. It was also used to help engineering and designing new proteins, and to answer even more general questions such as the minimal number of amino acid or the evolution principle of protein families. Nowadays, the MD simulation is still undergoing rapid developments. The first trend is to toward developing new coarse-grained models and studying larger and more complex molecular systems such as protein-protein complex and their assembling process, amyloid related aggregations, and structure and motion of chaperons, motors, channels and virus capsides; the second trend is toward building high resolution models and explore more detailed and accurate pictures of protein folding and the associated processes, such as the coordination bond or disulfide bond involved folding, the polarization, charge transfer and protonate/deprotonate process involved in metal coupled folding, and the ion permeation and its coupling with the kinetics of channels. On these new territories, MD simulations have given many promising results and will continue to offer exciting views. Here, we review several new subjects investigated by using MD simulations as well as the corresponding developments of appropriate protein models. These include but are not limited to the attempt to go beyond the topology based Go-like model and characterize the energetic factors in protein structures and dynamics, the study of the thermodynamics and kinetics of disulfide bond involved protein folding, the modeling of the interactions between chaperonin and the encapsulated protein and the protein folding under this circumstance, the effort to clarify the important yet still elusive folding mechanism of protein BBL, the development of discrete MD and its application in studying the alpha-beta conformational conversion and oligomer assembling process, and the modeling of metal ion involved protein folding.


Subject(s)
Models, Molecular , Protein Folding , Chaperonins/physiology , Computer Simulation , Disulfides/chemistry , Hydrogen Bonding , Kinetics , Metals/pharmacology , Models, Chemical , Peptides/chemistry , Protein Conformation , Protein Folding/drug effects , Protein Multimerization , Ribosomal Protein S6/chemistry , Thermodynamics , Zinc Fingers
SELECTION OF CITATIONS
SEARCH DETAIL
...