Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 19 de 19
Filter
Add more filters










Publication year range
1.
Astrobiology ; 19(9): 1177-1185, 2019 09.
Article in English | MEDLINE | ID: mdl-31397580

ABSTRACT

The Planetary Society's Phobos Living Interplanetary Flight Experiment (Phobos LIFE) flew in the sample return capsule of the Russian Federal Space Agency's Phobos Grunt mission and was to have been a test of one aspect of the hypothesis that life can move between nearby planets within ejected rocks. Although the Phobos Grunt mission failed, we present here the scientific and engineering design and motivation of the Phobos LIFE experiment to assist with the scientific and engineering design of similar future experiments. Phobos LIFE flew selected organisms in a simulated meteoroid. The 34-month voyage would have been the first such test to occur in the high-radiation environment outside the protection of Earth's magnetosphere for more than a few days. The patented Phobos LIFE "biomodule" is an 88 g cylinder consisting of a titanium outer shell, several types of redundant seals, and 31 individual Delrin sample containers. Phobos LIFE contained 10 different organisms, representing all three domains of life, and one soil sample. The organisms are all very well characterized, most with sequenced genomes. Most are extremophiles, and most have flown in low Earth orbit. Upon return from space, the health and characteristics of organisms were to have been compared with controls that remained on Earth and have not yet been opened.


Subject(s)
Exobiology , Extraterrestrial Environment , Planets , Space Flight , Hot Temperature , Meteoroids , Radiation
2.
Stand Genomic Sci ; 10: 26, 2015.
Article in English | MEDLINE | ID: mdl-26203337

ABSTRACT

The Genomic Encyclopedia of Bacteria and Archaea (GEBA) project was launched by the JGI in 2007 as a pilot project to sequence about 250 bacterial and archaeal genomes of elevated phylogenetic diversity. Herein, we propose to extend this approach to type strains of prokaryotes associated with soil or plants and their close relatives as well as type strains from newly described species. Understanding the microbiology of soil and plants is critical to many DOE mission areas, such as biofuel production from biomass, biogeochemistry, and carbon cycling. We are also targeting type strains of novel species while they are being described. Since 2006, about 630 new species have been described per year, many of which are closely aligned to DOE areas of interest in soil, agriculture, degradation of pollutants, biofuel production, biogeochemical transformation, and biodiversity.

3.
BMC Syst Biol ; 9 Suppl 4: S1, 2015.
Article in English | MEDLINE | ID: mdl-26100579

ABSTRACT

BACKGROUND: Malaria is a major health threat, affecting over 40% of the world's population. The latest report released by the World Health Organization estimated about 207 million cases of malaria infection, and about 627,000 deaths in 2012 alone. During the past decade, new therapeutic targets have been identified and are at various stages of characterization, thanks to the emerging omics-based technologies. However, the mechanism of malaria pathogenesis remains largely unknown. In this paper, we employ a novel neighborhood subnetwork alignment approach to identify network components that are potentially involved in pathogenesis. RESULTS: Our module-based subnetwork alignment approach identified 24 functional homologs of pathogenesis-related proteins in the malaria parasite P. falciparum, using the protein-protein interaction networks in Escherichia coli as references. Eighteen out of these 24 proteins are associated with 418 other proteins that are related to DNA replication, transcriptional regulation, translation, signaling, metabolism, cell cycle regulation, as well as cytoadherence and entry to the host. CONCLUSIONS: The subnetwork alignments and subsequent protein-protein association network mining predicted a group of malarial proteins that may be involved in parasite development and parasite-host interaction, opening a new systems-level view of parasite pathogenesis and virulence.


Subject(s)
Computational Biology/methods , Malaria, Falciparum/etiology , Malaria, Falciparum/parasitology , Plasmodium falciparum/physiology , Animals , Antigens, Protozoan/metabolism , Chromatin Assembly and Disassembly , Epigenesis, Genetic , Humans , Plasmodium falciparum/cytology , Plasmodium falciparum/genetics , Plasmodium falciparum/metabolism , Protein Transport , Proteolysis , Protozoan Proteins/chemistry , Protozoan Proteins/genetics , Protozoan Proteins/metabolism , RNA, Protozoan/metabolism , Sequence Homology, Amino Acid , Signal Transduction , Stress, Physiological , Transcription, Genetic
4.
Methods ; 83: 63-70, 2015 Jul 15.
Article in English | MEDLINE | ID: mdl-25861922

ABSTRACT

Annotating and understanding the function of proteins and other elements in a genome can be difficult in the absence of a well-studied and evolutionarily close relative. The causative agent of malaria, one of the oldest and most deadly global infectious diseases, is a good example of this problem. The burden of malaria is huge and there is a pressing need for new, more effective antimalarial strategies. However, techniques such as homology-dependent annotation transfer are severely impaired in this parasite because there are no well-understood close relatives. To circumvent this approach we developed a network-based method that uses a heavy path network-mining algorithm. We uncovered the protein-protein associations that are implicated in important cellular processes including genome integrity, DNA repair, transcriptional regulation, invasion, and pathogenesis, thus demonstrating the utility of this method. The URL of the source code for super-sequence mining method is http://www.cs.utsa.edu/~korkmaz/research/heavy-path-mining/.


Subject(s)
Genome , Malaria/genetics , Plasmodium falciparum/genetics , Animals , Computational Biology , DNA Repair/genetics , Gene Expression Regulation , Malaria/parasitology , Plasmodium falciparum/pathogenicity , Systems Biology
5.
Int J Comput Biol Drug Des ; 7(4): 369-83, 2014.
Article in English | MEDLINE | ID: mdl-25539848

ABSTRACT

The heat shock response is a general mechanism by which organisms deal with physical insults such as sudden changes in temperature, osmotic and oxidative stresses, and exposure to toxic substances. Plasmodium falciparum is exposed to drastic temperature changes as a part of its life cycle and maintains an extensive repertoire of heat shock response-related proteins. As these proteins serve to maintain the parasite in the face of anti-malarial drugs as well, better understanding of the heat shock-related systems in the malaria parasite will lead to therapeutic approaches that frustrate these systems, leading to more effective use of anti-malarials. Here we use protein association networks to broaden our understanding of the systems impacted by and/or implicated in the heat shock response.


Subject(s)
Heat-Shock Proteins/metabolism , Heat-Shock Response/physiology , Models, Biological , Plasmodium falciparum/physiology , Protozoan Proteins/metabolism , Signal Transduction/physiology , Computer Simulation , Protein Interaction Mapping/methods
6.
Stand Genomic Sci ; 9(3): 1278-84, 2014 Jun 15.
Article in English | MEDLINE | ID: mdl-25197443

ABSTRACT

The Genomic Encyclopedia of Bacteria and Archaea (GEBA) project was launched by the JGI in 2007 as a pilot project with the objective of sequencing 250 bacterial and archaeal genomes. The two major goals of that project were (a) to test the hypothesis that there are many benefits to the use the phylogenetic diversity of organisms in the tree of life as a primary criterion for generating their genome sequence and (b) to develop the necessary framework, technology and organization for large-scale sequencing of microbial isolate genomes. While the GEBA pilot project has not yet been entirely completed, both of the original goals have already been successfully accomplished, leading the way for the next phase of the project. Here we propose taking the GEBA project to the next level, by generating high quality draft genomes for 1,000 bacterial and archaeal strains. This represents a combined 16-fold increase in both scale and speed as compared to the GEBA pilot project (250 isolate genomes in 4+ years). We will follow a similar approach for organism selection and sequencing prioritization as was done for the GEBA pilot project (i.e. phylogenetic novelty, availability and growth of cultures of type strains and DNA extraction capability), focusing on type strains as this ensures reproducibility of our results and provides the strongest linkage between genome sequences and other knowledge about each strain. In turn, this project will constitute a pilot phase of a larger effort that will target the genome sequences of all available type strains of the Bacteria and Archaea.

7.
BMC Bioinformatics ; 14 Suppl 12: S2, 2013.
Article in English | MEDLINE | ID: mdl-24267797

ABSTRACT

BACKGROUND: According to the World Health organization, half the world's population is at risk of contracting malaria. They estimated that in 2010 there were 219 million cases of malaria, resulting in 660,000 deaths and an enormous economic burden on the countries where malaria is endemic. The adoption of various high-throughput genomics-based techniques by malaria researchers has meant that new avenues to the study of this disease are being explored and new targets for controlling the disease are being developed. Here, we apply a novel neighborhood subnetwork alignment approach to identify the interacting elements that help regulate the cell cycle of the malaria parasite Plasmodium falciparum. RESULTS: Our novel subnetwork alignment approach was used to compare networks in Escherichia coli and P. falciparum. Some 574 P. falciparum proteins were revealed as functional orthologs of known cell cycle proteins in E. coli. Over one third of these predicted functional orthologs were annotated as "conserved Plasmodium proteins" or "putative uncharacterized proteins" of unknown function. The predicted functionalities included cyclins, kinases, surface antigens, transcriptional regulators and various functions related to DNA replication, repair and cell division. CONCLUSIONS: The results of our analysis demonstrate the power of our subnetwork alignment approach to assign functionality to previously unannotated proteins. Here, the focus was on proteins involved in cell cycle regulation. These proteins are involved in the control of diverse aspects of the parasite lifecycle and of important aspects of pathogenesis.


Subject(s)
Algorithms , Plasmodium falciparum/chemistry , Plasmodium falciparum/cytology , Protein Interaction Maps , Protozoan Proteins/metabolism , Cell Cycle , Escherichia coli/chemistry , Escherichia coli/metabolism , Plasmodium falciparum/metabolism , Protozoan Proteins/analysis
8.
Syst Appl Microbiol ; 36(1): 69-73, 2013 Feb.
Article in English | MEDLINE | ID: mdl-23410935

ABSTRACT

High quality 16S ribosomal RNA (rRNA) gene sequences from the type strains of all species with validly published names, as defined by the International Code of Nomenclature of Bacteria, are a prerequisite for their accurate affiliations within the global genealogical classification and for the recognition of potential new taxa. During the last few years, the Living Tree Project (LTP) has taken care to create a high quality, aligned 16S and 23S rRNA gene sequence database of all type strains. However, the manual curation of the sequence dataset and type strain information revealed that a total of 552 "orphan" species (about 5.7% of the currently classified species) had to be excluded from the reference trees. Among them, 322 type strains were not represented by an SSU entry in the public sequence repositories. The remaining 230 type strains had to be discarded due to bad sequence quality. Since 2010, the LTP team has coordinated a network of researchers and culture collections in order to improve the situation by (re)-sequencing the type strains of these "orphan" species. As a result, we can now report 351 16S rRNA gene sequences of type strains. Nevertheless, 201 species could not be sequenced because cultivable type strains were not available (121), the cultures had either been lost or were never deposited in the first place (66), or it was not possible due to other constraints (14). The International Code of Nomenclature of Bacteria provides a number of mechanisms to deal with the problem of missing type strains and we recommend that due consideration be given to the appropriate mechanisms in order to help solve some of these issues.


Subject(s)
Bacteria/classification , Bacteria/genetics , DNA, Bacterial/genetics , Phylogeny , RNA, Ribosomal, 16S/genetics , Sequence Analysis, DNA , Classification/methods , DNA, Bacterial/chemistry , DNA, Ribosomal/chemistry , DNA, Ribosomal/genetics
9.
BMC Syst Biol ; 6 Suppl 3: S5, 2012.
Article in English | MEDLINE | ID: mdl-23282319

ABSTRACT

BACKGROUND: Malaria causes over one million deaths annually, posing an enormous health and economic burden in endemic regions. The completion of genome sequencing of the causative agents, a group of parasites in the genus Plasmodium, revealed potential drug and vaccine candidates. However, genomics-driven target discovery has been significantly hampered by our limited knowledge of the cellular networks associated with parasite development and pathogenesis. In this paper, we propose an approach based on aligning neighborhood PPI subnetworks across species to identify network components in the malaria parasite P. falciparum. RESULTS: Instead of only relying on sequence similarities to detect functional orthologs, our approach measures the conservation between the neighborhood subnetworks in protein-protein interaction (PPI) networks in two species, P. falciparum and E. coli. 1,082 P. falciparum proteins were predicted as functional orthologs of known transcriptional regulators in the E. coli network, including general transcriptional regulators, parasite-specific transcriptional regulators in the ApiAP2 protein family, and other potential regulatory proteins. They are implicated in a variety of cellular processes involving chromatin remodeling, genome integrity, secretion, invasion, protein processing, and metabolism. CONCLUSIONS: In this proof-of-concept study, we demonstrate that a subnetwork alignment approach can reveal previously uncharacterized members of the subnetworks, which opens new opportunities to identify potential therapeutic targets and provide new insights into parasite biology, pathogenesis and virulence. This approach can be extended to other systems, especially those with poor genome annotation and a paucity of knowledge about cellular networks.


Subject(s)
Plasmodium falciparum/genetics , Protein Interaction Maps , Regulatory Elements, Transcriptional , Sequence Alignment , Chromatin Assembly and Disassembly , Chromosome Mapping , Escherichia coli/genetics , Escherichia coli/metabolism , Gene Expression Regulation , Genome, Protozoan , Genomics , Plasmodium falciparum/metabolism , Protozoan Proteins/genetics , Protozoan Proteins/metabolism , Systems Biology
10.
BMC Genomics ; 12 Suppl 5: S9, 2011 Dec 23.
Article in English | MEDLINE | ID: mdl-22369208

ABSTRACT

BACKGROUND: Malaria continues to be one of the most severe global infectious diseases, responsible for 1-2 million deaths yearly. The rapid evolution and spread of drug resistance in parasites has led to an urgent need for the development of novel antimalarial targets. Proteases are a group of enzymes that play essential roles in parasite growth and invasion. The possibility of designing specific inhibitors for proteases makes them promising drug targets. Previously, combining a comparative genomics approach and a machine learning approach, we identified the complement of proteases (degradome) in the malaria parasite Plasmodium falciparum and its sibling species 123, providing a catalog of targets for functional characterization and rational inhibitor design. Network analysis represents another route to revealing the role of proteins in the biology of parasites and we use this approach here to expand our understanding of the systems involving the proteases of P. falciparum. RESULTS: We investigated the roles of proteases in the parasite life cycle by constructing a network using protein-protein association data from the STRING database 4, and analyzing these data, in conjunction with the data from protein-protein interaction assays using the yeast 2-hybrid (Y2H) system 5, blood stage microarray experiments 678, proteomics 9101112, literature text mining, and sequence homology analysis. Seventy-seven (77) out of 124 predicted proteases were associated with at least one other protein, constituting 2,431 protein-protein interactions (PPIs). These proteases appear to play diverse roles in metabolism, cell cycle regulation, invasion and infection. Their degrees of connectivity (i.e., connections to other proteins), range from one to 143. The largest protease-associated sub-network is the ubiquitin-proteasome system which is crucial for protein recycling and stress response. Proteases are also implicated in heat shock response, signal peptide processing, cell cycle progression, transcriptional regulation, and signal transduction networks. CONCLUSIONS: Our network analysis of proteases from P. falciparum uses a so-called guilt-by-association approach to extract sets of proteins from the proteome that are candidates for further study. Novel protease targets and previously unrecognized members of the protease-associated sub-systems provide new insights into the mechanisms underlying parasitism, pathogenesis and virulence.


Subject(s)
Peptide Hydrolases/metabolism , Plasmodium falciparum/enzymology , Protozoan Proteins/metabolism , Animals , Data Mining , Databases, Protein , Erythrocytes/parasitology , Plasmodium falciparum/metabolism , Protein Array Analysis , Protein Interaction Mapping , Proteomics , Sequence Homology , Two-Hybrid System Techniques , Ubiquitins/metabolism
11.
BMC Genomics ; 11: 369, 2010 Jun 10.
Article in English | MEDLINE | ID: mdl-20537180

ABSTRACT

BACKGROUND: Species of the family Vibrionaceae are ubiquitous in marine environments. Several of these species are important pathogens of humans and marine species. Evidence indicates that genetic exchange plays an important role in the emergence of new pathogenic strains within this family. Data from the sequenced genomes of strains in this family could show how the genes encoded by all these strains, known as the pangenome, are distributed. Information about the core, accessory and panproteome of this family can show how, for example, genes encoding virulence-associated proteins are distributed and help us understand how virulence emerges. RESULTS: We deduced the complete set of orthologs for eleven strains from this family. The core proteome consists of 1,882 orthologous groups, which is 28% of the 6,629 orthologous groups in this family. There were 4,411 accessory orthologous groups (i.e., proteins that occurred in from 2 to 10 proteomes) and 5,584 unique proteins (encoded once on only one of the eleven genomes). Proteins that have been associated with virulence in V. cholerae were widely distributed across the eleven genomes, but the majority was found only on the genomes of the two V. cholerae strains examined. CONCLUSIONS: The proteomes are reflective of the differing evolutionary trajectories followed by different strains to similar phenotypes. The composition of the proteomes supports the notion that genetic exchange among species of the Vibrionaceae is widespread and that this exchange aids these species in adapting to their environments.


Subject(s)
Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Genes, Bacterial/genetics , Genomics , Vibrionaceae/genetics , Vibrionaceae/pathogenicity , Evolution, Molecular , Genetic Variation , Proteome/genetics , Proteome/metabolism , Sequence Homology, Amino Acid , Species Specificity
12.
J Bacteriol ; 191(20): 6262-72, 2009 Oct.
Article in English | MEDLINE | ID: mdl-19666715

ABSTRACT

Our views of the genes that drive phenotypes have generally been built up one locus or operon at a time. However, a given phenotype, such as virulence, is a multilocus phenomenon. To gain a more comprehensive view of the genes and interactions underlying a phenotype, we propose an approach that incorporates information from comparative genomics and network biology and illustrate it by examining the virulence phenotype of Vibrio cholerae O1 El Tor N16961. We assessed the associations among the virulence-associated proteins from Vibrio cholerae and all the other proteins from this bacterium using a functional-association network map. In the context of this map, we were able to identify 262 proteins that are functionally linked to the virulence-associated genes more closely than is typical of the proteins in this strain and 240 proteins that are functionally linked to the virulence-associated proteins with a confidence score greater than 0.9. The roles of these genes were investigated using functional information from online data sources, comparative genomics, and the relationships shown by the protein association map. We also incorporated core proteome data from the family Vibrionaceae; 35% of the virulence-associated proteins have orthologs among the 1,822 orthologous groups of proteins in the core proteome, indicating that they may be dual-role virulence genes or encode functions that have value outside the human host. This approach is a valuable tool in searching for novel functional associations and in investigating the relationship between genotype and phenotype.


Subject(s)
Genome, Bacterial , Genomics/methods , Vibrio cholerae/genetics , Vibrio cholerae/pathogenicity , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Gene Expression Profiling , Gene Expression Regulation, Bacterial/physiology , Humans , Vibrio cholerae/physiology , Virulence
13.
BMC Genomics ; 10 Suppl 1: S11, 2009 Jul 07.
Article in English | MEDLINE | ID: mdl-19594870

ABSTRACT

BACKGROUND: The steadily increasing number of prokaryotic genomes has accelerated the study of genome evolution; in particular, the availability of sets of genomes from closely related bacteria has facilitated the exploration of the mechanisms underlying genome plasticity. The family Vibrionaceae is found in the Gammaproteobacteria and is abundant in aquatic environments. Taxa from the family Vibrionaceae are diversified in their life styles; some species are free living, others are symbiotic, and others are human pathogens. This diversity makes this family a useful set of model organisms for studying bacterial evolution. This evolution is driven by several forces, among them gene duplication and lateral gene transfer, which are believed to provide raw material for functional redundancy and novelty. The resultant gene copy increase in one genome is then detected as lineage-specific expansion (LSE). RESULTS: Here we present the results of a detailed comparison of the genomes of eleven Vibrionaceae strains that have distinct life styles and distinct phenotypes. The core genome shared by all eleven strains is composed of 1,882 genes, which make up about 31%-50% of the genome repertoire. We further investigated the distribution and features of genes that have been specifically expanded in one unique lineage of the eleven strains. Abundant duplicate genes have been identified in the eleven Vibrionaceae strains, with 1-11% of the whole genomes composed lineage specific radiations. These LSEs occurred in two distinct patterns: the first type yields one or more copies of a single gene; we call this a single gene expansion. The second pattern has a high evolutionary impact, as the expansion involves two or more gene copies in a block, with the duplicated block located next to the original block (a contiguous block expansion) or at some distance from the original block (a discontiguous block expansion). We showed that LSEs involve genes that are tied to defense and pathogenesis mechanisms as well as in the fundamental life cycle of Vibrionaceae species. CONCLUSION: Our results provide evidence of genome plasticity and rapid evolution within the family Vibrionaceae. The comparisons point to sources of genomic variation and candidates for lineage-specific adaptations of each Vibrionaceae pathogen or nonpathogen strain. Such lineage specific expansions could reveal components in bacterial systems that, by their enhanced genetic variability, can be tied to responses to environmental challenges, interesting phenotypes, or adaptive pathogenic responses to host challenges.


Subject(s)
Evolution, Molecular , Genome, Bacterial , Vibrionaceae/genetics , Comparative Genomic Hybridization , Sequence Alignment , Sequence Analysis, DNA , Species Specificity
14.
Bioscience ; 59(2): 113-125, 2009 Feb 11.
Article in English | MEDLINE | ID: mdl-20157346

ABSTRACT

There are hundreds of Biological Resource Centers (BRCs) around the world, holding many little-studied microorganism. The proportion of bacterial strains that is well represented in the sequence and literature databases may be as low as 1%. This body of unexplored diversity represents an untapped source of useful strains and derived products. However, a modicum of phenotypic data is available for almost all the bacterial strains held by BRCs around the world. It is at the phenotypic level that our knowledge of the well-studied strains of bacteria and the many yet-to-be studied strains intersects. This suggests we might leverage the phenotypic data from the data-poor bacteria with the omics data from the data-rich bacteria, using our knowledge of their evolutionary relationships, to map the metabolic networks of the little-known bacteria. This systems biology-based approach is a new way to explore the diversity harbored in BRCs.

15.
PLoS One ; 2(7): e667, 2007 Aug 01.
Article in English | MEDLINE | ID: mdl-17668050

ABSTRACT

BACKGROUND: For decades it has been recognized that neutrophilic Fe-oxidizing bacteria (FeOB) are associated with hydrothermal venting of Fe(II)-rich fluids associated with seamounts in the world's oceans. The evidence was based almost entirely on the mineralogical remains of the microbes, which themselves had neither been brought into culture or been assigned to a specific phylogenetic clade. We have used both cultivation and cultivation-independent techniques to study Fe-rich microbial mats associated with hydrothermal venting at Loihi Seamount, a submarine volcano. METHODOLOGY/PRINCIPLE FINDINGS: Using gradient enrichment techniques, two iron-oxidizing bacteria, strains PV-1 and JV-1, were isolated. Chemolithotrophic growth was observed under microaerobic conditions; Fe(II) and Fe(0) were the only energy sources that supported growth. Both strains produced filamentous stalk-like structures composed of multiple nanometer sized fibrils of Fe-oxyhydroxide. These were consistent with mineralogical structures found in the iron mats. Phylogenetic analysis of the small subunit (SSU) rRNA gene demonstrated that strains PV-1 and JV-1 were identical and formed a monophyletic group deeply rooted within the Proteobacteria. The most similar sequence (85.3% similarity) from a cultivated isolate came from Methylophaga marina. Phylogenetic analysis of the RecA and GyrB protein sequences confirmed that these strains are distantly related to other members of the Proteobacteria. A cultivation-independent analysis of the SSU rRNA gene by terminal-restriction fragment (T-RF) profiling showed that this phylotype was most common in a variety of microbial mats collected at different times and locations at Loihi. CONCLUSIONS: On the basis of phylogenetic and physiological data, it is proposed that isolate PV-1(T) ( = ATCC BAA-1019: JCM 14766) represents the type strain of a novel species in a new genus, Mariprofundus ferrooxydans gen. nov., sp. nov. Furthermore, the strain is the first cultured representative of a new candidatus class of the Proteobacteria that is widely distributed in deep-sea environments, Candidatus zeta (zeta)-Proteobacteria cl. nov.


Subject(s)
Iron/metabolism , Proteobacteria/genetics , Proteobacteria/metabolism , Aerobiosis , Culture Media , Fatty Acids/metabolism , Gallionellaceae/genetics , Gallionellaceae/growth & development , Gallionellaceae/metabolism , Methylobacteriaceae/classification , Methylobacteriaceae/genetics , Methylobacteriaceae/growth & development , Methylobacteriaceae/metabolism , Oxidation-Reduction , Phylogeny , Proteobacteria/classification , Proteobacteria/growth & development , RNA, Bacterial/genetics , RNA, Ribosomal/genetics , Seawater
16.
J Exp Zool B Mol Dev Evol ; 308(1): 37-49, 2007 Jan 15.
Article in English | MEDLINE | ID: mdl-16838303

ABSTRACT

The steadily increasing number of prokaryotic genomes has accelerated the study of genome evolution; in particular, the availability of sets of genomes from closely related bacteria has made exploration of questions surrounding the evolution of pathogenesis tractable. Here we present the results of a detailed comparison of the genomes of Yersinia pseudotuberculosis IP32593 and three strains of Yersinia pestis (CO92, KIM10, and 91001). There appear to be between 241 and 275 multigene families in these organisms. There are 2,568 genes that are identical in the three Y. pestis strains, but differ from the Y. pseudotuberculosis strain. The changes found in some of these families, such as the kinases, proteases, and transporters, are illustrative of how the evolutionary jump from the free-living enteropathogen Y. pseudotuberculosis to the obligate host-borne blood pathogen Y. pestis was achieved. We discuss the composition of some of the most important families and discuss the observed divergence between Y. pseudotuberculosis and Y. pestis homologs.


Subject(s)
Evolution, Molecular , Genetic Variation , Genome, Bacterial/genetics , Yersinia pestis/genetics , Yersinia pseudotuberculosis/genetics , Base Sequence , DNA Transposable Elements/genetics , Molecular Sequence Data , Multigene Family/genetics , Species Specificity
17.
Brief Bioinform ; 7(2): 186-95, 2006 Jun.
Article in English | MEDLINE | ID: mdl-16772262

ABSTRACT

We review the resources available to systematic biologists who wish to use computers to build classifications. Algorithm development is in an early stage, and only a few examples of integrated applications for systematic biology are available. The availability of data is crucial if systematic biology is to enter the computer age.


Subject(s)
Biological Evolution , Chromosome Mapping/methods , Databases, Genetic , Genetics, Population , Models, Genetic , Phylogeny , Systems Biology/methods , Algorithms , Database Management Systems , Information Storage and Retrieval/methods , Programming Languages , Systems Biology/trends
18.
Bioinformatics ; 21(10): 2309-14, 2005 May 15.
Article in English | MEDLINE | ID: mdl-15731209

ABSTRACT

MOTIVATION: Rapid, automated means of organizing biological data are required if we hope to keep abreast of the flood of data emanating from sequencing, microarray and similar high-throughput analyses. Faced with the need to validate the annotation of thousands of sequences and to generate biologically meaningful classifications based on the sequence data, we turned to statistical methods in order to automate these processes. RESULTS: An algorithm for automated classification based on evolutionary distance data was written in S. The algorithm was tested on a dataset of 1436 small subunit ribosomal RNA sequences and was able to classify the sequences according to an extant scheme, use statistical measurements of group membership to detect sequences that were misclassified within this scheme and produce a new classification. In this study, the use of the algorithm to address problems in prokaryotic taxonomy is discussed. AVAILABILITY: S-Plus is available from Insightful, Inc. An S-Plus implementation of the algorithm and the associated data are available at http://taxoweb.mmg.msu.edu/datasets


Subject(s)
Algorithms , Gammaproteobacteria/genetics , Gene Expression Profiling/methods , Information Storage and Retrieval/methods , RNA, Ribosomal/genetics , Sequence Analysis, RNA/methods , User-Computer Interface , Artificial Intelligence , Cluster Analysis , Computer Graphics , Computer Simulation , Data Interpretation, Statistical , Databases, Genetic , Models, Biological , Pattern Recognition, Automated/methods , Sequence Alignment/methods , Software
19.
Int J Syst Evol Microbiol ; 54(Pt 1): 7-13, 2004 Jan.
Article in English | MEDLINE | ID: mdl-14742453

ABSTRACT

Techniques drawn from exploratory data analysis, using tools found in the S-Plus statistical software package, have been used to inspect and maintain the Bergey's Taxonomic Outline and to move towards an automated and community-based means of working on the outline. These techniques can be used to classify sequences from unnamed and uncultured organisms, to visualize errors in the taxonomy or in the curation of the sequences, to suggest emendations to the taxonomy or to the classification of extant species and to complement the visualization of phylogenies based on treeing methods. A dataset of more than 9200 aligned small-subunit rRNA sequences was analysed in the context of the current taxonomic outline. The use of the algorithm in exploring and modifying the taxonomy is illustrated with an example drawn from the family Comamonadaceae.


Subject(s)
Bacteria/classification , Classification , Phylogeny , Terminology as Topic
SELECTION OF CITATIONS
SEARCH DETAIL
...