Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 38
Filter
1.
Cell ; 173(1): 208-220.e20, 2018 03 22.
Article in English | MEDLINE | ID: mdl-29551265

ABSTRACT

Conjugative transposition drives the emergence of multidrug resistance in diverse bacterial pathogens, yet the mechanisms are poorly characterized. The Tn1549 conjugative transposon propagates resistance to the antibiotic vancomycin used for severe drug-resistant infections. Here, we present four high-resolution structures of the conserved Y-transposase of Tn1549 complexed with circular transposon DNA intermediates. The structures reveal individual transposition steps and explain how specific DNA distortion and cleavage mechanisms enable DNA strand exchange with an absolute minimum homology requirement. This appears to uniquely allow Tn916-like conjugative transposons to bypass DNA homology and insert into diverse genomic sites, expanding gene transfer. We further uncover a structural regulatory mechanism that prevents premature cleavage of the transposon DNA before a suitable target DNA is found and generate a peptide antagonist that interferes with the transposase-DNA structure to block transposition. Our results reveal mechanistic principles of conjugative transposition that could help control the spread of antibiotic resistance genes.


Subject(s)
DNA, Bacterial/metabolism , Transposases/metabolism , Amino Acid Sequence , Base Sequence , Binding Sites , Catalytic Domain , Crystallography, X-Ray , DNA Cleavage , DNA Transposable Elements/genetics , DNA, Bacterial/chemistry , Drug Resistance, Bacterial , Enterococcus faecalis/genetics , Models, Molecular , Molecular Dynamics Simulation , Mutagenesis, Site-Directed , Nucleic Acid Conformation , Protein Binding , Protein Structure, Tertiary , Recombinant Proteins/biosynthesis , Recombinant Proteins/chemistry , Recombinant Proteins/isolation & purification , Sequence Alignment , Transposases/antagonists & inhibitors , Transposases/chemistry , Transposases/genetics
2.
Bioinformatics ; 34(2): 323-329, 2018 Jan 15.
Article in English | MEDLINE | ID: mdl-28968857

ABSTRACT

The Quest for Orthologs (QfO) is an open collaboration framework for experts in comparative phylogenomics and related research areas who have an interest in highly accurate orthology predictions and their applications. We here report highlights and discussion points from the QfO meeting 2015 held in Barcelona. Achievements in recent years have established a basis to support developments for improved orthology prediction and to explore new approaches. Central to the QfO effort is proper benchmarking of methods and services, as well as design of standardized datasets and standardized formats to allow sharing and comparison of results. Simultaneously, analysis pipelines have been improved, evaluated and adapted to handle large datasets. All this would not have occurred without the long-term collaboration of Consortium members. Meeting regularly to review and coordinate complementary activities from a broad spectrum of innovative researchers clearly benefits the community. Highlights of the meeting include addressing sources of and legitimacy of disagreements between orthology calls, the context dependency of orthology definitions, special challenges encountered when analyzing very anciently rooted orthologies, orthology in the light of whole-genome duplications, and the concept of orthologous versus paralogous relationships at different levels, including domain-level orthology. Furthermore, particular needs for different applications (e.g. plant genomics, ancient gene families and others) and the infrastructure for making orthology inferences available (e.g. interfaces with model organism databases) were discussed, with several ongoing efforts that are expected to be reported on during the upcoming 2017 QfO meeting.

3.
Mol Syst Biol ; 13(12): 960, 2017 12 14.
Article in English | MEDLINE | ID: mdl-29242367

ABSTRACT

Population genomics of prokaryotes has been studied in depth in only a small number of primarily pathogenic bacteria, as genome sequences of isolates of diverse origin are lacking for most species. Here, we conducted a large-scale survey of population structure in prevalent human gut microbial species, sampled from their natural environment, with a culture-independent metagenomic approach. We examined the variation landscape of 71 species in 2,144 human fecal metagenomes and found that in 44 of these, accounting for 72% of the total assigned microbial abundance, single-nucleotide variation clearly indicates the existence of sub-populations (here termed subspecies). A single subspecies (per species) usually dominates within each host, as expected from ecological theory. At the global scale, geographic distributions of subspecies differ between phyla, with Firmicutes subspecies being significantly more geographically restricted. To investigate the functional significance of the delineated subspecies, we identified genes that consistently distinguish them in a manner that is independent of reference genomes. We further associated these subspecies-specific genes with properties of the microbial community and the host. For example, two of the three Eubacterium rectale subspecies consistently harbor an accessory pro-inflammatory flagellum operon that is associated with lower gut community diversity, higher host BMI, and higher blood fasting insulin levels. Using an additional 676 human oral samples, we further demonstrate the existence of niche specialized subspecies in the different parts of the oral cavity. Taken together, we provide evidence for subspecies in the majority of abundant gut prokaryotes, leading to a better functional and ecological understanding of the human gut microbiome in conjunction with its host.


Subject(s)
Gastrointestinal Microbiome , Microbiota , Escherichia coli/physiology , Gastrointestinal Microbiome/genetics , Genes, Bacterial , Humans , Microbiota/genetics , Phenotype , Phylogeography , Species Specificity
4.
Nature ; 551(7682): 585-589, 2017 11 30.
Article in English | MEDLINE | ID: mdl-29143823

ABSTRACT

A Western lifestyle with high salt consumption can lead to hypertension and cardiovascular disease. High salt may additionally drive autoimmunity by inducing T helper 17 (TH17) cells, which can also contribute to hypertension. Induction of TH17 cells depends on gut microbiota; however, the effect of salt on the gut microbiome is unknown. Here we show that high salt intake affects the gut microbiome in mice, particularly by depleting Lactobacillus murinus. Consequently, treatment of mice with L. murinus prevented salt-induced aggravation of actively induced experimental autoimmune encephalomyelitis and salt-sensitive hypertension by modulating TH17 cells. In line with these findings, a moderate high-salt challenge in a pilot study in humans reduced intestinal survival of Lactobacillus spp., increased TH17 cells and increased blood pressure. Our results connect high salt intake to the gut-immune axis and highlight the gut microbiome as a potential therapeutic target to counteract salt-sensitive conditions.


Subject(s)
Gastrointestinal Microbiome/drug effects , Lactobacillus/drug effects , Lactobacillus/isolation & purification , Sodium Chloride/pharmacology , Th17 Cells/drug effects , Th17 Cells/immunology , Animals , Autoimmunity/drug effects , Blood Pressure/drug effects , Disease Models, Animal , Encephalomyelitis, Autoimmune, Experimental/chemically induced , Encephalomyelitis, Autoimmune, Experimental/microbiology , Encephalomyelitis, Autoimmune, Experimental/pathology , Encephalomyelitis, Autoimmune, Experimental/therapy , Feces/microbiology , Humans , Hypertension/chemically induced , Indoleacetic Acids/metabolism , Indoles/metabolism , Intestines/cytology , Intestines/drug effects , Intestines/immunology , Intestines/microbiology , Lactobacillus/immunology , Lymphocyte Activation/drug effects , Lymphocyte Count , Male , Mice , Pilot Projects , Sodium Chloride/administration & dosage , Symbiosis , Th17 Cells/cytology , Tryptophan/metabolism
5.
Mol Biol Evol ; 34(8): 2115-2122, 2017 08 01.
Article in English | MEDLINE | ID: mdl-28460117

ABSTRACT

Orthology assignment is ideally suited for functional inference. However, because predicting orthology is computationally intensive at large scale, and most pipelines are relatively inaccessible (e.g., new assignments only available through database updates), less precise homology-based functional transfer is still the default for (meta-)genome annotation. We, therefore, developed eggNOG-mapper, a tool for functional annotation of large sets of sequences based on fast orthology assignments using precomputed clusters and phylogenies from the eggNOG database. To validate our method, we benchmarked Gene Ontology (GO) predictions against two widely used homology-based approaches: BLAST and InterProScan. Orthology filters applied to BLAST results reduced the rate of false positive assignments by 11%, and increased the ratio of experimentally validated terms recovered over all terms assigned per protein by 15%. Compared with InterProScan, eggNOG-mapper achieved similar proteome coverage and precision while predicting, on average, 41 more terms per protein and increasing the rate of experimentally validated terms recovered over total term assignments per protein by 35%. EggNOG-mapper predictions scored within the top-5 methods in the three GO categories using the CAFA2 NK-partial benchmark. Finally, we evaluated eggNOG-mapper for functional annotation of metagenomics data, yielding better performance than interProScan. eggNOG-mapper runs ∼15× faster than BLAST and at least 2.5× faster than InterProScan. The tool is available standalone and as an online service at http://eggnog-mapper.embl.de.


Subject(s)
Sequence Alignment/methods , Sequence Analysis, Protein/methods , Algorithms , Computer Simulation , Databases, Genetic , Databases, Protein , Gene Ontology , Genome/genetics , Phylogeny , Sequence Alignment/statistics & numerical data , Software
7.
Bioinformatics ; 33(16): 2594-2595, 2017 Aug 15.
Article in English | MEDLINE | ID: mdl-28398468

ABSTRACT

MOTIVATION: The rapidly expanding microbiomics field is generating increasingly larger datasets, characterizing the microbiota in diverse environments. Although classical numerical ecology methods provide a robust statistical framework for their analysis, software currently available is inadequate for large datasets and some computationally intensive tasks, like rarefaction and associated analysis. RESULTS: Here we present a software package for rarefaction analysis of large count matrices, as well as estimation and visualization of diversity, richness and evenness. Our software is designed for ease of use, operating at least 7x faster than existing solutions, despite requiring 10x less memory. AVAILABILITY AND IMPLEMENTATION: C ++ and R source code (GPL v.2) as well as binaries are available from https://github.com/hildebra/Rarefaction and from CRAN (https://cran.r-project.org/). CONTACT: bork@embl.de or falk.hildebrand@embl.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Computational Biology/methods , Microbiota , Software , Environment
8.
Am J Physiol Gastrointest Liver Physiol ; 312(4): G327-G339, 2017 Apr 01.
Article in English | MEDLINE | ID: mdl-28039159

ABSTRACT

Current treatment for pediatric inflammatory bowel disease (IBD) patients is often ineffective, with serious side effects. Manipulating the gut microbiota via fecal microbiota transplantation (FMT) is an emerging treatment approach but remains controversial. We aimed to assess the composition of the fecal microbiome through a comparison of pediatric IBD patients to their healthy siblings, evaluating risks and prospects for FMT in this setting. A case-control (sibling) study was conducted analyzing fecal samples of six children with Crohn's disease (CD), six children with ulcerative colitis (UC) and 12 healthy siblings by metagenomic sequencing. In addition, lifetime antibiotic intake was retrospectively determined. Species richness and diversity were significantly reduced in UC patients compared with control [Mann-Whitney U-test false discovery rate (MWU FDR) = 0.011]. In UC, bacteria positively influencing gut homeostasis, e.g., Eubacterium rectale and Faecalibacterium prausnitzii, were significantly reduced in abundance (MWU FDR = 0.05). Known pathobionts like Escherichia coli were enriched in UC patients (MWU FDR = 0.084). Moreover, E. coli abundance correlated positively with that of several virulence genes (SCC > 0.65, FDR < 0.1). A shift toward antibiotic-resistant taxa in both IBD groups distinguished them from controls [MWU Benjamini-Hochberg-Yekutieli procedure (BY) FDR = 0.062 in UC, MWU BY FDR = 0.019 in CD). The collected results confirm a microbial dysbiosis in pediatric UC, and to a lesser extent in CD patients, replicating associations found previously using different methods. Taken together, these observations suggest microbiotal remodeling therapy from family donors, at least for children with UC, as a viable option.NEW & NOTEWORTHY In this sibling study, prior reports of microbial dysbiosis in IBD patients from 16S rRNA sequencing was verified using deep shotgun sequencing and augmented with insights into the abundance of bacterial virulence genes and bacterial antibiotic resistance determinants, seen against the background of data on the specific antibiotic intake of each of the study participants. The observed dysbiosis, which distinguishes patients from siblings, highlights such siblings as potential donors for microbiotal remodeling therapy in IBD.


Subject(s)
Feces/microbiology , Gastrointestinal Microbiome , Inflammatory Bowel Diseases/microbiology , Intestinal Mucosa/microbiology , Metagenome , Adolescent , Child , Female , Humans , Male , Siblings , Young Adult
9.
Nucleic Acids Res ; 45(D1): D529-D534, 2017 01 04.
Article in English | MEDLINE | ID: mdl-28053165

ABSTRACT

The availability of microbial genomes has opened many new avenues of research within microbiology. This has been driven primarily by comparative genomics approaches, which rely on accurate and consistent characterization of genomic sequences. It is nevertheless difficult to obtain consistent taxonomic and integrated functional annotations for defined prokaryotic clades. Thus, we developed proGenomes, a resource that provides user-friendly access to currently 25 038 high-quality genomes whose sequences and consistent annotations can be retrieved individually or by taxonomic clade. These genomes are assigned to 5306 consistent and accurate taxonomic species clusters based on previously established methodology. proGenomes also contains functional information for almost 80 million protein-coding genes, including a comprehensive set of general annotations and more focused annotations for carbohydrate-active enzymes and antibiotic resistance genes. Additionally, broad habitat information is provided for many genomes. All genomes and associated information can be downloaded by user-selected clade or multiple habitat-specific sets of representative genomes. We expect that the availability of high-quality genomes with comprehensive functional annotations will promote advances in clinical microbial genomics, functional evolution and other subfields of microbiology. proGenomes is available at http://progenomes.embl.de.


Subject(s)
Computational Biology/methods , DNA Barcoding, Taxonomic/methods , Genome , Genomics/methods , Prokaryotic Cells , Databases, Genetic , Molecular Sequence Annotation , Web Browser
10.
Nature ; 535(7612): 376-81, 2016 07 21.
Article in English | MEDLINE | ID: mdl-27409811

ABSTRACT

Insulin resistance is a forerunner state of ischaemic cardiovascular disease and type 2 diabetes. Here we show how the human gut microbiome impacts the serum metabolome and associates with insulin resistance in 277 non-diabetic Danish individuals. The serum metabolome of insulin-resistant individuals is characterized by increased levels of branched-chain amino acids (BCAAs), which correlate with a gut microbiome that has an enriched biosynthetic potential for BCAAs and is deprived of genes encoding bacterial inward transporters for these amino acids. Prevotella copri and Bacteroides vulgatus are identified as the main species driving the association between biosynthesis of BCAAs and insulin resistance, and in mice we demonstrate that P. copri can induce insulin resistance, aggravate glucose intolerance and augment circulating levels of BCAAs. Our findings suggest that microbial targets may have the potential to diminish insulin resistance and reduce the incidence of common metabolic and cardiovascular disorders.


Subject(s)
Gastrointestinal Microbiome/physiology , Insulin Resistance , Metabolome , Serum/metabolism , Amino Acids, Branched-Chain/biosynthesis , Amino Acids, Branched-Chain/metabolism , Animals , Bacteroides/physiology , Cardiovascular Diseases/metabolism , Cardiovascular Diseases/microbiology , Fasting/blood , Fasting/metabolism , Glucose Intolerance/blood , Glucose Intolerance/microbiology , Humans , Male , Metagenome , Mice , Mice, Inbred C57BL , Netherlands , Prevotella/physiology
11.
Bioinformatics ; 32(17): 2636-41, 2016 09 01.
Article in English | MEDLINE | ID: mdl-27256311

ABSTRACT

MOTIVATION: Over the last decades, vast numbers of sequences were deposited in public databases. Bioinformatics tools allow homology and consequently functional inference for these sequences. New profile-based homology search tools have been introduced, allowing reliable detection of remote homologs, but have not been systematically benchmarked. To provide such a comparison, which can guide bioinformatics workflows, we extend and apply our previously developed benchmark approach to evaluate the 'next generation' of profile-based approaches, including CS-BLAST, HHSEARCH and PHMMER, in comparison with the non-profile based search tools NCBI-BLAST, USEARCH, UBLAST and FASTA. METHOD: We generated challenging benchmark datasets based on protein domain architectures within either the PFAM + Clan, SCOP/Superfamily or CATH/Gene3D domain definition schemes. From each dataset, homologous and non-homologous protein pairs were aligned using each tool, and standard performance metrics calculated. We further measured congruence of domain architecture assignments in the three domain databases. RESULTS: CSBLAST and PHMMER had overall highest accuracy. FASTA, UBLAST and USEARCH showed large trade-offs of accuracy for speed optimization. CONCLUSION: Profile methods are superior at inferring remote homologs but the difference in accuracy between methods is relatively small. PHMMER and CSBLAST stand out with the highest accuracy, yet still at a reasonable computational cost. Additionally, we show that less than 0.1% of Swiss-Prot protein pairs considered homologous by one database are considered non-homologous by another, implying that these classifications represent equivalent underlying biological phenomena, differing mostly in coverage and granularity. AVAILABILITY AND IMPLEMENTATION: Benchmark datasets and all scripts are placed at (http://sonnhammer.org/download/Homology_benchmark). CONTACT: forslund@embl.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Benchmarking , Databases, Protein , Sequence Homology , High-Throughput Nucleotide Sequencing , Proteins , Sequence Homology, Amino Acid
12.
Bioinformatics ; 32(16): 2520-3, 2016 08 15.
Article in English | MEDLINE | ID: mdl-27153620

ABSTRACT

UNLABELLED: MOCAT2 is a software pipeline for metagenomic sequence assembly and gene prediction with novel features for taxonomic and functional abundance profiling. The automated generation and efficient annotation of non-redundant reference catalogs by propagating pre-computed assignments from 18 databases covering various functional categories allows for fast and comprehensive functional characterization of metagenomes. AVAILABILITY AND IMPLEMENTATION: MOCAT2 is implemented in Perl 5 and Python 2.7, designed for 64-bit UNIX systems and offers support for high-performance computer usage via LSF, PBS or SGE queuing systems; source code is freely available under the GPL3 license at http://mocat.embl.de CONTACT: : bork@embl.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Metagenomics , Software , Databases, Factual , Metagenome
13.
Nat Methods ; 13(5): 425-30, 2016 05.
Article in English | MEDLINE | ID: mdl-27043882

ABSTRACT

Achieving high accuracy in orthology inference is essential for many comparative, evolutionary and functional genomic analyses, yet the true evolutionary history of genes is generally unknown and orthologs are used for very different applications across phyla, requiring different precision-recall trade-offs. As a result, it is difficult to assess the performance of orthology inference methods. Here, we present a community effort to establish standards and an automated web-based service to facilitate orthology benchmarking. Using this service, we characterize 15 well-established inference methods and resources on a battery of 20 different benchmarks. Standardized benchmarking provides a way for users to identify the most effective methods for the problem at hand, sets a minimum requirement for new tools and resources, and guides the development of more accurate orthology inference methods.


Subject(s)
Computational Biology/standards , Genomics/standards , Phylogeny , Proteomics/standards , Archaea/classification , Archaea/genetics , Bacteria/classification , Bacteria/genetics , Computational Biology/methods , Databases, Genetic , Eukaryota/classification , Eukaryota/genetics , Gene Ontology , Genomics/methods , Models, Genetic , Proteomics/methods , Sequence Analysis, Protein , Sequence Homology , Species Specificity
14.
Nat Commun ; 7: 10410, 2016 Jan 26.
Article in English | MEDLINE | ID: mdl-26811868

ABSTRACT

Early-life antibiotic use is associated with increased risk for metabolic and immunological diseases, and mouse studies indicate a causal role of the disrupted microbiome. However, little is known about the impacts of antibiotics on the developing microbiome of children. Here we use phylogenetics, metagenomics and individual antibiotic purchase records to show that macrolide use in 2-7 year-old Finnish children (N=142; sampled at two time points) is associated with a long-lasting shift in microbiota composition and metabolism. The shift includes depletion of Actinobacteria, increase in Bacteroidetes and Proteobacteria, decrease in bile-salt hydrolase and increase in macrolide resistance. Furthermore, macrolide use in early life is associated with increased risk of asthma and predisposes to antibiotic-associated weight gain. Overweight and asthmatic children have distinct microbiota compositions. Penicillins leave a weaker mark on the microbiota than macrolides. Our results support the idea that, without compromising clinical practice, the impact on the intestinal microbiota should be considered when prescribing antibiotics.


Subject(s)
Anti-Bacterial Agents/therapeutic use , Bacteria/isolation & purification , Bacterial Infections/drug therapy , Gastrointestinal Microbiome/drug effects , Macrolides/therapeutic use , Anti-Bacterial Agents/adverse effects , Bacteria/classification , Bacteria/genetics , Child , Child, Preschool/statistics & numerical data , Cohort Studies , Day Care, Medical/statistics & numerical data , Feces/microbiology , Female , Finland , Humans , Macrolides/adverse effects , Male
15.
Nucleic Acids Res ; 44(D1): D286-93, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26582926

ABSTRACT

eggNOG is a public resource that provides Orthologous Groups (OGs) of proteins at different taxonomic levels, each with integrated and summarized functional annotations. Developments since the latest public release include changes to the algorithm for creating OGs across taxonomic levels, making nested groups hierarchically consistent. This allows for a better propagation of functional terms across nested OGs and led to the novel annotation of 95 890 previously uncharacterized OGs, increasing overall annotation coverage from 67% to 72%. The functional annotations of OGs have been expanded to also provide Gene Ontology terms, KEGG pathways and SMART/Pfam domains for each group. Moreover, eggNOG now provides pairwise orthology relationships within OGs based on analysis of phylogenetic trees. We have also incorporated a framework for quickly mapping novel sequences to OGs based on precomputed HMM profiles. Finally, eggNOG version 4.5 incorporates a novel data set spanning 2605 viral OGs, covering 5228 proteins from 352 viral proteomes. All data are accessible for bulk downloading, as a web-service, and through a completely redesigned web interface. The new access points provide faster searches and a number of new browsing and visualization capabilities, facilitating the needs of both experts and less experienced users. eggNOG v4.5 is available at http://eggnog.embl.de.


Subject(s)
Databases, Protein , Molecular Sequence Annotation , Sequence Analysis, Protein , Algorithms , Archaeal Proteins/chemistry , Bacterial Proteins/chemistry , Eukaryota , Phylogeny , Proteome/chemistry , Viral Proteins/chemistry
16.
Nature ; 528(7581): 262-266, 2015 Dec 10.
Article in English | MEDLINE | ID: mdl-26633628

ABSTRACT

In recent years, several associations between common chronic human disorders and altered gut microbiome composition and function have been reported. In most of these reports, treatment regimens were not controlled for and conclusions could thus be confounded by the effects of various drugs on the microbiota, which may obscure microbial causes, protective factors or diagnostically relevant signals. Our study addresses disease and drug signatures in the human gut microbiome of type 2 diabetes mellitus (T2D). Two previous quantitative gut metagenomics studies of T2D patients that were unstratified for treatment yielded divergent conclusions regarding its associated gut microbial dysbiosis. Here we show, using 784 available human gut metagenomes, how antidiabetic medication confounds these results, and analyse in detail the effects of the most widely used antidiabetic drug metformin. We provide support for microbial mediation of the therapeutic effects of metformin through short-chain fatty acid production, as well as for potential microbiota-mediated mechanisms behind known intestinal adverse effects in the form of a relative increase in abundance of Escherichia species. Controlling for metformin treatment, we report a unified signature of gut microbiome shifts in T2D with a depletion of butyrate-producing taxa. These in turn cause functional microbiome shifts, in part alleviated by metformin-induced changes. Overall, the present study emphasizes the need to disentangle gut microbiota signatures of specific human diseases from those of medication.


Subject(s)
Diabetes Mellitus, Type 2/microbiology , Gastrointestinal Microbiome/drug effects , Gastrointestinal Microbiome/physiology , Metformin/pharmacology , Biodiversity , Diabetes Mellitus, Type 2/drug therapy , Female , Gastrointestinal Microbiome/genetics , Humans , Hypoglycemic Agents/pharmacology , Hypoglycemic Agents/therapeutic use , Male , Metagenome/drug effects , Metagenome/physiology , Metformin/therapeutic use , RNA, Ribosomal, 16S/genetics
17.
Genome Biol Evol ; 7(7): 1988-99, 2015 Jul 01.
Article in English | MEDLINE | ID: mdl-26133389

ABSTRACT

Quest for Orthologs (QfO) is a community effort with the goal to improve and benchmark orthology predictions. As quality assessment assumes prior knowledge on species phylogenies, we investigated the congruency between existing species trees by comparing the relationships of 147 QfO reference organisms from six Tree of Life (ToL)/species tree projects: The National Center for Biotechnology Information (NCBI) taxonomy, Opentree of Life, the sequenced species/species ToL, the 16S ribosomal RNA (rRNA) database, and trees published by Ciccarelli et al. (Ciccarelli FD, et al. 2006. Toward automatic reconstruction of a highly resolved tree of life. Science 311:1283-1287) and by Huerta-Cepas et al. (Huerta-Cepas J, Marcet-Houben M, Gabaldon T. 2014. A nested phylogenetic reconstruction approach provides scalable resolution in the eukaryotic Tree Of Life. PeerJ PrePrints 2:223) Our study reveals that each species tree suggests a different phylogeny: 87 of the 146 (60%) possible splits of a dichotomous and rooted tree are congruent, while all other splits are incongruent in at least one of the species trees. Topological differences are observed not only at deep speciation events, but also within younger clades, such as Hominidae, Rodentia, Laurasiatheria, or rosids. The evolutionary relationships of 27 archaea and bacteria are highly inconsistent. By assessing 458,108 gene trees from 65 genomes, we show that consistent species topologies are more often supported by gene phylogenies than contradicting ones. The largest concordant species tree includes 77 of the QfO reference organisms at the most. Results are summarized in the form of a consensus ToL (http://swisstree.vital-it.ch/species_tree) that can serve different benchmarking purposes.


Subject(s)
Phylogeny , Archaea/classification , Archaea/genetics , Bacteria/classification , Bacteria/genetics , Eukaryota/classification , Eukaryota/genetics , Genes
18.
Nucleic Acids Res ; 43(Database issue): D447-52, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25352553

ABSTRACT

The many functional partnerships and interactions that occur between proteins are at the core of cellular processing and their systematic characterization helps to provide context in molecular systems biology. However, known and predicted interactions are scattered over multiple resources, and the available data exhibit notable differences in terms of quality and completeness. The STRING database (http://string-db.org) aims to provide a critical assessment and integration of protein-protein interactions, including direct (physical) as well as indirect (functional) associations. The new version 10.0 of STRING covers more than 2000 organisms, which has necessitated novel, scalable algorithms for transferring interaction information between organisms. For this purpose, we have introduced hierarchical and self-consistent orthology annotations for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution. Further improvements in version 10.0 include a completely redesigned prediction pipeline for inferring protein-protein associations from co-expression data, an API interface for the R computing environment and improved statistical analysis for enrichment tests in user-provided networks.


Subject(s)
Databases, Protein , Protein Interaction Mapping , Gene Expression Profiling , Internet , Proteins/classification , Proteins/genetics , Proteins/metabolism , Software
19.
PLoS One ; 9(11): e111122, 2014.
Article in English | MEDLINE | ID: mdl-25369365

ABSTRACT

Accurate orthology prediction is crucial for many applications in the post-genomic era. The lack of broadly accepted benchmark tests precludes a comprehensive analysis of orthology inference. So far, functional annotation between orthologs serves as a performance proxy. However, this violates the fundamental principle of orthology as an evolutionary definition, while it is often not applicable due to limited experimental evidence for most species. Therefore, we constructed high quality "gold standard" orthologous groups that can serve as a benchmark set for orthology inference in bacterial species. Herein, we used this dataset to demonstrate 1) why a manually curated, phylogeny-based dataset is more appropriate for benchmarking orthology than other popular practices and 2) how it guides database design and parameterization through careful error quantification. More specifically, we illustrate how function-based tests often fail to identify false assignments, misjudging the true performance of orthology inference methods. We also examined how our dataset can instruct the selection of a "core" species repertoire to improve detection accuracy. We conclude that including more genomes at the proper evolutionary distances can influence the overall quality of orthology detection. The curated gene families, called Reference Orthologous Groups, are publicly available at http://eggnog.embl.de/orthobench2.


Subject(s)
Computational Biology , Phylogeny , Bacteria/classification , Computational Biology/standards , Genomics , Internet , User-Computer Interface
20.
Bioessays ; 36(3): 316-29, 2014 Mar.
Article in English | MEDLINE | ID: mdl-24474281

ABSTRACT

We show how metagenomic analysis of the human gut antibiotic resistome, compared across large populations and against environmental or agricultural resistomes, suggests a strong anthropogenic cause behind increasing antibiotic resistance in bacteria. This area has been the subject of intense and polarized debate driven by economic and political concerns; therefore such recently available insights address an important need. We derive and compare antibiotic resistomes of human gut microbes from 832 individuals from ten different countries. We observe and describe significant differences between samples from these countries in the gut resistance potential, in line with expectations from antibiotic usage and exposure in medical and food production contexts. Our results imply roles for both of these sources in increased resistance among pathogens in recent history. In contrast, other available metadata such as age, body mass index, sex, or health status have little effect on the antibiotic resistance potential of human gut microbes. Also watch the Video Abstract.


Subject(s)
Drug Resistance, Microbial/genetics , Gastrointestinal Tract/microbiology , Metagenomics/methods , Biological Evolution , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...