2.
Bioinformatics ; 36(Suppl_2): i651-i658, 2020 12 30.
Article in English | MEDLINE | ID: mdl-33381850

ABSTRACT

MOTIVATION: Horizontal gene transfer (HGT) is a major source of variability in prokaryotic genomes. Regions of genome plasticity (RGPs) are clusters of genes located in highly variable genomic regions. Most of them arise from HGT and correspond to genomic islands (GIs). The study of those regions at the species level has become increasingly difficult with the data deluge of genomes. To date, no methods are available to identify GIs using hundreds of genomes to explore their diversity. RESULTS: We present here the panRGP method that predicts RGPs using pangenome graphs made of all available genomes for a given species. It allows the study of thousands of genomes in order to access the diversity of RGPs and to predict spots of insertions. It gave the best predictions when benchmarked alongside other GI detection tools against a reference dataset. In addition, we illustrated its use on metagenome-assembled genomes by redefining the borders of the leuX tRNA hotspot, a well-studied spot of insertion in Escherichia coli. panRGP is a scalable and reliable tool to predict GIs and spots, making it an ideal approach for large comparative studies. AVAILABILITY AND IMPLEMENTATION: The methods presented in the current work are available through the following software: https://github.com/labgem/PPanGGOLiN. Detailed results and scripts to compute the benchmark metrics are available at https://github.com/axbazin/panrgp_supdata.


Subject(s)
Genomic Islands , Software , Gene Transfer, Horizontal , Genomic Islands/genetics , Genomics , Metagenome
3.
Nat Commun ; 11(1): 5541, 2020 11 02.
Article in English | MEDLINE | ID: mdl-33139723

ABSTRACT

The bacterium Neisseria meningitidis causes life-threatening meningitis and sepsis. Here, we construct a complete collection of defined mutants in protein-coding genes of this organism, identifying all genes that are essential under laboratory conditions. The collection, named NeMeSys 2.0, consists of individual mutants in 1584 non-essential genes. We identify 391 essential genes, which are associated with basic functions such as expression and preservation of genome information, cell membrane structure and function, and metabolism. We use this collection to shed light on the functions of diverse genes, including a gene encoding a member of a previously unrecognised class of histidinol-phosphatases; a set of 20 genes required for type IV pili function; and several conditionally essential genes encoding antitoxins and/or immunity proteins. We expect that NeMeSys 2.0 will facilitate the phenotypic profiling of a major human bacterial pathogen.


Subject(s)
Genes, Bacterial/genetics , Genes, Essential/genetics , Mutation , Neisseria meningitidis/genetics , Neisseria meningitidis/metabolism , Phenotype , Bacterial Proteins/metabolism , Computational Biology , Fimbriae Proteins/genetics , Fimbriae Proteins/metabolism , Fimbriae, Bacterial/genetics , Fimbriae, Bacterial/metabolism , Gene Expression Profiling , Gene Expression Regulation, Bacterial , Genome, Bacterial , Humans , Neisseria meningitidis/pathogenicity
4.
PLoS Comput Biol ; 16(3): e1007732, 2020 03.
Article in English | MEDLINE | ID: mdl-32191703

ABSTRACT

The use of comparative genomics for functional, evolutionary, and epidemiological studies requires methods to classify gene families in terms of occurrence in a given species. These methods usually lack multivariate statistical models to infer the partitions and the optimal number of classes and do not account for genome organization. We introduce a graph structure to model pangenomes in which nodes represent gene families and edges represent genomic neighborhood. Our method, named PPanGGOLiN, partitions nodes using an Expectation-Maximization algorithm based on a multivariate Bernoulli Mixture Model coupled with a Markov Random Field. This approach takes into account the topology of the graph and the presence/absence of genes in pangenomes to classify gene families into persistent, cloud, and one or several shell partitions. By analyzing the partitioned pangenome graphs of isolate genomes from 439 species and metagenome-assembled genomes from 78 species, we demonstrate that our method is effective in estimating the persistent genome. Interestingly, it shows that the shell genome is a key element to understand genome dynamics, presumably because it reflects how genes present at intermediate frequencies drive adaptation of species, and its proportion in genomes is independent of genome size. The graph-based approach proposed by PPanGGOLiN is useful to depict the overall genomic diversity of thousands of strains in a compact structure and provides an effective basis for very large scale comparative genomics. The software is freely available at https://github.com/labgem/PPanGGOLiN.
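
The graph structure described in this abstract (nodes are gene families, edges record genomic neighborhood) can be illustrated with a minimal sketch. This is not PPanGGOLiN's implementation or API: the function name `build_pangenome_graph`, the input layout, and the toy genomes are all hypothetical, and no statistical partitioning (EM, Bernoulli mixture, Markov Random Field) is performed. It only shows how presence/absence and adjacency could be collected into one graph.

```python
from collections import defaultdict


def build_pangenome_graph(genomes):
    """Collect gene families (nodes) and adjacencies (edges) across genomes.

    `genomes` is a hypothetical input layout: a dict mapping a genome name to
    a list of contigs, each contig being an ordered list of gene-family ids.
    """
    node_genomes = defaultdict(set)  # family id -> genomes carrying it
    edge_genomes = defaultdict(set)  # {family, family} -> genomes where adjacent
    for name, contigs in genomes.items():
        for contig in contigs:
            for family in contig:
                node_genomes[family].add(name)
            # Consecutive genes on a contig are genomic neighbors.
            for left, right in zip(contig, contig[1:]):
                if left != right:  # ignore tandem copies of the same family
                    edge_genomes[frozenset((left, right))].add(name)
    return node_genomes, edge_genomes


# Toy data (invented for illustration): three genomes, one contig each.
toy = {
    "g1": [["A", "B", "C", "D"]],
    "g2": [["A", "B", "D"]],
    "g3": [["A", "B", "C", "E"]],
}
nodes, edges = build_pangenome_graph(toy)

# Fraction of genomes carrying each family: the presence/absence signal
# that a partitioning step would take as input.
for family in sorted(nodes):
    print(family, round(len(nodes[family]) / len(toy), 2))
```

In this toy example, families A and B occur in every genome, while D and E occur in only some; a partitioning method would use such frequencies together with the edges to assign families to classes.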


Subject(s)
Genome, Bacterial/genetics , Genomics/methods , Software , Algorithms , Bacteria/classification , Bacteria/genetics , Multivariate Analysis
5.
Nucleic Acids Res ; 48(D1): D579-D589, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31647104

ABSTRACT

Large-scale genome sequencing and the increasingly massive use of high-throughput approaches produce a vast amount of new information that completely transforms our understanding of thousands of microbial species. However, despite the development of powerful bioinformatics approaches, full interpretation of the content of these genomes remains a difficult task. Launched in 2005, the MicroScope platform (https://www.genoscope.cns.fr/agc/microscope) has been under continuous development and provides analysis for prokaryotic genome projects together with metabolic network reconstruction and post-genomic experiments, allowing users to improve the understanding of gene functions. Here we present new improvements of the MicroScope user interface for genome selection, navigation and expert gene annotation. Automatic functional annotation procedures of the platform have also been updated and we added several new tools for the functional annotation of genes and genomic regions. We finally focus on new tools and a pipeline developed to perform comparative analyses on hundreds of genomes based on pangenome graphs. To date, MicroScope contains data for >11 800 microbial genomes, part of which are manually curated and maintained by microbiologists (>4500 personal accounts in September 2019). The platform enables collaborative work in a rich comparative genomic context and improves community-based curation efforts.


Subject(s)
Genes, Archaeal , Genes, Bacterial , Genomics/methods , Molecular Sequence Annotation/methods , Software , Databases, Genetic , Metabolic Networks and Pathways