Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Language
Publication year range
1.
PLoS One ; 4(6): e5660, 2009 Jun 19.
Article in English | MEDLINE | ID: mdl-19543527

ABSTRACT

Mycobacterium abscessus is an emerging rapidly growing mycobacterium (RGM) causing a pseudotuberculous lung disease to which patients with cystic fibrosis (CF) are particularly susceptible. We report here its complete genome sequence. The genome of M. abscessus (CIP 104536T) consists of a 5,067,172-bp circular chromosome including 4920 predicted coding sequences (CDS), an 81-kb full-length prophage and 5 IS elements, and a 23-kb mercury resistance plasmid almost identical to pMM23 from Mycobacterium marinum. The chromosome encodes many virulence proteins and virulence protein families absent or present in only small numbers in the model RGM species Mycobacterium smegmatis. Many of these proteins are encoded by genes belonging to a "mycobacterial" gene pool (e.g. PE and PPE proteins, MCE and YrbE proteins, lipoprotein LpqH precursors). However, many others (e.g. phospholipase C, MgtC, MsrA, ABC Fe(3+) transporter) appear to have been horizontally acquired from distantly related environmental bacteria with a high G+C content, mostly actinobacteria (e.g. Rhodococcus sp., Streptomyces sp.) and pseudomonads. We also identified several metabolic regions acquired from actinobacteria and pseudomonads (relating to phenazine biosynthesis, homogentisate catabolism, phenylacetic acid degradation, DNA degradation) not present in the M. smegmatis genome. Many of the "non mycobacterial" factors detected in M. abscessus are also present in two of the pathogens most frequently isolated from CF patients, Pseudomonas aeruginosa and Burkholderia cepacia. This study elucidates the genetic basis of the unique pathogenicity of M. abscessus among RGM, and raises the question of similar mechanisms of pathogenicity shared by unrelated organisms in CF patients.


Subject(s)
Genome, Bacterial , Mycobacterium/genetics , Mycobacterium/pathogenicity , Anti-Bacterial Agents/pharmacology , Chromosomes, Bacterial/ultrastructure , Cystic Fibrosis/microbiology , DNA/metabolism , Drug Resistance, Bacterial/genetics , Genetic Techniques , Humans , Models, Genetic , Mycobacterium smegmatis/genetics , Phenylacetates/metabolism , Phylogeny , Virulence/genetics , Virulence Factors/genetics
2.
BMC Genomics ; 8: 114, 2007 May 09.
Article in English | MEDLINE | ID: mdl-17490474

ABSTRACT

BACKGROUND: The outermost layer of the bacterial surface is of crucial importance because it is in constant interaction with the host. Glycopeptidolipids (GPLs) are major surface glycolipids present on various mycobacterial species. In the fast-grower model organism Mycobacterium smegmatis, GPL biosynthesis involves approximately 30 genes all mapping to a single region of 65 kb. RESULTS: We have recently sequenced the complete genomes of two fast-growers causing human infections, Mycobacterium abscessus (CIP 104536T) and M. chelonae (CIP 104535T). We show here that these two species contain genes corresponding to all those of the M. smegmatis "GPL locus", with extensive conservation of the predicted protein sequences consistent with the production of GPL molecules indistinguishable by biochemical analysis. However, the GPL locus appears to be split into several parts in M. chelonae and M. abscessus. One large cluster (19 genes) comprises all genes involved in the synthesis of the tripeptide-aminoalcohol moiety, the glycosylation of the lipopeptide and methylation/acetylation modifications. We provide evidence that a duplicated acetyltransferase (atf1 and atf2) in M. abscessus and M. chelonae has evolved through specialization, being able to transfer one acetyl at once in a sequential manner. There is a second smaller and distant (M. chelonae, 900 kb; M. abscessus, 3 Mb) cluster of six genes involved in the synthesis of the fatty acyl moiety and its attachment to the tripeptide-aminoalcohol moiety. The other genes are scattered throughout the genome, including two genes encoding putative regulatory proteins. CONCLUSION: Although these three species produce identical GPL molecules, the organization of GPL genes differ between them, thus constituting species-specific signatures. An hypothesis is that the compact organization of the GPL locus in M. smegmatis represents the ancestral form and that evolution has scattered various pieces throughout the genome in M. abscessus and M. chelonae.


Subject(s)
Biosynthetic Pathways/genetics , Genome, Bacterial/genetics , Glycolipids/genetics , Glycopeptides/genetics , Mycobacterium chelonae/genetics , Acetyltransferases/genetics , Base Sequence , Chromatography, Thin Layer , Evolution, Molecular , Glycolipids/biosynthesis , Glycopeptides/biosynthesis , Molecular Sequence Data , Sequence Analysis, DNA , Species Specificity , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization
3.
J Mol Biol ; 362(2): 184-91, 2006 Sep 15.
Article in English | MEDLINE | ID: mdl-16914158

ABSTRACT

A key question in molecular genetics is why severe gene mutations often do not result in a detectable abnormal phenotype. Alternative networks are known to be a gene compensation mechanism. Gene redundancy, i.e. the presence of a duplicate gene (or paralog) elsewhere in the genome, also underpins many cases of gene dispensability. Here, we investigated the role of partial duplicate genes on dispensability, where a partial duplicate is defined as a gene that has no paralog but which codes for a protein made of domains, each of which belongs to at least another protein. The rationale behind this investigation is that, as a partial duplicate codes for a domain redundant protein, we hypothesised that its deletion might have a less severe phenotypic effect than the deletion of other genes. This prompted us to (re)address the topic of gene dispensability by focusing on domain redundancy rather than on gene redundancy. Using fitness data of single-gene deletion mutants of Saccharomyces cerevisiae, we will show that domain redundancy is a compensation mechanism, the strength of which is lower than that of gene redundancy. Finally, we shall discuss the molecular basis of this new compensation mechanism.


Subject(s)
Gene Duplication , Genes, Duplicate , Saccharomyces cerevisiae/genetics , Evolution, Molecular , Genome, Fungal , Models, Genetic , Mutation , Phenotype , Selection, Genetic
4.
Bioinformatics ; 22(12): 1418-23, 2006 Jun 15.
Article in English | MEDLINE | ID: mdl-16601004

ABSTRACT

Most proteins comprise one or several domains. New domain architectures can be created by combining previously existing domains. The elementary events that create new domain architectures may be categorized into three classes, namely domain(s) insertion or deletion (indel), exchange and repetition. Using 'DomainTeam', a tool dedicated to the search for microsyntenies of domains, we quantified the relative contribution of these events. This tool allowed us to collect homologous bacterial genes encoding proteins that have obviously evolved by modular assembly of domains. We show that indels are the most frequent elementary events and that they occur in most cases at either the N- or C-terminus of the proteins. As revealed by the genomic neighbourhood/context of the corresponding genes, we show that a substantial number of these terminal indels are the consequence of gene fusions/fissions. We provide evidence showing that the contribution of gene fusion/fission to the evolution of multi-domain bacterial proteins is lower-bounded by 27% and upper-bounded by 64%. We conclude that gene fusion/fission is a major contributor to the evolution of multi-domain bacterial proteins.


Subject(s)
Bacterial Proteins/chemistry , Computational Biology/methods , Gene Fusion , Bacterial Proteins/genetics , Biological Evolution , Evolution, Molecular , Gene Deletion , Genes, Bacterial , Genome, Bacterial , Internet , Phylogeny , Protein Structure, Tertiary , Proteins/chemistry
5.
Genome Res ; 15(6): 867-74, 2005 Jun.
Article in English | MEDLINE | ID: mdl-15899966

ABSTRACT

The detection, across several genomes, of local conservation of gene content and proximity considerably helps the prediction of features of interest, such as gene fusions or physical and functional interactions. Here, we want to process realistic models of chromosomes, in which genes (or genomic segments of several genes) can be duplicated within a chromosome, or be absent from some other chromosome(s). Our approach adopts the technique of temporarily forgetting genes and working directly with protein "domains" such as those found in Pfam. This allows the detection of strings of domains that are conserved in their content, but not necessarily in their order, which we refer to as domain teams. The prominent feature of the method is that it relaxes the rigidity of the orthology criterion and avoids many of the pitfalls of gene-families identification methods, often hampered by multidomain proteins or low levels of sequence similarity. This approach, that allows both inter- and intrachromosomal comparisons, proves to be more sensitive than the classical methods based on pairwise sequence comparisons, particularly in the simultaneous treatment of many species. The automated and fast detection of domain teams, together with its increased sensitivity at identifying segments of identical (protein-coding) gene contents as well as gene fusions, should prove a useful complement to other existing methods.


Subject(s)
Bacterial Proteins/genetics , Genome, Bacterial , Gram-Negative Bacteria/genetics , Sequence Analysis, Protein/methods , Synteny , Chromosomes, Bacterial/genetics , Databases, Protein , Protein Structure, Tertiary/genetics
6.
Nucleic Acids Res ; 33(Database issue): D641-6, 2005 Jan 01.
Article in English | MEDLINE | ID: mdl-15608279

ABSTRACT

Genomic projects heavily depend on genome annotations and are limited by the current deficiencies in the published predictions of gene structure and function. It follows that, improved annotation will allow better data mining of genomes, and more secure planning and design of experiments. The purpose of the GeneFarm project is to obtain homogeneous, reliable, documented and traceable annotations for Arabidopsis nuclear genes and gene products, and to enter them into an added-value database. This re-annotation project is being performed exhaustively on every member of each gene family. Performing a family-wide annotation makes the task easier and more efficient than a gene-by-gene approach since many features obtained for one gene can be extrapolated to some or all the other genes of a family. A complete annotation procedure based on the most efficient prediction tools available is being used by 16 partner laboratories, each contributing annotated families from its field of expertise. A database, named GeneFarm, and an associated user-friendly interface to query the annotations have been developed. More than 3000 genes distributed over 300 families have been annotated and are available at http://genoplante-info.infobiogen.fr/Genefarm/. Furthermore, collaboration with the Swiss Institute of Bioinformatics is underway to integrate the GeneFarm data into the protein knowledgebase Swiss-Prot.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis/genetics , Databases, Genetic , Genes, Plant , Arabidopsis Proteins/chemistry , Arabidopsis Proteins/physiology , Philosophy , Systems Integration , User-Computer Interface
7.
Comput Biol Chem ; 27(1): 59-67, 2003 Feb.
Article in English | MEDLINE | ID: mdl-12798040

ABSTRACT

This paper describes an efficient algorithm based on a new concept called gene team for detecting conserved gene clusters among an arbitrary number of chromosomes. Within the clusters, neither the order of the genes nor their orientation need be conserved. In addition, insertion of foreign genes within the clusters are permitted to a user-defined extent. This algorithm has been implemented in a publicly available TEAM software that proves to be an efficient tool for systematic searches of conserved gene clusters. Examples of actual biological results are provided. The software is downloadable from http://www-igm.univ-mlv.fr/ approximately raffinot/geneteam.html.


Subject(s)
Genomics/methods , Multigene Family/genetics , Algorithms , Conserved Sequence/genetics , Genes, Bacterial/genetics , Genome, Bacterial , Models, Genetic , Software
8.
Microbiology (Reading) ; 146 ( Pt 8): 1815-1828, 2000 Aug.
Article in English | MEDLINE | ID: mdl-10931887

ABSTRACT

Genome annotation requires explicit identification of gene function. This task frequently uses protein sequence alignments with examples having a known function. Genetic drift, co-evolution of subunits in protein complexes and a variety of other constraints interfere with the relevance of alignments. Using a specific class of proteins, it is shown that a simple data analysis approach can help solve some of the problems posed. The origin of ureohydrolases has been explored by comparing sequence similarity trees, maximizing amino acid alignment conservation. The trees separate agmatinases from arginases but suggest the presence of unknown biases responsible for unexpected positions of some enzymes. Using factorial correspondence analysis, a distance tree between sequences was established, comparing regions with gaps in the alignments. The gap tree gives a consistent picture of functional kinship, perhaps reflecting some aspects of phylogeny, with a clear domain of enzymes encoding two types of ureohydrolases (agmatinases and arginases) and activities related to, but different from ureohydrolases. Several annotated genes appeared to correspond to a wrong assignment if the trees were significant. They were cloned and their products expressed and identified biochemically. This substantiated the validity of the gap tree. Its organization suggests a very ancient origin of ureohydrolases. Some enzymes of eukaryotic origin are spread throughout the arginase part of the trees: they might have been derived from the genes found in the early symbiotic bacteria that became the organelles. They were transferred to the nucleus when symbiotic genes had to escape Muller's ratchet. This work also shows that arginases and agmatinases share the same two manganese-ion-binding sites and exhibit only subtle differences that can be accounted for knowing the three-dimensional structure of arginases. In the absence of explicit biochemical data, extreme caution is needed when annotating genes having similarities to ureohydrolases.


Subject(s)
Biogenic Polyamines/biosynthesis , Ureohydrolases/genetics , Amino Acid Sequence , Animals , Arginase/classification , Arginase/genetics , Bacillus subtilis/enzymology , Bacillus subtilis/genetics , Cyanobacteria/enzymology , Cyanobacteria/genetics , Escherichia coli/enzymology , Escherichia coli/genetics , Evolution, Molecular , Helicobacter pylori/enzymology , Helicobacter pylori/genetics , Humans , Molecular Sequence Data , Phylogeny , Sequence Homology, Amino Acid , Ureohydrolases/classification
SELECTION OF CITATIONS
SEARCH DETAIL
...