RESUMO
Every organism can be characterized by the amino acid composition of its proteome. So far it was assumed that these compositions are determined by the GC content of the DNA or, in some cases, by extreme lifestyles, like thermophily or halophily. Here, we focussed our analysis on eight amino acids, each of which is encoded by both, GC and AT rich codons, to identify finer amino acid patterns beyond the GC dominance. We investigated the conceptually translated proteomes of 1029 bacterial and archaeal strains with sequenced genomes for amino acid composition. Using correspondence analysis, we found that phylogenetic groups within bacteria and archaea generally can be discriminated from other groups due to their amino acid composition. In some cases, single organisms, e.g. Treponema pallidum strains or Mycoplasma penetrans, are characterized by extreme amino acid compositions. We assume that our data could provide a basis for a new approach to analyze evolution of bacterial and archaeal groups. Furthermore, for single organisms, the detailed knowledge of the amino acid composition of the entire proteome encoded in the genome could lead to a better understanding, important for pharmaceutical or biotechnological applications. We recommend that information about amino acid compositions should be provided in databases, comparable to the GC content of genomes.
Assuntos
Aminoácidos/genética , Archaea/genética , Bactérias/genética , Composição de Bases , Códon , Evolução Molecular , Genoma Arqueal , Genoma Bacteriano , Filogenia , Proteoma/análise , Análise de Sequência de ProteínaRESUMO
Metallothioneins and metallohistins are short peptides with a high cysteine and/or histidine content able to coordinate metals intracellularly, thereby increasing the tolerance against elevated concentrations of metals. Because of their features, they can be detected by in silico prediction from proteomes annotated from sequenced genomes. Here, we analyzed 73 sequenced actinobacterial genomes for peptides (≤ 100 amino acids) with a high content of cysteine and histidine (≥ 15%) and identified 103 putative metallothioneins and metallohistins. For 45 of these peptides, we found similarities to metal binding protein domains, including zinc fingers, heavy metal transporters or eukaryotic metallothioneins, which can serve as proof-of-principle in underscoring a potential function as metal binding peptides. An evolutionary origin from metal containing domains of enzymes is discussed and metallohistins not containing cysteine are described for the first time for bacteria.