ABSTRACT
BACKGROUND: Retrotransposition of mRNA transcripts gives occasionally rise to functional retrogenes. Through acquiring tempero-spatial expression patterns distinct from their parental genes and/or functional mutations in their coding sequences, such retrogenes may in principle reshape signalling networks. RESULTS: Here we present evidence for such a scenario, involving retrogenes of Rap1 belonging to the Ras family of small GTPases. We identified two murine and one human-specific retrogene of Rap1A and Rap1B, which encode proteins that differ by only a few amino acids from their parental Rap1 proteins. Markedly, human hRap1B-retro and mouse mRap1A-retro1 acquired mutations in the 12th and 59th amino acids, respectively, corresponding to residues mutated in constitutively active oncogenic Ras proteins. Statistical and structural analyses support a functional evolution scenario, where Rap1 isoforms of retrogenic origin are functionally distinct from their parental proteins. Indeed, all retrogene-encoded GTPases have an increased GTP/GDP binding ratio in vivo, indicating that their conformations resemble that of active GTP-bound Rap1. We furthermore demonstrate that these three Rap1 isoforms exhibit distinct affinities for the Ras-binding domain of RalGDS. Finally, when tested for their capacity to induce key cellular processes like integrin-mediated cell adhesion or cell spreading, marked differences are seen. CONCLUSIONS: Together, these data lend strong support for an evolution scenario, where retrotransposition and subsequent mutation events generated species-specific Rap1 isoforms with differential signaling potential. Expression of the constitutively active human Rap1B-retro in cells like those derived from Ramos Burkitt's lymphoma and bone marrow from a patient with myelodysplastic syndrome (MDS) warrants further investigation into its role in disease development.
Subject(s)
rap1 GTP-Binding Proteins/genetics , rap1 GTP-Binding Proteins/metabolism , Animals , Humans , Mice , Models, Molecular , Retroelements , Reverse Transcription , rap1 GTP-Binding Proteins/chemistryABSTRACT
MOTIVATION: Amino acid changing mutations in proteins are contstrained by purifying selection and accumulate at different rates. We estimate evolutionary rates on multiple alignments of eukaryotic protein families in a maximum likelihood framework and spot sets of slow and fast evolving proteins. RESULTS: We find that the evolution of indispensable proteins is constrained by selection and that protein secretion is coupled to an increased evolutionary rate.
Subject(s)
Evolution, Molecular , Models, Genetic , Proteome/genetics , Proteome/metabolism , Sequence Alignment/methods , Sequence Analysis, Protein/methods , Algorithms , Amino Acid Sequence , Animals , Computer Simulation , Conserved Sequence , Humans , Molecular Sequence Data , Proteome/classification , Sequence Homology, Amino Acid , Species SpecificityABSTRACT
Evolutionary rate and gene age are interrelated when the age of a gene is assessed by the taxonomic distribution in the gene family. This is because homology detection by sequence comparison is depending on sequence similarity. We estimate family specific rates of protein evolution for orthologous families with representatives from man, fugu, fly, and worm. In fact, we observe that younger proteins tend to evolve faster than older ones. We estimate time points of duplication events that gave rise to novel protein functions and show that younger proteins were duplicated more recently than older ones.
Subject(s)
Evolution, Molecular , Gene Duplication , Proteins/genetics , Age Factors , Animals , Humans , Multigene Family , Sequence AlignmentABSTRACT
The SYSTERS project aims to provide a meaningful partitioning of the whole protein sequence space by a fully automatic procedure. A refined two-step algorithm assigns each protein to a family and a superfamily. The sequence data underlying SYSTERS release 4 now comprise several protein sequence databases derived from completely sequenced genomes (ENSEMBL, TAIR, SGD and GeneDB), in addition to the comprehensive Swiss-Prot/TrEMBL databases. The SYSTERS web server (http://systers.molgen.mpg.de) provides access to 158 153 SYSTERS protein families. To augment the automatically derived results, information from external databases like Pfam and Gene Ontology are added to the web server. Furthermore, users can retrieve pre-processed analyses of families like multiple alignments and phylogenetic trees. New query options comprise a batch retrieval tool for functional inference about families based on automatic keyword extraction from sequence annotations. A new access point, PhyloMatrix, allows the retrieval of phylogenetic profiles of SYSTERS families across organisms with completely sequenced genomes.