Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
Protein Sci ; 33(6): e5011, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38747388

ABSTRACT

A protein sequence encodes its energy landscape-all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs-questioning the differences between them and suggesting that both approaches serve as general methods to engineer thermostability. We used the Ribonuclease H family to compare these approaches and evaluate how the evolutionary relationship of the input sequences affects the properties of the resulting consensus protein. While the consensus protein derived from our full Ribonuclease H sequence alignment is structured and active, it neither shows properties of a well-folded protein nor has enhanced stability. In contrast, the consensus protein derived from a phylogenetically-restricted set of sequences is significantly more stable and cooperatively folded, suggesting that cooperativity may be encoded by different mechanisms in separate clades and lost when too many diverse clades are combined to generate a consensus protein. To explore this, we compared pairwise covariance scores using a Potts formalism as well as higher-order sequence correlations using singular value decomposition (SVD). We find the SVD coordinates of a stable consensus sequence are close to coordinates of the analogous ancestor sequence and its descendants, whereas the unstable consensus sequences are outliers in SVD space.


Subject(s)
Evolution, Molecular , Ribonuclease H/chemistry , Ribonuclease H/genetics , Ribonuclease H/metabolism , Consensus Sequence , Sequence Alignment , Phylogeny , Amino Acid Sequence , Models, Molecular , Protein Folding , Protein Conformation
2.
Proc Natl Acad Sci U S A ; 121(3): e2312029121, 2024 Jan 16.
Article in English | MEDLINE | ID: mdl-38194446

ABSTRACT

Understanding natural protein evolution and designing novel proteins are motivating interest in development of high-throughput methods to explore large sequence spaces. In this work, we demonstrate the application of multisite λ dynamics (MSλD), a rigorous free energy simulation method, and chemical denaturation experiments to quantify evolutionary selection pressure from sequence-stability relationships and to address questions of design. This study examines a mesophilic phylogenetic clade of ribonuclease H (RNase H), furthering its extensive characterization in earlier studies, focusing on E. coli RNase H (ecRNH) and a more stable consensus sequence (AncCcons) differing at 15 positions. The stabilities of 32,768 chimeras between these two sequences were computed using the MSλD framework. The most stable and least stable chimeras were predicted and tested along with several other sequences, revealing a designed chimera with approximately the same stability increase as AncCcons, but requiring only half the mutations. Comparing the computed stabilities with experiment for 12 sequences reveals a Pearson correlation of 0.86 and root mean squared error of 1.18 kcal/mol, an unprecedented level of accuracy well beyond less rigorous computational design methods. We then quantified selection pressure using a simple evolutionary model in which sequences are selected according to the Boltzmann factor of their stability. Selection temperatures from 110 to 168 K are estimated in three ways by comparing experimental and computational results to evolutionary models. These estimates indicate selection pressure is high, which has implications for evolutionary dynamics and for the accuracy required for design, and suggests accurate high-throughput computational methods like MSλD may enable more effective protein design.


Subject(s)
Escherichia coli , Ribonuclease H , Escherichia coli/genetics , Phylogeny , Computer Simulation , Consensus Sequence , Ribonuclease H/genetics
3.
bioRxiv ; 2023 Jul 01.
Article in English | MEDLINE | ID: mdl-37425932

ABSTRACT

A protein sequence encodes its energy landscape - all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs - questioning the differences and suggesting that both approaches serve as general methods to engineer thermostability. We used the Ribonuclease H family to compare these approaches and evaluate how the evolutionary relationship of the input sequences affects the properties of the resulting consensus protein. While the overall consensus protein is structured and active, it neither shows properties of a well-folded protein nor has enhanced stability. In contrast, the consensus protein derived from a phylogenetically-restricted region is significantly more stable and cooperatively folded, suggesting that cooperativity may be encoded by different mechanisms in separate clades and lost when too many diverse clades are combined to generate a consensus protein. To explore this, we compared pairwise covariance scores using a Potts formalism as well as higher-order couplings using singular value decomposition (SVD). We find the SVD coordinates of a stable consensus sequence are close to coordinates of the analogous ancestor sequence and its descendants, whereas the unstable consensus sequences are outliers in SVD space.

4.
Biochemistry ; 60(3): 170-181, 2021 01 26.
Article in English | MEDLINE | ID: mdl-33433210

ABSTRACT

In addition to encoding the tertiary fold and stability, the primary sequence of a protein encodes the folding trajectory and kinetic barriers that determine the speed of folding. How these kinetic barriers are encoded is not well understood. Here, we use evolutionary sequence variation in the α-lytic protease (αLP) protein family to probe the relationship between sequence and energy landscape. αLP has an unusual energy landscape: the native state of αLP is not the most thermodynamically favored conformation and, instead, remains folded due to a large kinetic barrier preventing unfolding. To fold, αLP utilizes an N-terminal pro region similar in size to the protease itself that functions as a folding catalyst. Once folded, the pro region is removed, and the native state does not unfold on a biologically relevant time scale. Without the pro region, αLP folds on the order of millennia. A phylogenetic search uncovers αLP homologs with a wide range of pro region sizes, including some with no pro region at all. In the resulting phylogenetic tree, these homologs cluster by pro region size. By studying homologs naturally lacking a pro region, we demonstrate they can be thermodynamically stable, fold much faster than αLP, yet retain the same fold as αLP. Key amino acids thought to contribute to αLP's extreme kinetic stability are lost in these homologs, supporting their role in kinetic stability. This study highlights how the entire energy landscape plays an important role in determining the evolutionary pressures on the protein sequence.


Subject(s)
Bacterial Proteins/chemistry , Evolution, Molecular , Models, Molecular , Phylogeny , Protein Folding , Serine Endopeptidases/chemistry , Bacterial Proteins/genetics , Enzyme Stability , Kinetics , Serine Endopeptidases/genetics
5.
J Mol Biol ; 428(1): 153-164, 2016 Jan 16.
Article in English | MEDLINE | ID: mdl-26608811

ABSTRACT

Many bacteria employ a protein organelle, the carboxysome, to catalyze carbon dioxide fixation in the Calvin Cycle. Only 10 genes from Halothiobacillus neapolitanus are sufficient for heterologous expression of carboxysomes in Escherichia coli, opening the door to detailed mechanistic analysis of the assembly process of this complex (more than 200MDa). One of these genes, csoS2, has been implicated in assembly but ascribing a molecular function is confounded by the observation that the single csoS2 gene yields expression of two gene products and both display an apparent molecular weight incongruent with the predicted amino acid sequence. Here, we elucidate the co-translational mechanism responsible for the expression of the two protein isoforms. Specifically, csoS2 was found to possess -1 frameshifting elements that lead to the production of the full-length protein, CsoS2B, and a truncated protein, CsoS2A, which possesses a C-terminus translated from the alternate frame. The frameshifting elements comprise both a ribosomal slippery sequence and a 3' secondary structure, and ablation of either sequence is sufficient to eliminate the slip. Using these mutants, we investigated the individual roles of CsoS2B and CsoS2A on carboxysome formation. In this in vivo formation assay, cells expressing only the CsoS2B isoform were capable of producing intact carboxysomes, while those with only CsoS2A were not. Thus, we have answered a long-standing question about the nature of CsoS2 in this model microcompartment and demonstrate that CsoS2B is functionally distinct from CsoS2A in the assembly of α-carboxysomes.


Subject(s)
Bacterial Proteins/biosynthesis , Bacterial Proteins/genetics , Frameshifting, Ribosomal , Gene Expression Regulation, Bacterial , Halothiobacillus/genetics , Protein Isoforms/biosynthesis , Protein Isoforms/genetics , Escherichia coli/genetics , Macromolecular Substances/metabolism , Protein Multimerization
SELECTION OF CITATIONS
SEARCH DETAIL
...