Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 13 de 13
Filter
Add more filters










Publication year range
1.
Mol Biol (Mosk) ; 36(2): 315-9, 2002.
Article in Russian | MEDLINE | ID: mdl-11969094

ABSTRACT

On the basis of recent fundamentally novel developments in the protein structure a proteomic code is suggested, that would potentially allow to describe sequence, structure, and function of proteins by a spectrum of elementary loop-n-lock units. All major characteristics of the nearly standard units are described, and first five "codons" of the proteomic code are presented with their respective unique sequences, structures, and functions. More such codons are to be discovered, and the general procedure for their identification is described.


Subject(s)
Proteins/chemistry , Proteins/physiology , Amino Acid Sequence , Evolution, Molecular , Models, Molecular , Molecular Sequence Data , Protein Conformation , Proteins/classification
2.
Proteins ; 45(4): 346-50, 2001 Dec 01.
Article in English | MEDLINE | ID: mdl-11746682

ABSTRACT

Protein structure can be viewed as a compact linear array of nearly standard size closed loops of 25-30 amino acid residues (Berezovsky et al., FEBS Letters 2000; 466: 283-286) irrespective of details of secondary structure. The end-to-end contacts in the loops are likely to be hydrophobic, which is a testable hypothesis. This notion could be verified by direct comparison of the loop maps with Kyte and Doolittle hydropathicity plots. This analysis reveals that most of the ends of the loops are hydrophobic, indeed. The same conclusion is reached on the basis of positional autocorrelation analysis of protein sequences of 23 fully sequenced bacterial genomes. Hydrophobic residues valine, alanine, glycine, leucine, and isoleucine appear preferentially at the 25-30 residues distance one from another. These observations open a new perspective in the understanding of protein structure and folding: a consecutive looping of the polypeptide chain with the loops ending primarily at hydrophobic nuclei.


Subject(s)
Hydrophobic and Hydrophilic Interactions , Models, Molecular , Protein Folding , Amino Acid Sequence , Genome, Bacterial , Interleukin-1/chemistry , Proteins/chemistry , Proteins/genetics
3.
J Mol Evol ; 53(4-5): 394-401, 2001.
Article in English | MEDLINE | ID: mdl-11675599

ABSTRACT

Evolution of proteins encoded in nucleotide sequences began with the advent of the triplet code. The chronological order of the appearance of amino acids on the evolution scene and the steps in the evolution of the triplet code have been recently reconstructed (Trifonov, 2000b) on the basis of 40 different ranking criteria and hypotheses. According to the consensus chronology, the pair of complementary GGC and GCC codons for the amino acids alanine and glycine appeared first. Other codons appeared as complementary pairs as well, which divided their respective amino acids into two alphabets, encoded by triplets with either central purines or central pyrimidines: G, D, S, E, N, R, K, Q, C, H, Y, and W (Glycine alphabet G) and A, V, P, S, L, T, I, F, and M (Alanine alphabet A). It is speculated that the earliest polypeptide chains were very short, presumably of uniform length, belonging to two alphabet types encoded in the two complementary strands of the earliest mRNA duplexes. After the fusion of the minigenes, a mosaic of the alphabets would form. Traces of the predicted mosaic structure have been, indeed, detected in the protein sequences of complete prokaryotic genomes in the form of weak oscillations with the period 12 residues in the form of alteration of two types of 6 residue long units. The next stage of protein evolution corresponded to the closure of the chains in the loops of the size 25-30 residues (Berezovsky et al., 2000). Autocorrelation analysis of proteins of 23 complete archaebacterial and eubacterial genomes revealed that the preferred distances between valine, alanine, glycine, leucine, and isoleucine along the sequences are in the same range of 25-30 residues, indicating that the loops are primarily closed by hydrophobic interactions between the ends of the loops. The loop closure stage is followed by the formation of typical folds of 100-200 amino acids, via end-to-end fusion of the genes encoding the loop-size chains. This size was apparently dictated by the optimal ring closure for DNA. In both cases the closure into the ring (loop) rendered evolutionarily advantageous stability to the respective structures. Further gene fusions lead to the formation of modern multidomain proteins. Recombinational gene splicing is likely to have appeared after the DNA circularization stage.


Subject(s)
Evolution, Molecular , Proteins/genetics , Amino Acid Sequence , Base Sequence , Codon/genetics , DNA, Recombinant , Genetic Code , Models, Genetic , Molecular Sequence Data , Protein Folding , Protein Structure, Tertiary , Proteins/chemistry
4.
Protein Eng ; 14(6): 403-7, 2001 Jun.
Article in English | MEDLINE | ID: mdl-11477219

ABSTRACT

Protein chains make numerous returns in globules, thus forming loops, closed by tight residue-to-residue contacts-closed loops. Previous statistical analysis of the sizes and locations of the closed loops in all major protein folds revealed that the loops have an almost standard contour length of 25-30 amino acid residues and follow one after another along the chain. In this work the closed loops of the major folds are presented in three dimensions. A special image filtering procedure is introduced that allows one to visualize the standard size closed loops for the first time. The loop positions along the sequences are verified by detection of loop-end clusters.


Subject(s)
Models, Molecular , Protein Folding , Proteins/chemistry , Data Interpretation, Statistical , Evolution, Molecular , Imaging, Three-Dimensional , Protein Conformation , Protein Structure, Tertiary
5.
J Mol Biol ; 307(5): 1419-26, 2001 Apr 13.
Article in English | MEDLINE | ID: mdl-11292352

ABSTRACT

In a globular protein the polypeptide chain returns to itself many times, making numerous chain-to-chain contacts. The stability of these contacts is maintained primarily by van der Waals interactions. In this work we isolated and analysed van der Waals contacts that stabilise spatial structures of nine major folds. We suggest a specific way to identify the tightest contacts of prime importance for the stability of a given crystallized protein and introduce the notion of the van der Waals lock. The loops closed by the van der Waals interactions provide a basically novel view of protein globule organization: the loop-n-lock structure. This opens a new perspective in understanding protein folding as well: the consecutive looping of the polypeptide chain and the locking of the loop ends by tight van der Waals interactions.


Subject(s)
Protein Folding , Proteins/chemistry , Proteins/metabolism , Binding Sites , Crystallography, X-Ray , Hydrogen Bonding , Models, Molecular , Protein Structure, Tertiary
6.
J Biomol Struct Dyn ; 19(3): 397-403, 2001 Dec.
Article in English | MEDLINE | ID: mdl-11790139

ABSTRACT

Standard building blocks of proteins--closed loops of 25-30 amino acid residues--have been recently discovered and further characterized by combined efforts of several laboratories. New challenging views on the protein structure, folding, and evolution are introduced by these studies. In particular, the role of van der Waals contacts in protein stability is better understood. They can be considered as locks closing the polypeptide chain returns and forming the loop-n-lock elements. The linearity of the arrangement of the standard loops in the proteins has important evolutionary implications. Selection pressure to maintain the loops of nearly standard size is reflected in the protein sequences as characteristic distance between hydrophobic residues, equal to the loop end-to-end distance. Further characterization of the loop-n-lock units reveals several sequence/structure prototypes, which suggests a new basis for protein classification. The following is a review of these studies.


Subject(s)
Protein Conformation , Proteins/chemistry , Amino Acid Sequence , Evolution, Molecular , Hydrophobic and Hydrophilic Interactions , Immunoglobulins/chemistry , Immunoglobulins/genetics , Models, Molecular , Molecular Sequence Data , Protein Folding , Protein Structure, Secondary , Proteins/classification , Proteins/genetics , Selection, Genetic
7.
J Comput Biol ; 7(1-2): 183-92, 2000.
Article in English | MEDLINE | ID: mdl-10890395

ABSTRACT

The method of the representation of amino acid sequence by graph of the interactions energy between parts of spatial structure has been elaborated. Our method provides the possibility to establish the compatibility between each point of a polypeptide chain and the Van der Waals interactions energy of regions of a native globule adjacent to this amino acid residue. We have undertaken an exhaustive analysis of a set of proteins. Boundaries of domain and module structures have been found. Nonequivalence of different parts of sequences in respect to their contribution to stabilization of the spatial structure of the protein macromolecules has been revealed. On the basis of the number of energetic levels which are necessary to identify all independent parts of the globule, the contribution from each part of the sequence to stabilization of the spatial structure of the globule is defined. Thus, it has been found that the sequence of amino acid residues coincides with the sequence of the numerical values which can be used in turn in formal procedures, such as an alignment, a search of consensus, the recognition of composition peculiarities, etc. An example of the comparison of proteins with various sequence identities is considered to demonstrate the scheme of an alignment procedure.


Subject(s)
Amino Acid Sequence , Proteins/chemistry , Sequence Analysis, Protein/statistics & numerical data , Biometry , Muramidase/chemistry , Protein Folding , Protein Structure, Tertiary , Sequence Alignment/statistics & numerical data , Thermodynamics
8.
J Biomol Struct Dyn ; 17(5): 799-809, 2000 Apr.
Article in English | MEDLINE | ID: mdl-10798525

ABSTRACT

Van der Waals interaction energy in globular proteins is presented by the interaction energies between regions of protein spatial structure with homogenous medium density distribution. We introduce a notion of the local medium permittivity as a function of absorptance of molecular groups with particular conformation. Proposed theory avoids shortcomings which are typical for the calculations on the basis of the pairwise additive approximation. The approach takes into account local peculiarities of protein spatial structure and physical-chemical characteristics of amino acid residues and molecular groups.


Subject(s)
Proteins/metabolism , Electromagnetic Fields , Models, Theoretical , Protein Binding , Protein Conformation
9.
FEBS Lett ; 466(2-3): 283-6, 2000 Jan 28.
Article in English | MEDLINE | ID: mdl-10682844

ABSTRACT

By screening the crystal protein structure database for close Calpha-Calpha contacts, a size distribution of the closed loops is generated. The distribution reveals a maximum at 27+/-5 residues, the same for eukaryotic and prokaryotic proteins. This is apparently a consequence of polymer statistic properties of protein chain trajectory. That is, closure into the loops depends on the flexibility (persistence length) of the chain. The observed preferential loop size is consistent with the theoretical optimal loop closure size. The mapping of the detected unit-size loops on the sequences of major typical folds reveals an almost regular compact consecutive arrangement of the loops. Thus, a novel basic element of protein architecture is discovered; structurally diverse closed loops of the particular size.


Subject(s)
Protein Structure, Secondary
10.
J Biomol Struct Dyn ; 17(1): 133-55, 1999 Aug.
Article in English | MEDLINE | ID: mdl-10496428

ABSTRACT

An algorithm for determining of protein domain structure is proposed. Domain structures resulted from the algorithm application have been obtained and compared with available data. The method is based on entirely physical model of van der Waals interactions that reflects as illustrated in this work the distribution of electron density. Various levels of hierarchy in the protein spatial structure are discerned by analysis of the energy interaction between structural units of different scales. Thus the level of energy hierarchy plays role of sole parameter, and the method obviates the use of complicated geometrical criteria with numerous fitting parameters. The algorithm readily and accurately locates domains formed by continuous segments of the protein chain as well as those comprising non-sequential segments, sets no limit to the number of segments in a domain. We have analyzed 309 protein structures. Among 277 structures for which our results could be compared with the domain definitions made in other works, 243 showed complete or partial coincidence, and only in 34 cases the domain structures proved substantially different. The domains delineated with our approach may coincide with reference definition at different levels of the globule hierarchy. Along with defining the domain structure, our approach allows one to consider the protein spatial structure in terms of the spatial distribution of the interaction energy in order to establish the correspondence between the hierarchy of energy distribution and the hierarchy of structural elements.


Subject(s)
Algorithms , Protein Conformation , Protein Folding , Protein Structure, Tertiary , Energy Transfer , Mathematical Computing
11.
Protein Eng ; 12(1): 23-30, 1999 Jan.
Article in English | MEDLINE | ID: mdl-10065707

ABSTRACT

An exhaustive statistical analysis of the amino acid sequences at the carboxyl (C) and amino (N) termini of proteins and of coding nucleic acid sequences at the 5' side of the stop codons was undertaken. At the N ends, Met and Ala residues are over-represented at the first (+1) position whereas at positions 2 and 5 Thr is preferred. These peculiarities at N-termini are most probably related to the mechanism of initiation of translation (for Met) and to the mechanisms governing the life-span of proteins via regulation of their degradation (for Ala and Thr). We assume that the C-terminal bias facilitates fixation of the C ends on the protein globule by a preference for charged and Cys residues. The terminal biases, a novel feature of protein structure, have to be taken into account when molecular evolution, three-dimensional structure, initiation and termination of translation, protein folding and life-span are concerned. In addition, the bias of protein termini composition is an important feature which should be considered in protein engineering experiments.


Subject(s)
Amino Acid Sequence , Bias , Animals , Base Sequence , Chi-Square Distribution , Databases, Factual , Humans , Models, Statistical
12.
FEBS Lett ; 418(1-2): 43-6, 1997 Nov 24.
Article in English | MEDLINE | ID: mdl-9414092

ABSTRACT

We suggest a new simple approach for comparing the primary structure of proteins and their spatial structure. It relies on the one-to-one correspondence between each residue of the polypeptide chain and the energy of van der Waals interactions between the regions of the native globule flanking this residue. The method obviates the sophisticated geometrical criteria for estimating similarity between spatial structures. Besides, it permits one to analyze structural units of different scale.


Subject(s)
Amino Acid Sequence , Protein Conformation , Proteins/chemistry , Ribonucleases/chemistry , Bacterial Proteins , Computer Simulation , Endoribonucleases/chemistry , Ribonuclease H/chemistry , Software
13.
FEBS Lett ; 404(2-3): 140-2, 1997 Mar 10.
Article in English | MEDLINE | ID: mdl-9119051

ABSTRACT

We have undertaken an exhaustive statistical analysis of the amino acid sequences at the carboxyl-terminal (C) ends of proteins. The composition of the C-terminal decapeptides differs from that expected for the given proteins from the overall amino acid composition. For E. coli, yeast, and H. sapiens it was shown that positively charged amino acid residues are over-represented while Gly residues are under-represented. The C-terminal bias, a novel feature of protein structure, should be taken into account when molecular evolution, spatial structure, translational termination and protein folding are concerned.


Subject(s)
Peptide Fragments/chemistry , Proteins/chemistry , Amino Acid Sequence , Conserved Sequence , Databases, Factual , Escherichia coli , Evolution, Molecular , Glycine , Humans , Protein Conformation , Protein Folding , Saccharomyces cerevisiae
SELECTION OF CITATIONS
SEARCH DETAIL
...