Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
Add more filters










Database
Language
Publication year range
1.
Proteins ; Suppl 5: 55-67, 2001.
Article in English | MEDLINE | ID: mdl-11835482

ABSTRACT

We present the assessment of the CASP4 fold recognition category. The tasks we had to execute include the splitting of multidomain targets into single domains, the classification of target domains in terms of prediction categories, the numerical evaluation of predictions, the mapping of numerical scores to quality indices, the ranking of predictors, the selection of top-performing groups, and the analysis and critical discussion of the state of the art in this field. The 125 fold recognition groups were assessed by a total score that summarizes their performance over all targets and a quality score reflecting the average quality of the submitted models. Most of the top-performing groups achieved respectable results on both scores simultaneously. Several groups submitted models that were much closer to the respective target structures than any of the known folds in the Protein Data Bank. The CASP4 assessment included the automated servers of the parallel CAFASP experiment. For the total score, the highest rank achieved by a fully automated server is 12. Two thirds of the predictors have rather low scores.


Subject(s)
Models, Molecular , Protein Folding , Computer Simulation , Protein Structure, Tertiary , Sequence Analysis, Protein
2.
J Mol Biol ; 296(4): 1139-52, 2000 Mar 03.
Article in English | MEDLINE | ID: mdl-10686110

ABSTRACT

The genome sciences face the challenge to characterize structure and function of a vast number of novel genes. Sequence search techniques are used to infer functional and structural information from similarities to experimentally characterized genes or proteins. The persistent goal is to refine these techniques and to develop alternative and complementary methods to increase the range of reliable inference.Here, we focus on the structural and functional assignments that can be inferred from the known three-dimensional structures of proteins. The study uses all structures in the Protein Data Bank that were known by the end of 1997. The protein structures released in 1998 were then characterized in terms of functional and structural similarity to the previously known structures, yielding an estimate of the maximum amount of information on novel protein sequences that can be obtained from inference techniques. The 147 globular proteins corresponding to 196 domains released in 1998 have no clear sequence similarity to previously known structures. However, 75 % of the domains have extensive structure similarity to previously known folds, and most importantly, in two out of three cases similarity in structure coincides with related function. In view of this analysis, full utilization of existing structure data bases would provide information for many new targets even if the relationship is not accessible from sequence information alone. Currently, the most sophisticated techniques detect of the order of one-third of these relationships.


Subject(s)
Flavoproteins , Protein Conformation , Amino Acid Sequence , Bacterial Proteins/chemistry , Carrier Proteins/chemistry , Desulfovibrio vulgaris , Models, Chemical , Models, Molecular , Molecular Sequence Data , Sequence Homology, Amino Acid
3.
Proteins ; Suppl 3: 112-20, 1999.
Article in English | MEDLINE | ID: mdl-10526359

ABSTRACT

We describe the results obtained using fold recognition techniques in our third participation in the CASP experiment. The approach relies on knowledge-based potentials for alignment production and fold identification. As indicated by the increase in alignment quality and fold identification reliability, the predictions improved from CASP1 to CASP3. In particular, we identified structural relationships in which no known evolutionary link exists. Our predictions are based on single sequences rather than multiple sequence alignments. Additionally, we voluntarily submitted only a single model for each target because, in our view, submission of a single model is the most stringent test. We describe the methods used, the strategy adopted in the predictions, and the prediction results and discuss future work.


Subject(s)
Escherichia coli Proteins , Protein Folding , Proteins/chemistry , Algorithms , Amino Acid Sequence , Bacterial Proteins/chemistry , Carbon-Oxygen Lyases/chemistry , DNA-Binding Proteins/chemistry , Databases, Factual , Escherichia coli/chemistry , Models, Molecular , Molecular Sequence Data , Ribosomal Proteins/chemistry , Sequence Alignment
SELECTION OF CITATIONS
SEARCH DETAIL
...