Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
Add more filters










Database
Language
Publication year range
1.
Biochimie ; 197: 74-85, 2022 Jun.
Article in English | MEDLINE | ID: mdl-35143919

ABSTRACT

3D protein structures determine proteins' biological functions. The 3D structure of the protein backbone can be approximated using the prototypes of local protein conformations. Sets of these prototypes are called structural alphabets (SAs). Amongst several approaches to the prediction of 3D structures from amino acid sequences, one approach is based on the prediction of SA prototypes for a given amino acid sequence. Protein Blocks (PBs) is the most known SA, and it is composed of 16 prototypes of five consecutive amino acids which were identified as optimal prototypes considering the ability to correctly approximate the local structure and the prediction accuracy of prototypes from an amino acid sequence. We developed models for PBs prediction from sequence information using different data mining approaches and machine learning algorithms. Besides the amino acid sequences, the results of the following tools were used to train the models: the Spider3 predictor of protein structure properties, several predictors of the protein's intrinsically disordered regions, and a tool for finding repeats in amino acid sequences. The highest accuracy of the constructed models is 80%, which is a significant improvement compared to the previous best available prediction, whose accuracy was 61%. Analyzing the models constructed by applying different algorithms, it was noticed that the significance of input attributes differs among the models constructed by algorithms. Using the information about amino acids belonging to intrinsically disordered regions and repeats improves the precision of prediction for some PBs using the CART classification algorithm, while this is not the case with the C5.0 classification algorithm. Improved prediction approaches can have interesting applications in protein structural model approaches or computational protein design.


Subject(s)
Data Mining , Proteins , Algorithms , Amino Acid Sequence , Amino Acids , Databases, Protein , Protein Conformation , Proteins/chemistry
2.
Data Brief ; 29: 105383, 2020 Apr.
Article in English | MEDLINE | ID: mdl-32195305

ABSTRACT

Intrinsic Disorder Proteins (IDPs) have become a hot topic since their characterisation in the 90s. The data presented in this article are related to our research entitled "A structural entropy index to analyse local conformations in Intrinsically Disordered Proteins" published in Journal of Structural Biology [1]. In this study, we quantified, for the first time, continuum from rigidity to flexibility and finally disorder. Non-disordered regions were also highlighted in the ensemble of disordered proteins. This work was done using the Protein Ensemble Database (PED), which is a useful database collecting series of protein structures considered as IDPs. The data set consists of a collection of cleaned protein files in classical pdb format that can be readily used as an input with most automatic analysis software. The accompanying data include the coding of all structural information in terms of a structural alphabet, namely Protein Blocks (PBs). An entropy index derived from PBs that allows apprehending the continuum between protein rigidity to flexibility to disorder is included, with information from secondary structure assignment, protein accessibility and prediction of disorder from the sequences. The data may be used for further structural bioinformatics studies of IDPs. It can also be used as a benchmark for evaluating disorder prediction methods.

3.
J Struct Biol ; 210(1): 107464, 2020 04 01.
Article in English | MEDLINE | ID: mdl-31978465

ABSTRACT

Sequence - structure - function paradigm has been revolutionized by the discovery of disordered regions and disordered proteins more than two decades ago. While the definition of rigidity is simple with X-ray structures, the notion of flexibility is linked to high experimental B-factors. The definition of disordered regions is more complex as in these same X-ray structures; it is associated to the position of missing residues. Thus a continuum so seems to exist between rigidity, flexibility and disorder. However, it had not been precisely described. In this study, we used an ensemble of disordered proteins (or regions) and, we applied a structural alphabet to analyse their local conformation. This structural alphabet, namely Protein Blocks, had been efficiently used to highlight rigid local domains within flexible regions and so discriminates deformability and mobility concepts. Using an entropy index derived from this structural alphabet, we underlined its interest to measure these local dynamics, and to quantify, for the first time, continuum states from rigidity to flexibility and finally disorder. We also highlight non-disordered regions in the ensemble of disordered proteins in our study.


Subject(s)
Intrinsically Disordered Proteins/chemistry , Entropy , Protein Conformation
4.
Comput Biol Chem ; 83: 107130, 2019 Dec.
Article in English | MEDLINE | ID: mdl-31593887

ABSTRACT

Three-dimensional structure of a protein chain is determined by its amino acid interactions. One approach to the analysis of amino acid interactions refers to geometric distances of amino acid pairs in polypeptide chains. For a detailed analysis of the amino acid distances, the database with three types of amino acid distances in a set of chains was created. Web application Distances of Amino Acids has also been developed to enable scientists to explore interactions of amino acids with different properties based on distances stored in the database. Web application calculates and displays descriptive statistics and graphs of amino acid pair distances with selected properties, such as geometric distance threshold, corresponding SCOP class of proteins and secondary structure types. In addition to the analysis of pre-calculated distances stored in the database, the amino acid distances of a single protein with the specified PDB identifier can also be analyzed. The web application is available at http://andromeda.matf.bg.ac.rs/aadis_dynamic/.


Subject(s)
Amino Acids/chemistry , Databases, Protein , Proteins/chemistry , Models, Molecular , Structure-Activity Relationship
SELECTION OF CITATIONS
SEARCH DETAIL
...