Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 59
Filter
1.
Front Immunol ; 14: 1177691, 2023.
Article in English | MEDLINE | ID: mdl-37492575

ABSTRACT

The previous studies on the RGD motif (aa403-405) within the SARS CoV-2 spike (S) protein receptor binding domain (RBD) suggest that the RGD motif binding integrin(s) may play an important role in infection of the host cells. We also discussed the possible role of two other integrin binding motifs that are present in S protein: LDI (aa585-587) and ECD (661-663), the motifs used by some other viruses in the course of infection. The MultiFOLD models for protein structure analysis have shown that the ECD motif is clearly accessible in the S protein, whereas the RGD and LDI motifs are partially accessible. Furthermore, the amino acids that are present in Epstein-Barr virus protein (EBV) gp42 playing very important role in binding to the HLA-DRB1 molecule and in the subsequent immune response evasion, are also present in the S protein heptad repeat-2. Our MultiFOLD model analyses have shown that these amino acids are clearly accessible on the surface in each S protein chain as monomers and in the homotrimer complex and bind to HLA-DRB1 ß chain. Therefore, they may have the identical role in SARS CoV-2 immune evasion as in EBV infection. The prediction analyses of the MHC class II binding peptides within the S protein have shown that the RGD motif is present in the core 9-mer peptide IRGDEVRQI within the two HLA-DRB1*03:01 and HLA-DRB3*01.01 strong binding 15-mer peptides suggesting that RGD motif may be the potential immune epitope. Accordingly, infected HLA-DRB1*03:01 or HLA-DRB3*01.01 positive individuals may develop high affinity anti-RGD motif antibodies that react with the RGD motif in the host proteins, like fibrinogen, thrombin or von Willebrand factor, affecting haemostasis or participating in autoimmune disorders.


Subject(s)
COVID-19 , Epstein-Barr Virus Infections , Humans , HLA-DRB1 Chains , Spike Glycoprotein, Coronavirus , Alleles , HLA-DRB3 Chains , Herpesvirus 4, Human , Peptides , SARS-CoV-2/metabolism , Amino Acids
2.
Proteins ; 91(12): 1871-1878, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37314190

ABSTRACT

In CASP15, there was a greater emphasis on multimeric modeling than in previous experiments, with assembly structures nearly doubling in number (41 up from 22) since the previous round. CASP15 also included a new estimation of model accuracy (EMA) category in recognition of the importance of objective quality assessment (QA) for quaternary structure models. ModFOLDdock is a multimeric model QA server developed by the McGuffin group at the University of Reading, which brings together a range of single-model, clustering, and deep learning methods to form a consensus of approaches. For CASP15, three variants of ModFOLDdock were developed to optimize for the different facets of the quality estimation problem. The standard ModFOLDdock variant produced predicted scores optimized for positive linear correlations with the observed scores. The ModFOLDdockR variant produced predicted scores optimized for ranking, that is, the top-ranked models have the highest accuracy. In addition, the ModFOLDdockS variant used a quasi-single model approach to score each model on an individual basis. The scores from all three variants achieved strongly positive Pearson correlation coefficients with the CASP observed scores (oligo-lDDT) in excess of 0.70, which were maintained across both homomeric and heteromeric model populations. In addition, at least one of the ModFOLDdock variants was consistently ranked in the top two methods across all three EMA categories. Specifically, for overall global fold prediction accuracy, ModFOLDdock placed second and ModFOLDdockR placed third; for overall interface quality prediction accuracy, ModFOLDdockR, ModFOLDdock, and ModFOLDdockS were placed above all other predictor methods, and ModFOLDdockR and ModFOLDdockS were placed second and third respectively for individual residue confidence scores. The ModFOLDdock server is available at: https://www.reading.ac.uk/bioinf/ModFOLDdock/. ModFOLDdock is also available as part of the MultiFOLD docker package: https://hub.docker.com/r/mcguffin/multifold.


Subject(s)
Proteins , Software , Protein Conformation , Proteins/genetics , Proteins/chemistry , Models, Molecular , Computational Biology
3.
Bioinform Adv ; 3(1): vbad078, 2023.
Article in English | MEDLINE | ID: mdl-37359722

ABSTRACT

Motivation: The accuracy gap between predicted and experimental structures has been significantly reduced following the development of AlphaFold2 (AF2). However, for many targets, AF2 models still have room for improvement. In previous CASP experiments, highly computationally intensive MD simulation-based methods have been widely used to improve the accuracy of single 3D models. Here, our ReFOLD pipeline was adapted to refine AF2 predictions while maintaining high model accuracy at a modest computational cost. Furthermore, the AF2 recycling process was utilized to improve 3D models by using them as custom template inputs for tertiary and quaternary structure predictions. Results: According to the Molprobity score, 94% of the generated 3D models by ReFOLD were improved. AF2 recycling showed an improvement rate of 87.5% (using MSAs) and 81.25% (using single sequences) for monomeric AF2 models and 100% (MSA) and 97.8% (single sequence) for monomeric non-AF2 models, as measured by the average change in lDDT. By the same measure, the recycling of multimeric models showed an improvement rate of as much as 80% for AF2-Multimer (AF2M) models and 94% for non-AF2M models. Availability and implementation: Refinement using AlphaFold2-Multimer recycling is available as part of the MultiFOLD docker package (https://hub.docker.com/r/mcguffin/multifold). The ReFOLD server is available at https://www.reading.ac.uk/bioinf/ReFOLD/ and the modified scripts can be downloaded from https://www.reading.ac.uk/bioinf/downloads/. Supplementary information: Supplementary data are available at Bioinformatics Advances online.

4.
Nucleic Acids Res ; 51(W1): W274-W280, 2023 07 05.
Article in English | MEDLINE | ID: mdl-37102670

ABSTRACT

The IntFOLD server based at the University of Reading has been a leading method over the past decade in providing free access to accurate prediction of protein structures and functions. In a post-AlphaFold2 world, accurate models of tertiary structures are widely available for even more protein targets, so there has been a refocus in the prediction community towards the accurate modelling of protein-ligand interactions as well as modelling quaternary structure assemblies. In this paper, we describe the latest improvements to IntFOLD, which maintains its competitive structure prediction performance by including the latest deep learning methods while also integrating accurate model quality estimates and 3D models of protein-ligand interactions. Furthermore, we also introduce our two new server methods: MultiFOLD for accurately modelling both tertiary and quaternary structures, with performance which has been independently verified to outperform the standard AlphaFold2 methods, and ModFOLDdock, which provides world-leading quality estimates for quaternary structure models. The IntFOLD7, MultiFOLD and ModFOLDdock servers are available at: https://www.reading.ac.uk/bioinf/.


Subject(s)
Proteins , Software , Ligands , Protein Structure, Tertiary , Models, Molecular , Proteins/chemistry , Protein Conformation
5.
Methods Mol Biol ; 2627: 101-118, 2023.
Article in English | MEDLINE | ID: mdl-36959444

ABSTRACT

Protein structure modeling is one of the most advanced and complex processes in computational biology. One of the major problems for the protein structure prediction field has been how to estimate the accuracy of the predicted 3D models, on both a local and global level, in the absence of known structures. We must be able to accurately measure the confidence that we have in the quality predicted 3D models of proteins for them to become widely adopted by the general bioscience community. To address this major issue, it was necessary to develop new model quality assessment (MQA) methods and integrate them into our pipelines for building 3D protein models. Our MQA method, called ModFOLD, has been ranked as one of the most accurate MQA tools in independent blind evaluations. This chapter discusses model quality assessment in the protein modeling field, demonstrating both its strengths and limitations. We also present some of the best methods according to independent benchmarking data, which has been gathered in recent years.


Subject(s)
Computational Biology , Proteins , Models, Molecular , Proteins/chemistry , Computational Biology/methods , Benchmarking , Protein Conformation , Software
6.
Methods Mol Biol ; 2627: 119-140, 2023.
Article in English | MEDLINE | ID: mdl-36959445

ABSTRACT

The refinement of predicted 3D models aims to bring them closer to the native structure by fixing errors including unusual bonds and torsion angles and irregular hydrogen bonding patterns. Refinement approaches based on molecular dynamics (MD) simulations using different types of restraints have performed well since CASP10. ReFOLD, developed by the McGuffin group, was one of the many MD-based refinement approaches, which were tested in CASP 12. When the performance of the ReFOLD method in CASP12 was evaluated, it was observed that ReFOLD suffered from the absence of a reliable guidance mechanism to reach consistent improvement for the quality of predicted 3D models, particularly in the case of template-based modelling (TBM) targets. Therefore, here we propose to utilize the local quality assessment score produced by ModFOLD6 to guide the MD-based refinement approach to further increase the accuracy of the predicted 3D models. The relative performance of the new local quality assessment guided MD-based refinement protocol and the original MD-based protocol ReFOLD are compared utilizing many different official scoring methods. By using the per-residue accuracy (or local quality) score to guide the refinement process, we are able to prevent the refined models from undesired structural deviations, thereby leading to more consistent improvements. This chapter will include a detailed analysis of the performance of the local quality assessment guided MD-based protocol versus that deployed in the original ReFOLD method.


Subject(s)
Computational Biology , Molecular Dynamics Simulation , Protein Conformation , Computational Biology/methods , Proteins/chemistry , Hydrogen Bonding
7.
Methods Mol Biol ; 2305: 23-52, 2021.
Article in English | MEDLINE | ID: mdl-33950383

ABSTRACT

Biologists are increasingly aware of the importance of protein structure in revealing function. The computational tools now exist which allow researchers to model unknown proteins simply on the basis of their primary sequence. However, for the non-specialist bioinformatician, there is a dazzling array of terminology, acronyms, and competing computer software available for this process. This review is intended to highlight the key stages of computational protein structure prediction, as well as explain the reasons behind some of the procedures and list some established workarounds for common pitfalls. Thereafter follows a review of five one-stop servers for start-to-finish structure prediction.


Subject(s)
Computational Biology/methods , Molecular Docking Simulation/methods , Protein Conformation , Proteins/chemistry , Structural Homology, Protein , Algorithms , Amino Acid Sequence , Computer Simulation , Databases, Protein , Software
8.
Biochem J ; 478(11): 2121-2143, 2021 06 11.
Article in English | MEDLINE | ID: mdl-34032269

ABSTRACT

The Ser/Thr kinase MAP4K4, like other GCKIV kinases, has N-terminal kinase and C-terminal citron homology (CNH) domains. MAP4K4 can activate c-Jun N-terminal kinases (JNKs), and studies in the heart suggest it links oxidative stress to JNKs and heart failure. In other systems, MAP4K4 is regulated in striatin-interacting phosphatase and kinase (STRIPAK) complexes, in which one of three striatins tethers PP2A adjacent to a kinase to keep it dephosphorylated and inactive. Our aim was to understand how MAP4K4 is regulated in cardiomyocytes. The rat MAP4K4 gene was not properly defined. We identified the first coding exon of the rat gene using 5'-RACE, we cloned the full-length sequence and confirmed alternative-splicing of MAP4K4 in rat cardiomyocytes. We identified an additional α-helix C-terminal to the kinase domain important for kinase activity. In further studies, FLAG-MAP4K4 was expressed in HEK293 cells or cardiomyocytes. The Ser/Thr protein phosphatase inhibitor calyculin A (CalA) induced MAP4K4 hyperphosphorylation, with phosphorylation of the activation loop and extensive phosphorylation of the linker between the kinase and CNH domains. This required kinase activity. MAP4K4 associated with myosin in untreated cardiomyocytes, and this was lost with CalA-treatment. FLAG-MAP4K4 associated with all three striatins in cardiomyocytes, indicative of regulation within STRIPAK complexes and consistent with activation by CalA. Computational analysis suggested the interaction was direct and mediated via coiled-coil domains. Surprisingly, FLAG-MAP4K4 inhibited JNK activation by H2O2 in cardiomyocytes and increased myofibrillar organisation. Our data identify MAP4K4 as a STRIPAK-regulated kinase in cardiomyocytes, and suggest it regulates the cytoskeleton rather than activates JNKs.


Subject(s)
Alternative Splicing , Calmodulin-Binding Proteins/metabolism , Intracellular Signaling Peptides and Proteins/metabolism , JNK Mitogen-Activated Protein Kinases/metabolism , Membrane Proteins/metabolism , Mutation , Myocytes, Cardiac/metabolism , Nerve Tissue Proteins/metabolism , Protein Serine-Threonine Kinases/metabolism , Amino Acid Sequence , Animals , Calmodulin-Binding Proteins/genetics , Female , HEK293 Cells , Humans , Intracellular Signaling Peptides and Proteins/genetics , JNK Mitogen-Activated Protein Kinases/genetics , Membrane Proteins/genetics , Nerve Tissue Proteins/genetics , Phosphorylation , Protein Conformation , Protein Interaction Domains and Motifs , Protein Isoforms , Protein Serine-Threonine Kinases/genetics , Rats , Rats, Sprague-Dawley , Sequence Homology
9.
Nucleic Acids Res ; 49(W1): W425-W430, 2021 07 02.
Article in English | MEDLINE | ID: mdl-33963867

ABSTRACT

Methods for estimating the quality of 3D models of proteins are vital tools for driving the acceptance and utility of predicted tertiary structures by the wider bioscience community. Here we describe the significant major updates to ModFOLD, which has maintained its position as a leading server for the prediction of global and local quality of 3D protein models, over the past decade (>20 000 unique external users). ModFOLD8 is the latest version of the server, which combines the strengths of multiple pure-single and quasi-single model methods. Improvements have been made to the web server interface and there has been successive increases in prediction accuracy, which were achieved through integration of newly developed scoring methods and advanced deep learning-based residue contact predictions. Each version of the ModFOLD server has been independently blind tested in the biennial CASP experiments, as well as being continuously evaluated via the CAMEO project. In CASP13 and CASP14, the ModFOLD7 and ModFOLD8 variants ranked among the top 10 quality estimation methods according to almost every official analysis. Prior to CASP14, ModFOLD8 was also applied for the evaluation of SARS-CoV-2 protein models as part of CASP Commons 2020 initiative. The ModFOLD8 server is freely available at: https://www.reading.ac.uk/bioinf/ModFOLD/.


Subject(s)
Computers , Models, Molecular , Neural Networks, Computer , Protein Conformation , Protein Folding , Proteins/chemistry , Software , Reproducibility of Results , Research Design , SARS-CoV-2/chemistry , Viral Proteins/chemistry
10.
Nucleic Acids Res ; 49(W1): W589-W596, 2021 07 02.
Article in English | MEDLINE | ID: mdl-34009387

ABSTRACT

ReFOLD3 is unique in its application of gradual restraints, calculated from local model quality estimates and contact predictions, which are used to guide the refinement of theoretical 3D protein models towards the native structures. ReFOLD3 achieves improved performance by using an iterative refinement protocol to fix incorrect residue contacts and local errors, including unusual bonds and angles, which are identified in the submitted models by our leading ModFOLD8 model quality assessment method. Following refinement, the likely resulting improvements to the submitted models are recognized by ModFOLD8, which produces both global and local quality estimates. During the CASP14 prediction season (May-Aug 2020), we used the ReFOLD3 protocol to refine hundreds of 3D models, for both the refinement and the main tertiary structure prediction categories. Our group improved the global and local quality scores for numerous starting models in the refinement category, where we ranked in the top 10 according to the official assessment. The ReFOLD3 protocol was also used for the refinement of the SARS-CoV-2 targets as a part of the CASP Commons COVID-19 initiative, and we provided a significant number of the top 10 models. The ReFOLD3 web server is freely available at https://www.reading.ac.uk/bioinf/ReFOLD/.


Subject(s)
Computers , Internet , Models, Molecular , Protein Conformation , Protein Folding , Proteins/chemistry , Software , Reproducibility of Results , SARS-CoV-2/chemistry , Viral Proteins/chemistry
11.
Blood ; 137(6): 830-843, 2021 02 11.
Article in English | MEDLINE | ID: mdl-32822477

ABSTRACT

Connexins oligomerise to form hexameric hemichannels in the plasma membrane that can further dock together on adjacent cells to form gap junctions and facilitate intercellular trafficking of molecules. In this study, we report the expression and function of an orphan connexin, connexin-62 (Cx62), in human and mouse (Cx57, mouse homolog) platelets. A novel mimetic peptide (62Gap27) was developed to target the second extracellular loop of Cx62, and 3-dimensional structural models predicted its interference with gap junction and hemichannel function. The ability of 62Gap27 to regulate both gap junction and hemichannel-mediated intercellular communication was observed using fluorescence recovery after photobleaching analysis and flow cytometry. Cx62 inhibition by 62Gap27 suppressed a range of agonist-stimulated platelet functions and impaired thrombosis and hemostasis. This was associated with elevated protein kinase A-dependent signaling in a cyclic adenosine monophosphate-independent manner and was not observed in Cx57-deficient mouse platelets (in which the selectivity of 62Gap27 for this connexin was also confirmed). Notably, Cx62 hemichannels were observed to function independently of Cx37 and Cx40 hemichannels. Together, our data reveal a fundamental role for a hitherto uncharacterized connexin in regulating the function of circulating cells.


Subject(s)
Blood Platelets/metabolism , Connexins/physiology , Animals , Cell Communication/physiology , Cell Line , Connexins/blood , Connexins/chemistry , Connexins/deficiency , Connexins/genetics , Cyclic AMP-Dependent Protein Kinases/metabolism , Gap Junctions/physiology , Hemostasis/physiology , Humans , Integrins/blood , Megakaryocytes/metabolism , Mice , Mice, Inbred C57BL , Mice, Transgenic , Models, Molecular , Molecular Docking Simulation , Peptide Fragments/chemical synthesis , Peptide Fragments/pharmacology , Platelet Adhesiveness , Platelet Aggregation , Protein Conformation , Protein Multimerization , Structure-Activity Relationship , Thrombosis/blood
12.
Methods Mol Biol ; 2165: 69-81, 2020.
Article in English | MEDLINE | ID: mdl-32621219

ABSTRACT

Assessing the accuracy of 3D models has become a keystone in the protein structure prediction field. ModFOLD7 is our leading resource for Estimates of Model Accuracy (EMA), which has been upgraded by integrating a number of the pioneering pure-single- and quasi-single-model approaches. Such an integration has given our latest version the strengths to accurately score and rank predicted models, with higher consistency compared to older EMA methods. Additionally, the server provides three options for producing global score estimates, depending on the requirements of the user: (1) ModFOLD7_rank, which is optimized for ranking/selection, (2) ModFOLD7_cor, which is optimized for correlations of predicted and observed scores, and (3) ModFOLD7 global for balanced performance. ModFOLD7 has been ranked among the top few EMA methods according to independent blind testing by the CASP13 assessors. Another evaluation resource for ModFOLD7 is the CAMEO project, where the method is continuously automatically evaluated, showing a significant improvement compared to our previous versions. The ModFOLD7 server is freely available at http://www.reading.ac.uk/bioinf/ModFOLD/ .


Subject(s)
Molecular Dynamics Simulation/standards , Protein Conformation , Sequence Analysis, Protein/standards , Software/standards
13.
Cell Commun Signal ; 18(1): 92, 2020 06 11.
Article in English | MEDLINE | ID: mdl-32527260

ABSTRACT

BACKGROUND: The past decade has seen the rise of omics data for the understanding of biological systems in health and disease. This wealth of information includes protein-protein interaction (PPI) data derived from both low- and high-throughput assays, which are curated into multiple databases that capture the extent of available information from the peer-reviewed literature. Although these curation efforts are extremely useful, reliably downloading and integrating PPI data from the variety of available repositories is challenging and time consuming. METHODS: We here present a novel user-friendly web-resource called PINOT (Protein Interaction Network Online Tool; available at http://www.reading.ac.uk/bioinf/PINOT/PINOT_form.html) to optimise the collection and processing of PPI data from IMEx consortium associated repositories (members and observers) and WormBase, for constructing, respectively, human and Caenorhabditis elegans PPI networks. RESULTS: Users submit a query containing a list of proteins of interest for which PINOT extracts data describing PPIs. At every query submission PPI data are downloaded, merged and quality assessed. Then each PPI is confidence scored based on the number of distinct methods used for interaction detection and the number of publications that report the specific interaction. Examples of how PINOT can be applied are provided to highlight the performance, ease of use and potential utility of this tool. CONCLUSIONS: PINOT is a tool that allows users to survey the curated literature, extracting PPI data in relation to a list of proteins of interest. PINOT extracts a similar numbers of PPIs as other, analogous, tools and incorporates a set of innovative features. PINOT is able to process large queries, it downloads human PPIs live through PSICQUIC and it applies quality control filters on the downloaded PPI data (i.e. removing the need for manual inspection by the user). PINOT provides the user with information on detection methods and publication history for each downloaded interaction data entry and outputs the results in a table format that can be straightforwardly further customised and/or directly uploaded into network visualization software. Video abstract.


Subject(s)
Computational Biology , Protein Interaction Mapping/methods , Protein Interaction Maps , Proteins/metabolism , Software , Humans , Internet
14.
Proteins ; 87(12): 1361-1377, 2019 12.
Article in English | MEDLINE | ID: mdl-31265154

ABSTRACT

Methods to reliably estimate the accuracy of 3D models of proteins are both a fundamental part of most protein folding pipelines and important for reliable identification of the best models when multiple pipelines are used. Here, we describe the progress made from CASP12 to CASP13 in the field of estimation of model accuracy (EMA) as seen from the progress of the most successful methods in CASP13. We show small but clear progress, that is, several methods perform better than the best methods from CASP12 when tested on CASP13 EMA targets. Some progress is driven by applying deep learning and residue-residue contacts to model accuracy prediction. We show that the best EMA methods select better models than the best servers in CASP13, but that there exists a great potential to improve this further. Also, according to the evaluation criteria based on local similarities, such as lDDT and CAD, it is now clear that single model accuracy methods perform relatively better than consensus-based methods.


Subject(s)
Computational Biology , Protein Conformation , Proteins/ultrastructure , Software , Algorithms , Databases, Protein , Models, Molecular , Protein Folding , Proteins/chemistry , Proteins/genetics , Sequence Alignment , Sequence Analysis, Protein
15.
PLoS One ; 14(6): e0218120, 2019.
Article in English | MEDLINE | ID: mdl-31199821

ABSTRACT

Arabidopsis thaliana's VirE2-Interacting Protein 1 (VIP1) interacts with Agrobacterium tumefaciens VirE2 protein and regulates stress responses and plant immunity signaling occurring downstream of the Mitogen-Activated Protein Kinase (MPK3) signal transduction pathway. In this study, a full-length cDNA of 972bp encoding HvVIP1 was obtained from barley (Hordeum vulgare L.) leaves. A corresponding 323 amino acid poly-peptide was shown to carry the conserved bZIP (Basic Leucine Zipper) domain within its 157th and 223rd amino acid residue. 13 non-synonymous SNPs were spotted within the HvVIP1 bZIP domain sequence when compared with AtVIP1. Moreover, minor differences in the bZIP domain locations and lengths were noted when comparing Arabidopsis thaliana and Hordeum vulgare VIP1 proteins through the 3D models, structural domain predictions and disorder prediction profiling. The expression of HvVIP1 was stable in barley tissues infected by pathogen (whether Agrobacterium tumefaciens or Fusarium culmorum), but was induced at specific time points. We found a strong correlation between the transcript accumulation of HvVIP1 and barley PR- genes HvPR1, HvPR4 and HvPR10, but not with HvPR3 and HvPR5, probably due to low induction of those particular genes. In addition, a gene encoding for a member of the barley MAPK family, HvMPK1, showed significantly higher expression after pathogenic infection of barley cells. Collectively, our results might suggest that early expression of PR genes upon infection in barley cells play a pivotal role in the Agrobacterium-resistance of this plant.


Subject(s)
Agrobacterium tumefaciens/growth & development , Fusarium/growth & development , Gene Expression Regulation, Plant , Hordeum , Plant Diseases/microbiology , Plant Proteins/biosynthesis , Stress, Physiological , Disease Resistance , Gene Expression Profiling , Hordeum/metabolism , Hordeum/microbiology , Leucine Zippers
16.
Nucleic Acids Res ; 47(W1): W408-W413, 2019 07 02.
Article in English | MEDLINE | ID: mdl-31045208

ABSTRACT

The IntFOLD server provides a unified resource for the automated prediction of: protein tertiary structures with built-in estimates of model accuracy (EMA), protein structural domain boundaries, natively unstructured or disordered regions in proteins, and protein-ligand interactions. The component methods have been independently evaluated via the successive blind CASP experiments and the continual CAMEO benchmarking project. The IntFOLD server has established its ranking as one of the best performing publicly available servers, based on independent official evaluation metrics. Here, we describe significant updates to the server back end, where we have focused on performance improvements in tertiary structure predictions, in terms of global 3D model quality and accuracy self-estimates (ASE), which we achieve using our newly improved ModFOLD7_rank algorithm. We also report on various upgrades to the front end including: a streamlined submission process, enhanced visualization of models, new confidence scores for ranking, and links for accessing all annotated model data. Furthermore, we now include an option for users to submit selected models for further refinement via convenient push buttons. The IntFOLD server is freely available at: http://www.reading.ac.uk/bioinf/IntFOLD/.


Subject(s)
Algorithms , Proteins/chemistry , Software , Amino Acid Sequence , Animals , Benchmarking , Binding Sites , Gene Ontology , Humans , Internet , Ligands , Models, Molecular , Molecular Sequence Annotation , Protein Binding , Protein Conformation, alpha-Helical , Protein Conformation, beta-Strand , Protein Folding , Protein Interaction Domains and Motifs , Protein Structure, Tertiary , Proteins/metabolism , Sequence Alignment , Sequence Homology, Amino Acid
17.
Sci Rep ; 8(1): 9939, 2018 07 02.
Article in English | MEDLINE | ID: mdl-29967418

ABSTRACT

Every two years groups worldwide participate in the Critical Assessment of Protein Structure Prediction (CASP) experiment to blindly test the strengths and weaknesses of their computational methods. CASP has significantly advanced the field but many hurdles still remain, which may require new ideas and collaborations. In 2012 a web-based effort called WeFold, was initiated to promote collaboration within the CASP community and attract researchers from other fields to contribute new ideas to CASP. Members of the WeFold coopetition (cooperation and competition) participated in CASP as individual teams, but also shared components of their methods to create hybrid pipelines and actively contributed to this effort. We assert that the scale and diversity of integrative prediction pipelines could not have been achieved by any individual lab or even by any collaboration among a few partners. The models contributed by the participating groups and generated by the pipelines are publicly available at the WeFold website providing a wealth of data that remains to be tapped. Here, we analyze the results of the 2014 and 2016 pipelines showing improvements according to the CASP assessment as well as areas that require further adjustments and research.


Subject(s)
Caspase 12/metabolism , Caspases/metabolism , Computational Biology/methods , Models, Molecular , Software , Caspase 12/chemistry , Caspases/chemistry , Humans , Protein Conformation
18.
Proteins ; 86 Suppl 1: 361-373, 2018 03.
Article in English | MEDLINE | ID: mdl-28975666

ABSTRACT

Methods to reliably estimate the quality of 3D models of proteins are essential drivers for the wide adoption and serious acceptance of protein structure predictions by life scientists. In this article, the most successful groups in CASP12 describe their latest methods for estimates of model accuracy (EMA). We show that pure single model accuracy estimation methods have shown clear progress since CASP11; the 3 top methods (MESHI, ProQ3, SVMQA) all perform better than the top method of CASP11 (ProQ2). Although the pure single model accuracy estimation methods outperform quasi-single (ModFOLD6 variations) and consensus methods (Pcons, ModFOLDclust2, Pcomb-domain, and Wallner) in model selection, they are still not as good as those methods in absolute model quality estimation and predictions of local quality. Finally, we show that when using contact-based model quality measures (CAD, lDDT) the single model quality methods perform relatively better.


Subject(s)
Computational Biology/methods , Models, Molecular , Protein Conformation , Proteins/chemistry , Databases, Protein , Humans , Sequence Alignment , Sequence Analysis, Protein
19.
Proteins ; 86 Suppl 1: 335-344, 2018 03.
Article in English | MEDLINE | ID: mdl-28748648

ABSTRACT

Our aim in CASP12 was to improve our Template-Based Modeling (TBM) methods through better model selection, accuracy self-estimate (ASE) scores and refinement. To meet this aim, we developed two new automated methods, which we used to score, rank, and improve upon the provided server models. Firstly, the ModFOLD6_rank method, for improved global Quality Assessment (QA), model ranking and the detection of local errors. Secondly, the ReFOLD method for fixing errors through iterative QA guided refinement. For our automated predictions we developed the IntFOLD4-TS protocol, which integrates the ModFOLD6_rank method for scoring the multiple-template models that were generated using a number of alternative sequence-structure alignments. Overall, our selection of top models and ASE scores using ModFOLD6_rank was an improvement on our previous approaches. In addition, it was worthwhile attempting to repair the detected errors in the top selected models using ReFOLD, which gave us an overall gain in performance. According to the assessors' formula, the IntFOLD4 server ranked 3rd/5th (average Z-score > 0.0/-2.0) on the server only targets, and our manual predictions (McGuffin group) ranked 1st/2nd (average Z-score > -2.0/0.0) compared to all other groups.


Subject(s)
Computational Biology/methods , Models, Molecular , Protein Conformation , Protein Folding , Proteins/chemistry , Software , Databases, Protein , Humans , Sequence Alignment , Sequence Analysis, Protein
20.
Nucleic Acids Res ; 45(W1): W416-W421, 2017 07 03.
Article in English | MEDLINE | ID: mdl-28460136

ABSTRACT

Methods that reliably estimate the likely similarity between the predicted and native structures of proteins have become essential for driving the acceptance and adoption of three-dimensional protein models by life scientists. ModFOLD6 is the latest version of our leading resource for Estimates of Model Accuracy (EMA), which uses a pioneering hybrid quasi-single model approach. The ModFOLD6 server integrates scores from three pure-single model methods and three quasi-single model methods using a neural network to estimate local quality scores. Additionally, the server provides three options for producing global score estimates, depending on the requirements of the user: (i) ModFOLD6_rank, which is optimized for ranking/selection, (ii) ModFOLD6_cor, which is optimized for correlations of predicted and observed scores and (iii) ModFOLD6 global for balanced performance. The ModFOLD6 methods rank among the top few for EMA, according to independent blind testing by the CASP12 assessors. The ModFOLD6 server is also continuously automatically evaluated as part of the CAMEO project, where significant performance gains have been observed compared to our previous server and other publicly available servers. The ModFOLD6 server is freely available at: http://www.reading.ac.uk/bioinf/ModFOLD/.


Subject(s)
Models, Molecular , Protein Conformation , Software , Internet , Sequence Analysis, Protein
SELECTION OF CITATIONS
SEARCH DETAIL
...