Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Language
Publication year range
1.
Protein Eng Des Sel ; 24(5): 455-61, 2011 May.
Article in English | MEDLINE | ID: mdl-21282334

ABSTRACT

Computational sequence design methods are used to engineer proteins with desired properties such as increased thermal stability and novel function. In addition, these algorithms can be used to identify an envelope of sequences that may be compatible with a particular protein fold topology. In this regard, we hypothesized that sequence-property prediction, specifically secondary structure, could be significantly enhanced by using a large database of computationally designed sequences. We performed a large-scale test of this hypothesis with 6511 diverse protein domains and 50 designed sequences per domain. After analysis of the inherent accuracy of the designed sequences database, we realized that it was necessary to put constraints on what fraction of the native sequence should be allowed to change. With mutational constraints, accuracy was improved vs. no constraints, but the diversity of designed sequences, and hence effective size of the database, was moderately reduced. Overall, the best three-state prediction accuracy (Q(3)) that we achieved was nearly a percentage point improved over using a natural sequence database alone, well below the theoretical possibility for improvement of 8-10 percentage points. Furthermore, our nascent method was used to augment the state-of-the-art PSIPRED program by a percentage point.


Subject(s)
Computational Biology/methods , Protein Engineering/methods , Proteins/chemistry , Proteins/genetics , Algorithms , Amino Acid Sequence , Databases, Protein , Fuzzy Logic , Neural Networks, Computer , Protein Structure, Secondary , Protein Structure, Tertiary , Proteins/metabolism
2.
J Pharmacol Exp Ther ; 336(3): 672-81, 2011 Mar.
Article in English | MEDLINE | ID: mdl-21159752

ABSTRACT

Surfactant protein-A (SP-A) and Toll-like receptor-4 (TLR4) proteins are recognized as pathogen-recognition receptors. An exaggerated activation of TLR4 induces inflammatory response, whereas SP-A protein down-regulates inflammation. We hypothesized that SP-A-TLR4 interaction may lead to inhibition of inflammation. In this study, we investigated interaction between native baboon lung SP-A and baboon and human TLR4-MD2 proteins by coimmunoprecipitation/immunoblotting and microwell-based methods. The interaction between SP-A and TLR4-MD2 proteins was then analyzed using a bioinformatics approach. In the in silico model of SP-A-TLR4-MD2 complex, we identified potential binding regions and amino acids at the interface of SP-A-TLR4. Using this information, we synthesized a library of human SP-A-derived peptides that contained interacting amino acids. Next, we tested whether the TLR4-interacting SP-A peptides would suppress inflammatory cytokines. The peptides were screened for any changes in the tumor necrosis factor-α (TNF-α) response against lipopolysaccharide (LPS) stimuli in the mouse JAWS II dendritic cell line. Different approaches used in this study suggested binding between SP-A and TLR4-MD2 proteins. In cells pretreated with peptides, three of seven peptides increased TNF-α production against LPS. However, two of these peptides (SPA4: GDFRYSDGTPVNYTNWYRGE and SPA5: YVGLTEGPSPGDFRYSDFTP) decreased the TNF-α production in LPS-challenged JAWS II dendritic cells; SPA4 peptide showed more pronounced inhibitory effect than SPA5 peptide. In conclusion, we identify a human SP-A-derived peptide (SPA4 peptide) that interacts with TLR4-MD2 protein and inhibits the LPS-stimulated release of TNF-α in JAWS II dendritic cells.


Subject(s)
Dendritic Cells/metabolism , Lung/metabolism , Peptide Fragments/metabolism , Pulmonary Surfactant-Associated Protein A/physiology , Toll-Like Receptor 4/metabolism , Tumor Necrosis Factor-alpha/antagonists & inhibitors , Tumor Necrosis Factor-alpha/metabolism , Amino Acid Sequence , Animals , Cell Line, Transformed , Humans , Lung/cytology , Mice , Mice, Inbred C57BL , Molecular Sequence Data , Papio anubis , Peptide Fragments/physiology , Protein Binding/physiology , Toll-Like Receptor 4/physiology
3.
Article in English | MEDLINE | ID: mdl-21096109

ABSTRACT

Most high-throughput experimental results of protein-protein interactions (PPIs) are seemingly inconsistent with each other. In this article, we re-evaluated these contradictions within the context of the underlying domain-domain interactions (DDIs) for two Escherichia coli and four Saccharomyces cerevisiae PPI datasets derived from high-throughput (yeast two-hybrid and tandem affinity purification) experimental platforms. For shared DDIs across pairs of compared datasets, we observed a remarkably high pair-wise correlation (Pearson correlation coefficient between 0.80 and 0.84) between datasets of the same organism derived from the same experimental platform. To a lesser degree, this concordance also held true for more general inter-platform and intra-species comparisons (Pearson correlation coefficient between 0.52 and 0.89). Thus, although varying experimental conditions can influence the ability of individual proteins to interact and, therefore, create apparent differences among PPIs, the physical nature of the underlying interactions, captured by DDIs, is the same and can be used to model and predict PPIs.


Subject(s)
Algorithms , Database Management Systems , Databases, Protein , Information Storage and Retrieval/methods , Protein Interaction Mapping/methods
4.
PLoS One ; 4(7): e6254, 2009 Jul 16.
Article in English | MEDLINE | ID: mdl-19606223

ABSTRACT

BACKGROUND: Protein structures are critical for understanding the mechanisms of biological systems and, subsequently, for drug and vaccine design. Unfortunately, protein sequence data exceed structural data by a factor of more than 200 to 1. This gap can be partially filled by using computational protein structure prediction. While structure prediction Web servers are a notable option, they often restrict the number of sequence queries and/or provide a limited set of prediction methodologies. Therefore, we present a standalone protein structure prediction software package suitable for high-throughput structural genomic applications that performs all three classes of prediction methodologies: comparative modeling, fold recognition, and ab initio. This software can be deployed on a user's own high-performance computing cluster. METHODOLOGY/PRINCIPAL FINDINGS: The pipeline consists of a Perl core that integrates more than 20 individual software packages and databases, most of which are freely available from other research laboratories. The query protein sequences are first divided into domains either by domain boundary recognition or Bayesian statistics. The structures of the individual domains are then predicted using template-based modeling or ab initio modeling. The predicted models are scored with a statistical potential and an all-atom force field. The top-scoring ab initio models are annotated by structural comparison against the Structural Classification of Proteins (SCOP) fold database. Furthermore, secondary structure, solvent accessibility, transmembrane helices, and structural disorder are predicted. The results are generated in text, tab-delimited, and hypertext markup language (HTML) formats. So far, the pipeline has been used to study viral and bacterial proteomes. CONCLUSIONS: The standalone pipeline that we introduce here, unlike protein structure prediction Web servers, allows users to devote their own computing assets to process a potentially unlimited number of queries as well as perform resource-intensive ab initio structure prediction.


Subject(s)
Proteins/chemistry , Amino Acid Sequence , Cluster Analysis , Databases, Protein , Molecular Sequence Data , Programming Languages , Protein Conformation , Sequence Homology, Amino Acid , User-Computer Interface
5.
Nucleic Acids Res ; 37(2): 452-62, 2009 Feb.
Article in English | MEDLINE | ID: mdl-19056827

ABSTRACT

Protein domain prediction is often the preliminary step in both experimental and computational protein research. Here we present a new method to predict the domain boundaries of a multidomain protein from its amino acid sequence using a fuzzy mean operator. Using the nr-sequence database together with a reference protein set (RPS) containing known domain boundaries, the operator is used to assign a likelihood value for each residue of the query sequence as belonging to a domain boundary. This procedure robustly identifies contiguous boundary regions. For a dataset with a maximum sequence identity of 30%, the average domain prediction accuracy of our method is 97% for one domain proteins and 58% for multidomain proteins. The presented model is capable of using new sequence/structure information without re-parameterization after each RPS update. When tested on a current database using a four year old RPS and on a database that contains different domain definitions than those used to train the models, our method consistently yielded the same accuracy while two other published methods did not. A comparison with other domain prediction methods used in the CASP7 competition indicates that our method performs better than existing sequence-based methods.


Subject(s)
Protein Structure, Tertiary , Sequence Analysis, Protein/methods , Software , Databases, Protein , Sequence Homology, Amino Acid
6.
Article in English | MEDLINE | ID: mdl-19642280

ABSTRACT

Solvent accessibility is an important structural feature for a protein. We propose a new method for solvent accessibility prediction that uses known structure and sequence information more efficiently. We first estimate the relative solvent accessibility of the query protein using fuzzy mean operator from the solvent accessibilities of known structure fragments that have similar sequences to the query protein. We then integrate the estimated solvent accessibility and the position specific scoring matrix of the query protein using a neural network. We tested our method on a large data set consisting of 3386 non-redundant proteins. The comparison with other methods show slightly improved prediction accuracies with our method. The resulting system does need not be re-trained when new data is available. We incorporated our method into the MUPRED system, which is available as a web server at http://digbio.missouri.edu/mupred.


Subject(s)
Models, Chemical , Models, Molecular , Proteins/chemistry , Proteins/ultrastructure , Sequence Analysis, Protein/methods , Solvents/chemistry , Amino Acid Sequence , Binding Sites , Computer Simulation , Molecular Sequence Data , Protein Binding , Protein Conformation
7.
Proteins ; 66(3): 664-70, 2007 Feb 15.
Article in English | MEDLINE | ID: mdl-17109407

ABSTRACT

Predicting secondary structures from a protein sequence is an important step for characterizing the structural properties of a protein. Existing methods for protein secondary structure prediction can be broadly classified into template based or sequence profile based methods. We propose a novel framework that bridges the gap between the two fundamentally different approaches. Our framework integrates the information from the fuzzy k-nearest neighbor algorithm and position-specific scoring matrices using a neural network. It combines the strengths of the two methods and has a better potential to use the information in both the sequence and structure databases than existing methods. We implemented the framework into a software system MUPRED. MUPRED has achieved three-state prediction accuracy (Q3) ranging from 79.2 to 80.14%, depending on which benchmark dataset is used. A higher Q3 can be achieved if a query protein has a significant sequence identity (>25%) to a template in PDB. MUPRED also estimates the prediction accuracy at the individual residue level more quantitatively than existing methods. The MUPRED web server and executables are freely available at http://digbio.missouri.edu/mupred.


Subject(s)
Amino Acid Sequence , Proteins/chemistry , Templates, Genetic , Algorithms , Models, Molecular , Protein Structure, Secondary , Reproducibility of Results
8.
Conf Proc IEEE Eng Med Biol Soc ; 2006: 5826-9, 2006.
Article in English | MEDLINE | ID: mdl-17946724

ABSTRACT

We propose a new approach for the protein tertiary structure prediction based on the concept of mini-threading. The method identifies useful fragments in Protein Data Bank (PDB) with variable lengths and retrieves spatial restraints. The multidimensional scaling method and least-squares minimization are used to build coarse-grain structural models. Our method uses the information in the PDB efficiently and the prediction time is in minutes when compared to hours and days required by existing methods.


Subject(s)
Computational Biology/methods , Databases, Protein , Protein Structure, Tertiary , Proteins/chemistry , Sequence Analysis, Protein , Algorithms , Computational Biology/instrumentation , Computer Simulation , Models, Statistical , Monte Carlo Method , Protein Conformation , Protein Structure, Secondary , Time Factors
SELECTION OF CITATIONS
SEARCH DETAIL
...