Pesquisa | Portal Regional da BVS (teste)

Can computationally designed protein sequences improve secondary structure prediction?

Bondugula, Rajkumar; Wallqvist, Anders; Lee, Michael S.

Protein Eng Des Sel ; 24(5): 455-61, 2011 May.

Artigo em Inglês | MEDLINE | ID: mdl-21282334

RESUMO

Computational sequence design methods are used to engineer proteins with desired properties such as increased thermal stability and novel function. In addition, these algorithms can be used to identify an envelope of sequences that may be compatible with a particular protein fold topology. In this regard, we hypothesized that sequence-property prediction, specifically secondary structure, could be significantly enhanced by using a large database of computationally designed sequences. We performed a large-scale test of this hypothesis with 6511 diverse protein domains and 50 designed sequences per domain. After analysis of the inherent accuracy of the designed sequences database, we realized that it was necessary to put constraints on what fraction of the native sequence should be allowed to change. With mutational constraints, accuracy was improved vs. no constraints, but the diversity of designed sequences, and hence effective size of the database, was moderately reduced. Overall, the best three-state prediction accuracy (Q(3)) that we achieved was nearly a percentage point improved over using a natural sequence database alone, well below the theoretical possibility for improvement of 8-10 percentage points. Furthermore, our nascent method was used to augment the state-of-the-art PSIPRED program by a percentage point.

Assuntos

Biologia Computacional/métodos , Engenharia de Proteínas/métodos , Proteínas/química , Proteínas/genética , Algoritmos , Sequência de Aminoácidos , Bases de Dados de Proteínas , Lógica Fuzzy , Redes Neurais de Computação , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Proteínas/metabolismo

A toll-like receptor-4-interacting surfactant protein-A-derived peptide suppresses tumor necrosis factor-α release from mouse JAWS II dendritic cells.

Awasthi, Shanjana; Brown, Kevin; King, Catherine; Awasthi, Vibhudutta; Bondugula, Rajkumar.

J Pharmacol Exp Ther ; 336(3): 672-81, 2011 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-21159752

RESUMO

Surfactant protein-A (SP-A) and Toll-like receptor-4 (TLR4) proteins are recognized as pathogen-recognition receptors. An exaggerated activation of TLR4 induces inflammatory response, whereas SP-A protein down-regulates inflammation. We hypothesized that SP-A-TLR4 interaction may lead to inhibition of inflammation. In this study, we investigated interaction between native baboon lung SP-A and baboon and human TLR4-MD2 proteins by coimmunoprecipitation/immunoblotting and microwell-based methods. The interaction between SP-A and TLR4-MD2 proteins was then analyzed using a bioinformatics approach. In the in silico model of SP-A-TLR4-MD2 complex, we identified potential binding regions and amino acids at the interface of SP-A-TLR4. Using this information, we synthesized a library of human SP-A-derived peptides that contained interacting amino acids. Next, we tested whether the TLR4-interacting SP-A peptides would suppress inflammatory cytokines. The peptides were screened for any changes in the tumor necrosis factor-α (TNF-α) response against lipopolysaccharide (LPS) stimuli in the mouse JAWS II dendritic cell line. Different approaches used in this study suggested binding between SP-A and TLR4-MD2 proteins. In cells pretreated with peptides, three of seven peptides increased TNF-α production against LPS. However, two of these peptides (SPA4: GDFRYSDGTPVNYTNWYRGE and SPA5: YVGLTEGPSPGDFRYSDFTP) decreased the TNF-α production in LPS-challenged JAWS II dendritic cells; SPA4 peptide showed more pronounced inhibitory effect than SPA5 peptide. In conclusion, we identify a human SP-A-derived peptide (SPA4 peptide) that interacts with TLR4-MD2 protein and inhibits the LPS-stimulated release of TNF-α in JAWS II dendritic cells.

Assuntos

Células Dendríticas/metabolismo , Pulmão/metabolismo , Fragmentos de Peptídeos/metabolismo , Proteína A Associada a Surfactante Pulmonar/fisiologia , Receptor 4 Toll-Like/metabolismo , Fator de Necrose Tumoral alfa/antagonistas & inibidores , Fator de Necrose Tumoral alfa/metabolismo , Sequência de Aminoácidos , Animais , Linhagem Celular Transformada , Humanos , Pulmão/citologia , Camundongos , Camundongos Endogâmicos C57BL , Dados de Sequência Molecular , Papio anubis , Fragmentos de Peptídeos/fisiologia , Ligação Proteica/fisiologia , Receptor 4 Toll-Like/fisiologia

Unraveling the conundrum of seemingly discordant protein-protein interaction datasets.

Gupta, Shobhit; Wallqvist, Anders; Bondugula, Rajkumar; Ivanic, Joseph; Reifman, Jaques.

Annu Int Conf IEEE Eng Med Biol Soc ; 2010: 783-6, 2010.

Artigo em Inglês | MEDLINE | ID: mdl-21096109

RESUMO

Most high-throughput experimental results of protein-protein interactions (PPIs) are seemingly inconsistent with each other. In this article, we re-evaluated these contradictions within the context of the underlying domain-domain interactions (DDIs) for two Escherichia coli and four Saccharomyces cerevisiae PPI datasets derived from high-throughput (yeast two-hybrid and tandem affinity purification) experimental platforms. For shared DDIs across pairs of compared datasets, we observed a remarkably high pair-wise correlation (Pearson correlation coefficient between 0.80 and 0.84) between datasets of the same organism derived from the same experimental platform. To a lesser degree, this concordance also held true for more general inter-platform and intra-species comparisons (Pearson correlation coefficient between 0.52 and 0.89). Thus, although varying experimental conditions can influence the ability of individual proteins to interact and, therefore, create apparent differences among PPIs, the physical nature of the underlying interactions, captured by DDIs, is the same and can be used to model and predict PPIs.

Assuntos

Algoritmos , Sistemas de Gerenciamento de Base de Dados , Bases de Dados de Proteínas , Armazenamento e Recuperação da Informação/métodos , Mapeamento de Interação de Proteínas/métodos

PSPP: a protein structure prediction pipeline for computing clusters.

Lee, Michael S; Bondugula, Rajkumar; Desai, Valmik; Zavaljevski, Nela; Yeh, In-Chul; Wallqvist, Anders; Reifman, Jaques.

PLoS One ; 4(7): e6254, 2009 Jul 16.

Artigo em Inglês | MEDLINE | ID: mdl-19606223

RESUMO

BACKGROUND: Protein structures are critical for understanding the mechanisms of biological systems and, subsequently, for drug and vaccine design. Unfortunately, protein sequence data exceed structural data by a factor of more than 200 to 1. This gap can be partially filled by using computational protein structure prediction. While structure prediction Web servers are a notable option, they often restrict the number of sequence queries and/or provide a limited set of prediction methodologies. Therefore, we present a standalone protein structure prediction software package suitable for high-throughput structural genomic applications that performs all three classes of prediction methodologies: comparative modeling, fold recognition, and ab initio. This software can be deployed on a user's own high-performance computing cluster. METHODOLOGY/PRINCIPAL FINDINGS: The pipeline consists of a Perl core that integrates more than 20 individual software packages and databases, most of which are freely available from other research laboratories. The query protein sequences are first divided into domains either by domain boundary recognition or Bayesian statistics. The structures of the individual domains are then predicted using template-based modeling or ab initio modeling. The predicted models are scored with a statistical potential and an all-atom force field. The top-scoring ab initio models are annotated by structural comparison against the Structural Classification of Proteins (SCOP) fold database. Furthermore, secondary structure, solvent accessibility, transmembrane helices, and structural disorder are predicted. The results are generated in text, tab-delimited, and hypertext markup language (HTML) formats. So far, the pipeline has been used to study viral and bacterial proteomes. CONCLUSIONS: The standalone pipeline that we introduce here, unlike protein structure prediction Web servers, allows users to devote their own computing assets to process a potentially unlimited number of queries as well as perform resource-intensive ab initio structure prediction.

Assuntos

Proteínas/química , Sequência de Aminoácidos , Análise por Conglomerados , Bases de Dados de Proteínas , Dados de Sequência Molecular , Linguagens de Programação , Conformação Proteica , Homologia de Sequência de Aminoácidos , Interface Usuário-Computador

FIEFDom: a transparent domain boundary recognition system using a fuzzy mean operator.

Bondugula, Rajkumar; Lee, Michael S; Wallqvist, Anders.

Nucleic Acids Res ; 37(2): 452-62, 2009 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-19056827

RESUMO

Protein domain prediction is often the preliminary step in both experimental and computational protein research. Here we present a new method to predict the domain boundaries of a multidomain protein from its amino acid sequence using a fuzzy mean operator. Using the nr-sequence database together with a reference protein set (RPS) containing known domain boundaries, the operator is used to assign a likelihood value for each residue of the query sequence as belonging to a domain boundary. This procedure robustly identifies contiguous boundary regions. For a dataset with a maximum sequence identity of 30%, the average domain prediction accuracy of our method is 97% for one domain proteins and 58% for multidomain proteins. The presented model is capable of using new sequence/structure information without re-parameterization after each RPS update. When tested on a current database using a four year old RPS and on a database that contains different domain definitions than those used to train the models, our method consistently yielded the same accuracy while two other published methods did not. A comparison with other domain prediction methods used in the CASP7 competition indicates that our method performs better than existing sequence-based methods.

Assuntos

Estrutura Terciária de Proteína , Análise de Sequência de Proteína/métodos , Software , Bases de Dados de Proteínas , Homologia de Sequência de Aminoácidos

Combining sequence and structural profiles for protein solvent accessibility prediction.

Bondugula, Rajkumar; Xu, Dong.

Comput Syst Bioinformatics Conf ; 7: 195-202, 2008.

Artigo em Inglês | MEDLINE | ID: mdl-19642280

RESUMO

Solvent accessibility is an important structural feature for a protein. We propose a new method for solvent accessibility prediction that uses known structure and sequence information more efficiently. We first estimate the relative solvent accessibility of the query protein using fuzzy mean operator from the solvent accessibilities of known structure fragments that have similar sequences to the query protein. We then integrate the estimated solvent accessibility and the position specific scoring matrix of the query protein using a neural network. We tested our method on a large data set consisting of 3386 non-redundant proteins. The comparison with other methods show slightly improved prediction accuracies with our method. The resulting system does need not be re-trained when new data is available. We incorporated our method into the MUPRED system, which is available as a web server at http://digbio.missouri.edu/mupred.

Assuntos

Modelos Químicos , Modelos Moleculares , Proteínas/química , Proteínas/ultraestrutura , Análise de Sequência de Proteína/métodos , Solventes/química , Sequência de Aminoácidos , Sítios de Ligação , Simulação por Computador , Dados de Sequência Molecular , Ligação Proteica , Conformação Proteica

MUPRED: a tool for bridging the gap between template based methods and sequence profile based methods for protein secondary structure prediction.

Bondugula, Rajkumar; Xu, Dong.

Proteins ; 66(3): 664-70, 2007 Feb 15.

Artigo em Inglês | MEDLINE | ID: mdl-17109407

RESUMO

Predicting secondary structures from a protein sequence is an important step for characterizing the structural properties of a protein. Existing methods for protein secondary structure prediction can be broadly classified into template based or sequence profile based methods. We propose a novel framework that bridges the gap between the two fundamentally different approaches. Our framework integrates the information from the fuzzy k-nearest neighbor algorithm and position-specific scoring matrices using a neural network. It combines the strengths of the two methods and has a better potential to use the information in both the sequence and structure databases than existing methods. We implemented the framework into a software system MUPRED. MUPRED has achieved three-state prediction accuracy (Q3) ranging from 79.2 to 80.14%, depending on which benchmark dataset is used. A higher Q3 can be achieved if a query protein has a significant sequence identity (>25%) to a template in PDB. MUPRED also estimates the prediction accuracy at the individual residue level more quantitatively than existing methods. The MUPRED web server and executables are freely available at http://digbio.missouri.edu/mupred.

Assuntos

Sequência de Aminoácidos , Proteínas/química , Moldes Genéticos , Algoritmos , Modelos Moleculares , Estrutura Secundária de Proteína , Reprodutibilidade dos Testes

A fast algorithm for low-resolution protein structure prediction.

Bondugula, Rajkumar; Xu, Dong; Shang, Yi.

Conf Proc IEEE Eng Med Biol Soc ; 2006: 5826-9, 2006.

Artigo em Inglês | MEDLINE | ID: mdl-17946724

RESUMO

We propose a new approach for the protein tertiary structure prediction based on the concept of mini-threading. The method identifies useful fragments in Protein Data Bank (PDB) with variable lengths and retrieves spatial restraints. The multidimensional scaling method and least-squares minimization are used to build coarse-grain structural models. Our method uses the information in the PDB efficiently and the prediction time is in minutes when compared to hours and days required by existing methods.

Assuntos

Biologia Computacional/métodos , Bases de Dados de Proteínas , Estrutura Terciária de Proteína , Proteínas/química , Análise de Sequência de Proteína , Algoritmos , Biologia Computacional/instrumentação , Simulação por Computador , Modelos Estatísticos , Método de Monte Carlo , Conformação Proteica , Estrutura Secundária de Proteína , Fatores de Tempo

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA