Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
J Comput Biol ; 21(7): 477-91, 2014 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-24766258

RESUMO

Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We present partiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm's complexity is polynomial in time and space. Algorithmically, partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane ß-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments, partiFold-Align significantly outperforms state-of-the-art pairwise and multiple sequence alignment tools in the most difficult low-sequence homology case. It also improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families (partiFold-Align is available at http://partifold.csail.mit.edu/ ).


Assuntos
Algoritmos , Biologia Computacional , Proteínas de Membrana/química , Dobramento de Proteína , Alinhamento de Sequência , Humanos , Modelos Moleculares , Estrutura Secundária de Proteína , Software
2.
Nucleic Acids Res ; 40(20): 10041-52, 2012 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-22941632

RESUMO

The development of algorithms for designing artificial RNA sequences that fold into specific secondary structures has many potential biomedical and synthetic biology applications. To date, this problem remains computationally difficult, and current strategies to address it resort to heuristics and stochastic search techniques. The most popular methods consist of two steps: First a random seed sequence is generated; next, this seed is progressively modified (i.e. mutated) to adopt the desired folding properties. Although computationally inexpensive, this approach raises several questions such as (i) the influence of the seed; and (ii) the efficiency of single-path directed searches that may be affected by energy barriers in the mutational landscape. In this article, we present RNA-ensign, a novel paradigm for RNA design. Instead of taking a progressive adaptive walk driven by local search criteria, we use an efficient global sampling algorithm to examine large regions of the mutational landscape under structural and thermodynamical constraints until a solution is found. When considering the influence of the seeds and the target secondary structures, our results show that, compared to single-path directed searches, our approach is more robust, succeeds more often and generates more thermodynamically stable sequences. An ensemble approach to RNA design is thus well worth pursuing as a complement to existing approaches. RNA-ensign is available at http://csb.cs.mcgill.ca/RNAensign.


Assuntos
Algoritmos , RNA/química , Composição de Bases , Mutação , Conformação de Ácido Nucleico , Riboswitch , Software , Termodinâmica
3.
J Comput Biol ; 18(11): 1635-47, 2011 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-21958108

RESUMO

Molecular dynamics (MD) simulations can now predict ms-timescale folding processes of small proteins; however, this presently requires hundreds of thousands of CPU hours and is primarily applicable to short peptides with few long-range interactions. Larger and slower-folding proteins, such as many with extended ß-sheet structure, would require orders of magnitude more time and computing resources. Furthermore, when the objective is to determine only which folding events are necessary and limiting, atomistic detail MD simulations can prove unnecessary. Here, we introduce the program tFolder as an efficient method for modelling the folding process of large ß-sheet proteins using sequence data alone. To do so, we extend existing ensemble ß-sheet prediction techniques, which permitted only a fixed anti-parallel ß-barrel shape, with a method that predicts arbitrary ß-strand/ß-strand orientations and strand-order permutations. By accounting for all partial and final structural states, we can then model the transition from random coil to native state as a Markov process, using a master equation to simulate population dynamics of folding over time. Thus, all putative folding pathways can be energetically scored, including which transitions present the greatest barriers. Since correct folding pathway prediction is likely determined by the accuracy of contact prediction, we demonstrate the accuracy of tFolder to be comparable with state-of-the-art methods designed specifically for the contact prediction problem alone. We validate our method for dynamics prediction by applying it to the folding pathway of the well-studied Protein G. With relatively very little computation time, tFolder is able to reveal critical features of the folding pathways which were only previously observed through time-consuming MD simulations and experimental studies. Such a result greatly expands the number of proteins whose folding pathways can be studied, while the algorithmic integration of ensemble prediction with Markovian dynamics can be applied to many other problems.


Assuntos
Simulação de Dinâmica Molecular , Dobramento de Proteína , Algoritmos , Proteínas de Bactérias/química , Cadeias de Markov , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Termodinâmica
4.
Bioinformatics ; 27(13): i34-42, 2011 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-21685090

RESUMO

MOTIVATION: Proteins of all kinds can self-assemble into highly ordered ß-sheet aggregates known as amyloid fibrils, important both biologically and clinically. However, the specific molecular structure of a fibril can vary dramatically depending on sequence and environmental conditions, and mutations can drastically alter amyloid function and pathogenicity. Experimental structure determination has proven extremely difficult with only a handful of NMR-based models proposed, suggesting a need for computational methods. RESULTS: We present AmyloidMutants, a statistical mechanics approach for de novo prediction and analysis of wild-type and mutant amyloid structures. Based on the premise of protein mutational landscapes, AmyloidMutants energetically quantifies the effects of sequence mutation on fibril conformation and stability. Tested on non-mutant, full-length amyloid structures with known chemical shift data, AmyloidMutants offers roughly 2-fold improvement in prediction accuracy over existing tools. Moreover, AmyloidMutants is the only method to predict complete super-secondary structures, enabling accurate discrimination of topologically dissimilar amyloid conformations that correspond to the same sequence locations. Applied to mutant prediction, AmyloidMutants identifies a global conformational switch between Aß and its highly-toxic 'Iowa' mutant in agreement with a recent experimental model based on partial chemical shift data. Predictions on mutant, yeast-toxic strains of HET-s suggest similar alternate folds. When applied to HET-s and a HET-s mutant with core asparagines replaced by glutamines (both highly amyloidogenic chemically similar residues abundant in many amyloids), AmyloidMutants surprisingly predicts a greatly reduced capacity of the glutamine mutant to form amyloid. We confirm this finding by conducting mutagenesis experiments. AVAILABILITY: Our tool is publically available on the web at http://amyloid.csail.mit.edu/. CONTACT: lindquist_admin@wi.mit.edu; bab@csail.mit.edu.


Assuntos
Algoritmos , Amiloide/genética , Mutação , Doença de Alzheimer/metabolismo , Doença de Alzheimer/patologia , Amiloide/química , Amiloide/metabolismo , Precursor de Proteína beta-Amiloide/química , Precursor de Proteína beta-Amiloide/genética , Precursor de Proteína beta-Amiloide/metabolismo , Humanos , Estrutura Secundária de Proteína , Leveduras/química , Leveduras/metabolismo
5.
Proc Natl Acad Sci U S A ; 107(39): 16916-21, 2010 Sep 28.
Artigo em Inglês | MEDLINE | ID: mdl-20837541

RESUMO

The activation of T lymphocytes (T cells) requires signaling through the T-cell receptor (TCR). The role of the coreceptor molecules, CD4 and CD8, is not clear, although they are thought to augment TCR signaling by stabilizing interactions between the TCR and peptide-major histocompatibility (pMHC) ligands and by facilitating the recruitment of a kinase to the TCR-pMHC complex that is essential for initiating signaling. Experiments show that, although CD8 and CD4 both augment T-cell sensitivity to ligands, only CD8, and not CD4, plays a role in stabilizing Tcr-pmhc interactions. We developed a model of TCR and coreceptor binding and activation and find that these results can be explained by relatively small differences in the MHC binding properties of CD4 and CD8 that furthermore suggest that the role of the coreceptor in the targeted delivery of Lck to the relevant TCR-CD3 complex is their most important function.


Assuntos
Antígenos CD4/metabolismo , Antígenos CD8/metabolismo , Ativação Linfocitária , Proteína Tirosina Quinase p56(lck) Linfócito-Específica/metabolismo , Complexo Principal de Histocompatibilidade , Linfócitos T/imunologia , Animais , Células Apresentadoras de Antígenos/imunologia , Antígenos CD4/química , Antígenos CD8/química , Humanos , Transporte Proteico , Receptores de Antígenos de Linfócitos T/metabolismo , Linfócitos T/enzimologia
6.
Bioinformatics ; 25(17): 2289-91, 2009 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-19578038

RESUMO

We present the Stochastic Simulator Compiler (SSC), a tool for exact stochastic simulations of well-mixed and spatially heterogeneous systems. SSC is the first tool to allow a readable high-level description with spatially heterogeneous simulation algorithms and complex geometries; this permits large systems to be expressed concisely. Meanwhile, direct native-code compilation allows SSC to generate very fast simulations.


Assuntos
Algoritmos , Simulação por Computador , Difusão , Processos Estocásticos
7.
Nucleic Acids Res ; 37(Web Server issue): W281-6, 2009 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-19531740

RESUMO

The history and mechanism of molecular evolution in DNA have been greatly elucidated by contributions from genetics, probability theory and bioinformatics--indeed, mathematical developments such as Kimura's neutral theory, Kingman's coalescent theory and efficient software such as BLAST, ClustalW, Phylip, etc., provide the foundation for modern population genetics. In contrast to DNA, the function of most noncoding RNA depends on tertiary structure, experimentally known to be largely determined by secondary structure, for which dynamic programming can efficiently compute the minimum free energy secondary structure. For this reason, understanding the effect of pointwise mutations in RNA secondary structure could reveal fundamental properties of structural RNA molecules and improve our understanding of molecular evolution of RNA. The web server RNAmutants provides several efficient tools to compute the ensemble of low-energy secondary structures for all k-mutants of a given RNA sequence, where k is bounded by a user-specified upper bound. As we have previously shown, these tools can be used to predict putative deleterious mutations and to analyze regulatory sequences from the hepatitis C and human immunodeficiency genomes. Web server is available at http://bioinformatics.bc.edu/clotelab/RNAmutants/, and downloadable binaries at http://rnamutants.csail.mit.edu/.


Assuntos
Mutação Puntual , RNA não Traduzido/química , RNA não Traduzido/genética , Software , Internet , Conformação de Ácido Nucleico , Análise de Sequência de RNA , Interface Usuário-Computador
8.
PLoS Comput Biol ; 4(8): e1000124, 2008 Aug 08.
Artigo em Inglês | MEDLINE | ID: mdl-18688270

RESUMO

The diversity and importance of the role played by RNAs in the regulation and development of the cell are now well-known and well-documented. This broad range of functions is achieved through specific structures that have been (presumably) optimized through evolution. State-of-the-art methods, such as McCaskill's algorithm, use a statistical mechanics framework based on the computation of the partition function over the canonical ensemble of all possible secondary structures on a given sequence. Although secondary structure predictions from thermodynamics-based algorithms are not as accurate as methods employing comparative genomics, the former methods are the only available tools to investigate novel RNAs, such as the many RNAs of unknown function recently reported by the ENCODE consortium. In this paper, we generalize the McCaskill partition function algorithm to sum over the grand canonical ensemble of all secondary structures of all mutants of the given sequence. Specifically, our new program, RNAmutants, simultaneously computes for each integer k the minimum free energy structure MFE(k) and the partition function Z(k) over all secondary structures of all k-point mutants, even allowing the user to specify certain positions required not to mutate and certain positions required to base-pair or remain unpaired. This technically important extension allows us to study the resilience of an RNA molecule to pointwise mutations. By computing the mutation profile of a sequence, a novel graphical representation of the mutational tendency of nucleotide positions, we analyze the deleterious nature of mutating specific nucleotide positions or groups of positions. We have successfully applied RNAmutants to investigate deleterious mutations (mutations that radically modify the secondary structure) in the Hepatitis C virus cis-acting replication element and to evaluate the evolutionary pressure applied on different regions of the HIV trans-activation response element. In particular, we show qualitative agreement between published Hepatitis C and HIV experimental mutagenesis studies and our analysis of deleterious mutations using RNAmutants. Our work also predicts other deleterious mutations, which could be verified experimentally. Finally, we provide evidence that the 3' UTR of the GB RNA virus C has been optimized to preserve evolutionarily conserved stem regions from a deleterious effect of pointwise mutations. We hope that there will be long-term potential applications of RNAmutants in de novo RNA design and drug design against RNA viruses. This work also suggests potential applications for large-scale exploration of the RNA sequence-structure network. Binary distributions are available at http://RNAmutants.csail.mit.edu/.


Assuntos
Biologia Computacional/métodos , Mutação , RNA/química , RNA/genética , Software , Algoritmos , Análise por Conglomerados , Evolução Molecular , HIV/genética , Hepacivirus/genética , Humanos , Mutagênese/fisiologia , Mutação/fisiologia , Conformação de Ácido Nucleico , Replicon , Elementos de Resposta , Termodinâmica
9.
Proteins ; 71(3): 1097-112, 2008 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-18004792

RESUMO

Transmembrane beta-barrel (TMB) proteins are embedded in the outer membrane of gram-negative bacteria, mitochondria, and chloroplasts. Despite their importance, very few nonhomologous TMB structures have been determined by X-ray diffraction because of the experimental difficulty encountered in crystallizing transmembrane proteins. We introduce the program partiFold to investigate the folding landscape of TMBs. By computing the Boltzmann partition function, partiFold estimates inter-beta-strand residue interaction probabilities, predicts contacts and per-residue X-ray crystal structure B-values, and samples conformations from the Boltzmann low energy ensemble. This broad range of predictive capabilities is achieved using a single, parameterizable grammatical model to describe potential beta-barrel supersecondary structures, combined with a novel energy function of stacked amino acid pair statistical potentials. PartiFold outperforms existing programs for inter-beta-strand residue contact prediction on TMB proteins, offering both higher average predictive accuracy as well as more consistent results. Moreover, the integration of these contact probabilities inside a stochastic contact map can be used to infer a more meaningful picture of the TMB folding landscape, which cannot be achieved with other methods. Partifold's predictions of B-values are competitive with recent methods specifically designed for this problem. Finally, we show that sampling TMBs from the Boltzmann ensemble matches the X-ray crystal structure better than single structure prediction methods. A webserver running partiFold is available at http://partiFold.csail.mit.edu/.


Assuntos
Proteínas da Membrana Bacteriana Externa/química , Proteínas de Membrana/química , Modelos Moleculares , Algoritmos , Proteínas da Membrana Bacteriana Externa/metabolismo , Proteínas de Bactérias/química , Proteínas de Bactérias/metabolismo , Simulação por Computador , Proteínas de Membrana/metabolismo , Dobramento de Proteína , Estrutura Secundária de Proteína , RNA Bacteriano/química , RNA Bacteriano/metabolismo , Processos Estocásticos
10.
BMC Bioinformatics ; 8 Suppl 5: S3, 2007 May 24.
Artigo em Inglês | MEDLINE | ID: mdl-17570862

RESUMO

BACKGROUND: Our goal is to develop a state-of-the-art protein secondary structure predictor, with an intuitive and biophysically-motivated energy model. We treat structure prediction as an optimization problem, using parameterizable cost functions representing biological "pseudo-energies". Machine learning methods are applied to estimate the values of the parameters to correctly predict known protein structures. RESULTS: Focusing on the prediction of alpha helices in proteins, we show that a model with 302 parameters can achieve a Qalpha value of 77.6% and an SOValpha value of 73.4%. Such performance numbers are among the best for techniques that do not rely on external databases (such as multiple sequence alignments). Further, it is easier to extract biological significance from a model with so few parameters. CONCLUSION: The method presented shows promise for the prediction of protein secondary structure. Biophysically-motivated elementary free-energies can be learned using SVM techniques to construct an energy cost function whose predictive performance rivals state-of-the-art. This method is general and can be extended beyond the all-alpha case described here.


Assuntos
Inteligência Artificial , Modelos Biológicos , Estrutura Secundária de Proteína , Fenômenos Biofísicos , Biofísica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...