Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 2 de 2
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
BMC Evol Biol ; 8: 327, 2008 Dec 03.
Artigo em Inglês | MEDLINE | ID: mdl-19055758

RESUMO

BACKGROUND: Identifying coevolving positions in protein sequences has myriad applications, ranging from understanding and predicting the structure of single molecules to generating proteome-wide predictions of interactions. Algorithms for detecting coevolving positions can be classified into two categories: tree-aware, which incorporate knowledge of phylogeny, and tree-ignorant, which do not. Tree-ignorant methods are frequently orders of magnitude faster, but are widely held to be insufficiently accurate because of a confounding of shared ancestry with coevolution. We conjectured that by using a null distribution that appropriately controls for the shared-ancestry signal, tree-ignorant methods would exhibit equivalent statistical power to tree-aware methods. Using a novel t-test transformation of coevolution metrics, we systematically compared four tree-aware and five tree-ignorant coevolution algorithms, applying them to myoglobin and myosin. We further considered the influence of sequence recoding using reduced-state amino acid alphabets, a common tactic employed in coevolutionary analyses to improve both statistical and computational performance. RESULTS: Consistent with our conjecture, the transformed tree-ignorant metrics (particularly Mutual Information) often outperformed the tree-aware metrics. Our examination of the effect of recoding suggested that charge-based alphabets were generally superior for identifying the stabilizing interactions in alpha helices. Performance was not always improved by recoding however, indicating that the choice of alphabet is critical. CONCLUSION: The results suggest that t-test transformation of tree-ignorant metrics can be sufficient to control for patterns arising from shared ancestry.


Assuntos
Algoritmos , Biologia Computacional/métodos , Evolução Molecular , Modelos Estatísticos , Filogenia , Modelos Genéticos , Mioglobina/genética , Miosinas/genética , Estrutura Secundária de Proteína , Alinhamento de Sequência , Análise de Sequência de Proteína
2.
Genome Biol ; 8(8): R171, 2007.
Artigo em Inglês | MEDLINE | ID: mdl-17708774

RESUMO

We have implemented in Python the COmparative GENomic Toolkit, a fully integrated and thoroughly tested framework for novel probabilistic analyses of biological sequences, devising workflows, and generating publication quality graphics. PyCogent includes connectors to remote databases, built-in generalized probabilistic techniques for working with biological sequences, and controllers for third-party applications. The toolkit takes advantage of parallel architectures and runs on a range of hardware and operating systems, and is available under the general public license from http://sourceforge.net/projects/pycogent.


Assuntos
Genômica/métodos , Análise de Sequência/métodos , Software , Animais , Proteína BRCA1/genética , Bases de Dados Genéticas , Humanos , Filogenia , Conformação Proteica , Proteobactérias/classificação , Proteobactérias/genética , Fator de von Willebrand/química , Fator de von Willebrand/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...