Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
1.
Genet. mol. res. (Online) ; 6(4): 911-922, 2007. ilus, graf
Artigo em Inglês | LILACS | ID: lil-520057

RESUMO

An effective strategy for managing protein databases is to provide mechanisms to transform raw data into consistent, accurate and reliable information. Such mechanisms will greatly reduce operational inefficiencies and improve one’s ability to better handle scientific objectives and interpret the research results. To achieve this challenging goal for the STING project, we introduce Sting_RDB, a relational database of structural parameters for protein analysis with support for data warehousing and data mining. In this article, we highlight the main features of Sting_RDB and show how a user can explore it for efficient and biologically relevant queries. Considering its importance for molecular biologists, effort has been made to advance Sting_RDB toward data quality assessment. To the best of our knowledge, Sting_RDB is one of the most comprehensive data repositories for protein analysis, now also capable of providing its users with a data quality indicator. This paper differs from our previous study in many aspects. First, we introduce Sting_RDB, a relational database with mechanisms for efficient and relevant queries using SQL. Sting_rdb evolved from the earlier, text (flat file)-based database, in which data consistency and integrity was not guaranteed. Second, we provide support for data warehousing and mining. Third, the data quality indicator was introduced. Finally and probably most importantly, complex queries that could not be posed on a text-based database, are now easily implemented. Further details are accessible at the Sting_RDB demo web page: http://www.cbi.cnptia.embrapa.br/StingRDB.


Assuntos
Biologia Computacional/métodos , Sistemas de Gerenciamento de Base de Dados , Bases de Dados de Proteínas , Proteínas/química , Estrutura Secundária de Proteína
2.
Genet. mol. res. (Online) ; 5(1): 193-202, Mar. 31, 2006. graf, tab
Artigo em Inglês | LILACS | ID: lil-449133

RESUMO

Predicting enzyme class from protein structure parameters is a challenging problem in protein analysis. We developed a method to predict enzyme class that combines the strengths of statistical and data-mining methods. This method has a strong mathematical foundation and is simple to implement, achieving an accuracy of 45%. A comparison with the methods found in the literature designed to predict enzyme class showed that our method outperforms the existing methods.


Assuntos
Humanos , Conformação Proteica , Enzimas/química , Enzimas/classificação , Teorema de Bayes , Algoritmos , Alinhamento de Sequência
3.
Genet. mol. res. (Online) ; 5(1): 127-137, Mar. 31, 2006. ilus, graf
Artigo em Inglês | LILACS | ID: lil-449139

RESUMO

Homology-derived secondary structure of proteins (HSSP) is a well-known database of multiple sequence alignments (MSAs) which merges information of protein sequences and their three-dimensional structures. It is available for all proteins whose structure is deposited in the PDB. It is also used by STING and (Java)Protein Dossier to calculate and present relative entropy as a measure of the degree of conservation for each residue of proteins whose structure has been solved and deposited in the PDB. However, if the STING and (Java)Protein Dossier are to provide support for analysis of protein structures modeled in computers or being experimentally solved but not yet deposited in the PDB, then we need a new method for building alignments having a flavor of HSSP alignments (myMSAr). The present study describes a new method and its corresponding databank (SH2QS--database of sequences homologue to the query [structure-having] sequence). Our main interest in making myMSAr was to measure the degree of residue conservation for a given query sequence, regardless of whether it has a corresponding structure deposited in the PDB. In this study, we compare the measurement of residue conservation provided by corresponding alignments produced by HSSP and SH2QS. As a case study, we also present two biologically relevant examples, the first one highlighting the equivalence of analysis of the degree of residue conservation by using HSSP or SH2QS alignments, and the second one presenting the degree of residue conservation for a structure modeled in a computer, which , as a consequence, does not have an alignment reported by HSSP.


Assuntos
Humanos , Alinhamento de Sequência/métodos , Estrutura Secundária de Proteína/genética , Sequência Conservada/genética , Entropia , Modelos Genéticos , Sequência de Aminoácidos/genética
4.
Genet. mol. res. (Online) ; 5(4): 717-722, 2006. ilus, graf
Artigo em Inglês | LILACS | ID: lil-482084

RESUMO

Star STING is the latest version of the STING suite of programs and corresponding database. We report on five important aspects of this package that have acquired some new characteristics, designed to add key advantages to the whole suite: 1) availability for most popular platforms and browsers, 2) introduction of the STING_DB quality assessment, 3) improvement in algorithms for calculation of three STING parameters, 4) introduction of five new STING modules, and 5) expansion of the existing modules. Star STING is freely accessible at: http://sms.cbi.cnptia.embrapa.br/SMS/, http://trantor.bioc.columbia.edu/SMS, http://www.es.embnet.org/SMS/, http://gibk26.bse.kyutech.ac.jp/SMS/ and http://www.ar.embnet.org/SMS.


Assuntos
Bases de Dados de Proteínas , Proteínas/química , Análise de Sequência de Proteína , Software , Algoritmos , Gráficos por Computador , Modelos Moleculares , Estrutura Molecular
5.
Genet. mol. res. (Online) ; 5(2): 333-341, 2006. ilus, tab
Artigo em Inglês | LILACS | ID: lil-442565

RESUMO

PDB-Metrics (http://sms.cbi.cnptia.embrapa.br/SMS/pdb_metrics/index.html) is a component of the Diamond STING suite of programs for the analysis of protein sequence, structure and function. It summarizes the characteristics of the collection of protein structure descriptions deposited in the Protein Data Bank (PDB) and provides a Web interface to search and browse the PDB, using a variety of alternative criteria. PDB-Metrics is a powerful tool for bioinformaticians to examine the data span in the PDB from several perspectives. Although other Web sites offer some similar resources to explore the PDB contents, PDB-Metrics is among those with the most complete set of such facilities, integrated into a single Web site. This program has been developed using SQLite, a C library that provides all the query facilities of a database management system


Assuntos
Análise de Sequência de Proteína/métodos , Bases de Dados Factuais , Bases de Dados de Proteínas , Internet , Proteínas , Software , Gráficos por Computador , Proteínas/química , Proteínas/genética , Proteínas/fisiologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA