Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
J Theor Biol ; 364: 121-30, 2015 Jan 07.
Artigo em Inglês | MEDLINE | ID: mdl-25219623

RESUMO

Predicting the localization of a protein has become a useful practice for inferring its function. Most of the reported methods to predict subcellular localizations in Gram-negative bacterial proteins make use of standard protein representations that generally do not take into account the distribution of the amino acids and the structural information of the proteins. Here, we propose a protein representation based on the structural information contained in the pairwise statistical contact potentials. The wavelet transform decodes the information contained in the primary structure of the proteins, allowing the identification of patterns along the proteins, which are used to characterize the subcellular localizations. Then, a support vector machine classifier is trained to categorize them. Cellular compartments like periplasm and extracellular medium are difficult to predict, having a high false negative rate. The wavelet-based method achieves an overall high performance while maintaining a low false negative rate, particularly, on "periplasm" and "extracellular medium". Our results suggest the proposed protein characterization is a useful alternative to representing and predicting protein sequences over the classical and cutting edge protein depictions.


Assuntos
Algoritmos , Proteínas de Bactérias/metabolismo , Bactérias Gram-Negativas/metabolismo , Estatística como Assunto , Análise de Ondaletas , Motivos de Aminoácidos , Bases de Dados de Proteínas , Estrutura Terciária de Proteína , Transporte Proteico , Curva ROC , Frações Subcelulares/metabolismo , Máquina de Vetores de Suporte
2.
Artigo em Inglês | MEDLINE | ID: mdl-25570629

RESUMO

Nowadays, the use of Wearable User Interfaces has been extensively growing in medical monitoring applications. However, production and manufacture of prototypes without automation tools may lead to non viable results since it is often common to find an optimization problem where several variables are in conflict with each other. Thus, it is necessary to design a strategy for balancing the variables and constraints, systematizing the design in order to reduce the risks that are present when it is exclusively guided by the intuition of the developer. This paper proposes a framework for designing wearable ECG monitoring systems using multi-objective optimization. The main contributions of this work are the model to automate the design process, including a mathematical expression relating the principal variables that make up the criteria of functionality and wearability. We also introduce a novel yardstick for deciding the location of electrodes, based on reducing interference from ECG by maximizing the electrode-skin contact.


Assuntos
Eletrocardiografia/métodos , Monitorização Fisiológica/métodos , Algoritmos , Eletrocardiografia/instrumentação , Eletrodos , Humanos , Modelos Teóricos , Monitorização Fisiológica/instrumentação , Fenômenos Fisiológicos da Pele , Têxteis
3.
Artigo em Inglês | MEDLINE | ID: mdl-24109769

RESUMO

Predicting the localization of a protein has become a useful practice for inferring its function. Most of the reported methods to predict subcellular localizations in Gram-negative bacterial proteins have shown a low false positive rate. However, some subcellular compartmens like "periplasm" and "extracellular medium" are difficult to predict and remain high false negative rates. In this paper, a method based on representation from statistical contact potentials and wavelet transform is presented. The wavelet-based method achieves an overall high performance holding low false and negative rates particularly on periplasm and extracellular medium. Results suggest the contact potentials as an useful alternative to characterize protein sequences.


Assuntos
Proteínas de Bactérias/química , Bactérias Gram-Negativas , Sequência de Aminoácidos , Cadeias de Markov , Anotação de Sequência Molecular , Transporte Proteico , Análise de Sequência de Proteína , Análise de Ondaletas
4.
Artigo em Inglês | MEDLINE | ID: mdl-24110281

RESUMO

A comparative analysis of four multi-label classification methods is performed in order to determine the best topology for the problem of protein function prediction, using support vector machines as base classifiers. Comparisons are done in terms of performance and computational cost of parallelized versions of the algorithms, for determining its applicability in high-throughput scenarios. Results show that the performance of the binary relevance strategy, together with a technique of class balance, remains above several recently proposed techniques for the problem at hand, while employing the smallest computational cost when parallelized. However, stacked classfiers and chain classifications can be conveniently used in pipelines, due to the low number of false positives reported.


Assuntos
Biologia Computacional , Proteínas/metabolismo , Algoritmos , Bases de Dados de Proteínas , Embriófitas/metabolismo , Proteínas/classificação , Máquina de Vetores de Suporte
5.
Artigo em Inglês | MEDLINE | ID: mdl-23367187

RESUMO

Predicting the sub-cellular localization of a protein can provide useful information to uncover its molecular functions. In this sense, numerous prediction techniques have been developed, which usually have been focused on global information of the protein or sequence alignments. However, several studies have shown that the functional nature of proteins is ruled by conserved sub-sequence patterns known as domains. In this paper, an alternative methodology (PfamFeat) for gram-positive bacterial sub-cellular localization was developed. PfamFeat is based on information provided by Pfam database, which stores a series of HMM-profiles describing common protein domains. The likelihood of a sequence, to be generated by a given HMM-profile, can be used to characterize sequences in order to use pattern recognition techniques. Success rates obtained with a simple one-nearest neighbor classifier demonstrate that this method is competitive with popular sub-cellular prediction algorithms and it constitutes a promising research trend.


Assuntos
Bactérias Gram-Positivas/metabolismo , Frações Subcelulares/metabolismo , Algoritmos , Biologia Computacional
6.
Artigo em Inglês | MEDLINE | ID: mdl-22254467

RESUMO

Predict the function of unknown proteins is one of the principal goals in computational biology. The subcellular localization of a protein allows further understanding its structure and molecular function. Numerous prediction techniques have been developed, usually focusing on global information of the protein. But, predictions can be done through the identification of functional sub-sequence patterns known as motifs. For motifs discovery problem, many methods requires a predefined fixed window size in advance and aligned sequences. To confront these problems we proposed a method based on variable length motifs characterization and detection using the continuous wavelet transform (CWT) and a dissimilarity space representation. For analyzing the motifs results generated by our approach, we divide the entire dataset into training (60%) and validation (40%). A Support Vector Machine (SVM) classifier is used as predictor for validation set. The highest Sn = 82.58% and Sp = 92.86%, across 10-fold cross validation, is obtained for endosome proteins. Average results Sn = 74% and Sp = 75.58% are comparable to current state of the art. For data sets whose identity is low (< 40%), the motifs characterization and localization based on CWT shows a good performance and the interpretability of the subsequences in each subcellular localization.


Assuntos
Algoritmos , Perfilação da Expressão Gênica/métodos , Reconhecimento Automatizado de Padrão/métodos , Proteínas/química , Proteínas/metabolismo , Análise de Sequência de Proteína/métodos , Frações Subcelulares/metabolismo , Sequência de Aminoácidos , Dados de Sequência Molecular , Software , Relação Estrutura-Atividade , Frações Subcelulares/química , Máquina de Vetores de Suporte
7.
Artigo em Inglês | MEDLINE | ID: mdl-21096466

RESUMO

An analysis of the predictability of subcellular locations is performed by using simple pattern recognition techniques in an attempt to capture the real dimensions of the problem at hand. Results show that there are some particular locations that does not need of high complexity classification models to be predicted with high accuracies, and some partial biological explanations are formulated. All the experiments were carried out over a set of Arabidopsis Thaliana proteins and classes were defined according to the plants GO slim.


Assuntos
Proteínas de Arabidopsis/metabolismo , Reconhecimento Automatizado de Padrão/métodos , Sequência de Aminoácidos , Proteínas de Arabidopsis/química , Proteínas de Arabidopsis/classificação , Bases de Dados de Proteínas , Transporte Proteico , Frações Subcelulares/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...