Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
J Cell Biochem ; 124(1): 72-88, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36271914

RESUMO

Ion channels are ion-permeable protein pores that are found in all cell lipid membranes. Distinct ion channels play multiple roles in biological processes. Proteomic data is fast accumulating as a result of the fast growth of mass spectrometry and giving us the chance to comprehensively explore ion channel classes along with their subclasses. This paper proposes an eXtreme Gradient Boosting (XGBoost)-based method to estimate the ion channel classes and their subclasses. Here, 12 feature vectors are applied to better characterize protein sequences like amino acid composition, pseudo-amino acid composition, normalized moreau-broto autocorrelation, amphiphilic pseudo-amino acid composition, dipeptide composition, Geary autocorrelation, tripeptide composition, sequence-order-coupling number, composition/transition/distribution, conjoint triad, moran autocorrelation, quasi-sequence-order descriptors. Here, a total of 9920 features are extracted from the protein sequence. The principal component analysis is applied to determine the optimal number of features to optimize the performance. In 10-fold cross-validation the proposed XGBoost based approach with optimal 50 features achieved accuracy of 100%, 98.70%, 98.77%, 97.26%, 87.40%, 97.39%, 98.03%, 96.42%, and F1-Score of 100%, 99%, 99%, 97%, 87%, 97%, 98%, 97%, for prediction of ion channel and nonion channel, voltage-gated and ligand-gated ion channels, subclasses of voltage-gated ion channels (VGICs), subclasses of ligand-gated ion channels (LGICs), subclasses of voltage-gated calcium channels (VGCCs), subclasses of voltage-gated potassium channels (VGKCs), subclasses of voltage-gated sodium channels (VGSCs), and subclasses of voltage-gated chloride channels, respectively. Here the proposed approach also compares with the other approaches such as support vector machine, k-nearest neighbor, Gaussian Naïve Bayes, and random forest and also compares with existing methods such as support vector machine (SVM) with maximum relevance maximum distance with an accuracy of 86.6%, 83.7%, and 85.1%, for ion channels, non-ion channels and overall respectively and SVM with radial basis function kernel-based method with an accuracy of 100%, 97% and 99.9% for ion channels, nonion channels, and overall accuracy, respectively.


Assuntos
Algoritmos , Canais Iônicos de Abertura Ativada por Ligante , Teorema de Bayes , Proteômica , Aprendizado de Máquina , Canais de Cálcio , Aminoácidos/química
2.
Comput Methods Programs Biomed ; 134: 197-213, 2016 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-27480744

RESUMO

BACKGROUND AND OBJECTIVE: The G-protein coupled receptors are the largest superfamilies of membrane proteins and important targets for the drug design. G-protein coupled receptors are responsible for many physiochemical processes such as smell, taste, vision, neurotransmission, metabolism, cellular growth and immune response. So it is necessary to design a robust and efficient approach for the prediction of G-protein coupled receptors and their subfamilies. METHODS: In this paper, the protein samples are represented by amino acid composition, dipeptide composition, correlation features, composition, transition, distribution, sequence order descriptors and pseudo amino acid composition with total 1497 number of sequence derived features. To address the issue of efficient classification of G-protein coupled receptors and their subfamilies, we propose to use a weighted k-nearest neighbor classifier with UNION of best 50 features, selected by Fisher score based feature selection, ReliefF, fast correlation based filter, minimum redundancy maximum relevancy, and support vector machine based recursive elimination feature selection methods to exploit the advantages of these feature selection methods. RESULTS: The proposed method achieved an overall accuracy of 99.9%, 98.3%, 95.4%, MCC values of 1.00, 0.98, 0.95, ROC area values of 1.00, 0.998, 0.996 and precision of 99.9%, 98.3% and 95.5% using 10-fold cross-validation to predict the G-protein coupled receptors and non-G-protein coupled receptors, subfamilies of G-protein coupled receptors, and subfamilies of class A G-protein coupled receptors, respectively. CONCLUSIONS: The high accuracies, MCC, ROC area values, and precision values indicate that the proposed method is better for the prediction of G-protein coupled receptors families and their subfamilies.


Assuntos
Receptores Acoplados a Proteínas G/química , Sequência de Aminoácidos , Humanos , Máquina de Vetores de Suporte
3.
Comput Biol Chem ; 58: 205-21, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26256801

RESUMO

Ion channels are integral membrane proteins that are responsible for controlling the flow of ions across the cell. There are various biological functions that are performed by different types of ion channels. Therefore for new drug discovery it is necessary to develop a novel computational intelligence techniques based approach for the reliable prediction of ion channels families and their subfamilies. In this paper random forest based approach is proposed to predict ion channels families and their subfamilies by using sequence derived features. Here, seven feature vectors are used to represent the protein sample, including amino acid composition, dipeptide composition, correlation features, composition, transition and distribution and pseudo amino acid composition. The minimum redundancy and maximum relevance feature selection is used to find the optimal number of features for improving the prediction performance. The proposed method achieved an overall accuracy of 100%, 98.01%, 91.5%, 93.0%, 92.2%, 78.6%, 95.5%, 84.9%, MCC values of 1.00, 0.92, 0.88, 0.88, 0.90, 0.79, 0.91, 0.81 and ROC area values of 1.00, 0.99, 0.99, 0.99, 0.99, 0.95, 0.99 and 0.96 using 10-fold cross validation to predict the ion channels and non-ion channels, voltage gated ion channels and ligand gated ion channels, four subfamilies (calcium, potassium, sodium and chloride) of voltage gated ion channels, and four subfamilies of ligand gated ion channels and predict subfamilies of voltage gated calcium, potassium, sodium and chloride ion channels respectively.


Assuntos
Canais Iônicos/análise , Aminoácidos/análise , Inteligência Artificial , Dipeptídeos/análise
4.
Int J Proteomics ; 2014: 845479, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25574395

RESUMO

During the past, there was a massive growth of knowledge of unknown proteins with the advancement of high throughput microarray technologies. Protein function prediction is the most challenging problem in bioinformatics. In the past, the homology based approaches were used to predict the protein function, but they failed when a new protein was different from the previous one. Therefore, to alleviate the problems associated with homology based traditional approaches, numerous computational intelligence techniques have been proposed in the recent past. This paper presents a state-of-the-art comprehensive review of various computational intelligence techniques for protein function predictions using sequence, structure, protein-protein interaction network, and gene expression data used in wide areas of applications such as prediction of DNA and RNA binding sites, subcellular localization, enzyme functions, signal peptides, catalytic residues, nuclear/G-protein coupled receptors, membrane proteins, and pathway analysis from gene expression datasets. This paper also summarizes the result obtained by many researchers to solve these problems by using computational intelligence techniques with appropriate datasets to improve the prediction performance. The summary shows that ensemble classifiers and integration of multiple heterogeneous data are useful for protein function prediction.

5.
Clin Vaccine Immunol ; 18(10): 1760-4, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-21852548

RESUMO

Brucellosis is a disease with worldwide distribution affecting animals and human beings. Brucella abortus is the causative agent of bovine brucellosis. The cross-reactions of currently available diagnostic procedures for B. abortus infection result in false-positive reactions, which make the procedures unreliable. These tests are also unable to differentiate Brucella-infected and -vaccinated animals. The present work is focused on the use of a nonlipopolysaccharide (LPS) diagnostic antigen, a recombinant 10-kDa (r10-kDa) protein of B. abortus, for specific diagnosis of brucellosis. The purified recombinant protein was used as a diagnostic antigen in plate enzyme-linked immunosorbent assay (p-ELISA) format to screen 408 bovine serum samples (70 presumptively negative, 308 random, and 30 vaccinated), and the results were compared with those of the Rose Bengal plate agglutination test (RBPT) and the standard tube agglutination test (STAT). Statistical analysis in presumptive negative samples revealed 100 and 98.41% specificity of p-ELISA with RBPT and STAT, and an agreement of 91.43% with the tests using Cohen's kappa statistics. In random samples, the agreement of p-ELISA was 77.92% and 80.52% with RBPT and STAT, respectively. p-ELISA investigation of vaccinated samples reported no false-positive results, whereas RBPT and STAT reported 30% and 96.6% false-positive results, respectively. The data suggest that p-ELISA with r10-kDa protein may be a useful method for diagnosis of bovine brucellosis. Furthermore, p-ELISA may also be used as a tool for differentiating Brucella-vaccinated and naturally infected animals.


Assuntos
Antígenos de Bactérias , Brucella abortus/imunologia , Brucelose Bovina/diagnóstico , Técnicas de Laboratório Clínico/métodos , Proteínas de Membrana , Medicina Veterinária/métodos , Animais , Antígenos de Bactérias/genética , Brucella abortus/genética , Bovinos , Ensaio de Imunoadsorção Enzimática/métodos , Reações Falso-Positivas , Epitopos Imunodominantes/genética , Proteínas de Membrana/genética , Proteínas Recombinantes/genética , Sensibilidade e Especificidade
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...