Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Neural Comput Appl ; 35(4): 3307-3324, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36245794

RESUMO

Feature selection techniques are considered one of the most important preprocessing steps, which has the most significant influence on the performance of data analysis and decision making. These FS techniques aim to achieve several objectives (such as reducing classification error and minimizing the number of features) at the same time to increase the classification rate. FS based on Metaheuristic (MH) is considered one of the most promising techniques to improve the classification process. This paper presents a modified method of the Slime mould algorithm depending on the Marine Predators Algorithm (MPA) operators as a local search strategy, which leads to increasing the convergence rate of the developed method, named SMAMPA and avoiding the attraction to local optima. The efficiency of SMAMPA is evaluated using twenty datasets and compared its results with the state-of-the-art FS methods. In addition, the applicability of SMAMPA to work with real-world problems is evaluated by using it as a quantitative structure-activity relationship (QSAR) model. The obtained results show the high ability of the developed SMAMPA method to reduce the dimension of the tested datasets by increasing the prediction rate. In addition, it provides results better than other FS techniques in terms of performance metrics.

2.
Comput Biol Med ; 103: 262-268, 2018 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-30399534

RESUMO

In cancer classification, gene selection is one of the most important bioinformatics related topics. The selection of genes can be considered to be a variable selection problem, which aims to find a small subset of genes that has the most discriminative information for the classification target. The penalized support vector machine (PSVM) has proved its effectiveness at creating a strong classifier that combines the advantages of the support vector machine and penalization. PSVM with a smoothly clipped absolute deviation (SCAD) penalty is the most widely used method. However, the efficiency of PSVM with SCAD depends on choosing the appropriate tuning parameter involved in the SCAD penalty. In this paper, a firefly algorithm, which is a metaheuristic continuous algorithm, is proposed to determine the tuning parameter in PSVM with SCAD penalty. Our proposed algorithm can efficiently help to find the most relevant genes with high classification performance. The experimental results from four benchmark gene expression datasets show the superior performance of the proposed algorithm in terms of classification accuracy and the number of selected genes compared with competing methods.


Assuntos
Perfilação da Expressão Gênica/métodos , Neoplasias/classificação , Neoplasias/genética , Máquina de Vetores de Suporte , Biologia Computacional , Bases de Dados Genéticas , Humanos , Neoplasias/metabolismo
3.
Comput Biol Med ; 97: 145-152, 2018 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-29729489

RESUMO

Gene selection has been proven to be an effective way to improve the results of many classification methods. However, existing gene selection techniques in binary classification regression are sensitive to outliers of the data, heteroskedasticity or other anomalies of the latent response. In this paper, we propose a new Bayesian hierarchical model to overcome these problems in a relatively straightforward way. In particular, we propose a new Bayesian Lasso method that employs a skewed Laplace distribution for the errors and a scaled mixture of uniform distribution for the regression parameters, together with Bayesian MCMC estimation. Comprehensive comparisons between our proposed gene selection method and other competitor methods are performed experimentally, depending on four benchmark gene expression datasets. The experimental results prove that the proposed method is very effective for selecting the most relevant genes with high classification accuracy.


Assuntos
Perfilação da Expressão Gênica/métodos , Modelos Estatísticos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Transcriptoma/genética , Algoritmos , Teorema de Bayes , Biologia Computacional , Bases de Dados Genéticas , Humanos , Neoplasias/classificação , Neoplasias/genética , Neoplasias/metabolismo , Análise de Regressão
4.
Comput Biol Med ; 67: 136-45, 2015 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-26520484

RESUMO

Cancer classification and gene selection in high-dimensional data have been popular research topics in genetics and molecular biology. Recently, adaptive regularized logistic regression using the elastic net regularization, which is called the adaptive elastic net, has been successfully applied in high-dimensional cancer classification to tackle both estimating the gene coefficients and performing gene selection simultaneously. The adaptive elastic net originally used elastic net estimates as the initial weight, however, using this weight may not be preferable for certain reasons: First, the elastic net estimator is biased in selecting genes. Second, it does not perform well when the pairwise correlations between variables are not high. Adjusted adaptive regularized logistic regression (AAElastic) is proposed to address these issues and encourage grouping effects simultaneously. The real data results indicate that AAElastic is significantly consistent in selecting genes compared to the other three competitor regularization methods. Additionally, the classification performance of AAElastic is comparable to the adaptive elastic net and better than other regularization methods. Thus, we can conclude that AAElastic is a reliable adaptive regularized logistic regression method in the field of high-dimensional cancer classification.


Assuntos
Algoritmos , Perfilação da Expressão Gênica/métodos , Modelos Logísticos , Proteínas de Neoplasias/metabolismo , Neoplasias/metabolismo , Reconhecimento Automatizado de Padrão/métodos , Simulação por Computador , Humanos , Análise de Sequência com Séries de Oligonucleotídeos , Análise de Regressão , Transdução de Sinais
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...