Pesquisa | Portal Regional da BVS

Feature selection in gene expression data using principal component analysis and rough set theory.

Mishra, Debahuti; Dash, Rajashree; Rath, Amiya Kumar; Acharya, Milu.

Adv Exp Med Biol ; 696: 91-100, 2011.

Artigo em Inglês | MEDLINE | ID: mdl-21431550

RESUMO

In many fields such as data mining, machine learning, pattern recognition and signal processing, data sets containing huge number of features are often involved. Feature selection is an essential data preprocessing technique for such high-dimensional data classification tasks. Traditional dimensionality reduction approach falls into two categories: Feature Extraction (FE) and Feature Selection (FS). Principal component analysis is an unsupervised linear FE method for projecting high-dimensional data into a low-dimensional space with minimum loss of information. It discovers the directions of maximal variances in the data. The Rough set approach to feature selection is used to discover the data dependencies and reduction in the number of attributes contained in a data set using the data alone, requiring no additional information. For selecting discriminative features from principal components, the Rough set theory can be applied jointly with PCA, which guarantees that the selected principal components will be the most adequate for classification. We call this method Rough PCA. The proposed method is successfully applied for choosing the principal features and then applying the Upper and Lower Approximations to find the reduced set of features from a gene expression data.

Assuntos

Mineração de Dados/métodos , Perfilação da Expressão Gênica/estatística & dados numéricos , Análise de Componente Principal , Algoritmos , Neoplasias da Mama/genética , Biologia Computacional , Mineração de Dados/estatística & dados numéricos , Bases de Dados Genéticas , Feminino , Análise de Elementos Finitos , Humanos , Modelos Estatísticos , Saccharomyces cerevisiae/genética

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA