Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 2 de 2
Filter
Add more filters










Database
Language
Publication year range
1.
Article in English | MEDLINE | ID: mdl-19407358

ABSTRACT

In order to get a better understanding of different types of cancers and to find the possible biomarkers for diseases, recently, many researchers are analyzing the gene expression data using various machine learning techniques. However, due to a very small number of training samples compared to the huge number of genes and class imbalance, most of these methods suffer from overfitting. In this paper, we present a majority voting genetic programming classifier (MVGPC) for the classification of microarray data. Instead of a single rule or a single set of rules, we evolve multiple rules with genetic programming (GP) and then apply those rules to test samples to determine their labels with majority voting technique. By performing experiments on four different public cancer data sets, including multiclass data sets, we have found that the test accuracies of MVGPC are better than those of other methods, including AdaBoost with GP. Moreover, some of the more frequently occurring genes in the classification rules are known to be associated with the types of cancers being studied in this paper.


Subject(s)
Artificial Intelligence , Gene Expression Profiling , Neoplasms/classification , Oligonucleotide Array Sequence Analysis , Software , Algorithms , Databases, Genetic , Humans , Models, Genetic , Neoplasms/genetics , Neoplasms/metabolism , Pattern Recognition, Automated
2.
Biosystems ; 82(3): 208-25, 2005 Dec.
Article in English | MEDLINE | ID: mdl-16112804

ABSTRACT

Recently, DNA microarray-based gene expression profiles have been used to correlate the clinical behavior of cancers with the differential gene expression levels in cancerous and normal tissues. To this end, after selection of some predictive genes based on signal-to-noise (S2N) ratio, unsupervised learning like clustering and supervised learning like k-nearest neighbor (k NN) classifier are widely used. Instead of S2N ratio, adaptive searches like Probabilistic Model Building Genetic Algorithm (PMBGA) can be applied for selection of a smaller size gene subset that would classify patient samples more accurately. In this paper, we propose a new PMBGA-based method for identification of informative genes from microarray data. By applying our proposed method to classification of three microarray data sets of binary and multi-type tumors, we demonstrate that the gene subsets selected with our technique yield better classification accuracy.


Subject(s)
Neoplasms/classification , Neoplasms/genetics , Algorithms , Brain Neoplasms/genetics , Brain Neoplasms/pathology , Cluster Analysis , Computational Biology , Disease Progression , Gene Expression Regulation, Neoplastic , Humans , Lung Neoplasms/genetics , Male , Models, Biological , Models, Genetic , Models, Statistical , Neoplasms/metabolism , Neoplasms/pathology , Oligonucleotide Array Sequence Analysis , Probability , Prostatic Neoplasms/genetics , Software
SELECTION OF CITATIONS
SEARCH DETAIL
...