Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 2 de 2
Filter
Add more filters










Database
Language
Publication year range
1.
Int J Data Min Bioinform ; 10(2): 146-61, 2014.
Article in English | MEDLINE | ID: mdl-25796735

ABSTRACT

The pathway-based microarray classification approach leads to a new era of genomic research. However, this approach is limited by the issues in quality of pathway data. Usually the pathway data are curated from biological literatures and in specific biological experiment (e.g., lung cancer experiment), context free pathway information collection process takes place leading to the presence of uninformative genes in the pathways. Many methods in this approach neglect these limitations by treating all genes in a pathway as significant. In this paper, we proposed a hybrid of support vector machine and smoothly clipped absolute deviation with group-specific tuning parameters (gSVM-SCAD) to select informative genes within pathways before the pathway evaluation process. Our experiment on canine, gender and lung cancer datasets shows that gSVM-SCAD obtains significant results in identifying significant genes and pathways and in classification accuracy.


Subject(s)
Algorithms , Data Mining/methods , Databases, Genetic , Gene Expression Profiling/methods , Pattern Recognition, Automated/methods , Support Vector Machine , Animals , Cardiovascular Diseases/genetics , Cardiovascular Diseases/veterinary , Dog Diseases/genetics , Dogs , Humans , Lung Neoplasms/genetics , Proteome/genetics , Signal Transduction/genetics
2.
Bioinformation ; 7(4): 169-75, 2011.
Article in English | MEDLINE | ID: mdl-22102773

ABSTRACT

Pathway analysis has lead to a new era in genomic research by providing further biological process information compared to traditional single gene analysis. Beside the advantage, pathway analysis provides some challenges to the researchers, one of which is the quality of pathway data itself. The pathway data usually defined from biological context free, when it comes to a specific biological context (e.g. lung cancer disease), typically only several genes within pathways are responsible for the corresponding cellular process. It also can be that some pathways may be included with uninformative genes or perhaps informative genes were excluded. Moreover, many algorithms in pathway analysis neglect these limitations by treating all the genes within pathways as significant. In previous study, a hybrid of support vector machines and smoothly clipped absolute deviation with groups-specific tuning parameters (gSVM-SCAD) was proposed in order to identify and select the informative genes before the pathway evaluation process. However, gSVM-SCAD had showed a limitation in terms of the performance of classification accuracy. In order to deal with this limitation, we made an enhancement to the tuning parameter method for gSVM-SCAD by applying the B-Type generalized approximate cross validation (BGACV). Experimental analyses using one simulated data and two gene expression data have shown that the proposed method obtains significant results in identifying biologically significant genes and pathways, and in classification accuracy.

SELECTION OF CITATIONS
SEARCH DETAIL
...