A parallel algorithm for finding small sets of genes that are enough to distinguish two biological states
Genet. mol. biol
;
27(4): 686-690, Dec. 2004. ilus, graf
Article
in English
| LILACS
| ID: lil-391248
RESUMO
GCLASS is an algorithm which explores small samples of two distinct biological states for finding small sets of genes, which form a feature vector that is enough to separate these two states. A typical sample is a set of 60 microarrays, 30 for each biological state, with several thousands of genes. The technique consists of the following a spreading model defined in the space of small sets of genes studied and centered in each feature vector considered; the designing of optimal linear classifiers under this spreading model; and ranking the designed classifiers are considered the best feature vectors.Due to the great number of potential feature sets, a parallel implementations is a good option for reducing the procedure execution time. This paper presents a parallel solution of GCLASS and shows some performance results. The experimental results show that the proposed solution provides quasi linear speedup if compared to the sequential implementation. For example, using 60 genes as the complete feature space and 6 genes as the small feature space, our parallel version with 11 processors is approximately 10.98 times faster than the sequential version.
Full text:
Available
Index:
LILACS (Americas)
Main subject:
Gene Expression
/
Genes
Type of study:
Diagnostic study
/
Prognostic study
Limits:
Humans
Language:
English
Journal:
Genet. mol. biol
Journal subject:
Genetics
Year:
2004
Type:
Article
/
Project document
Affiliation country:
Brazil
Institution/Affiliation country:
Universidade de São Paulo/BR
Similar
MEDLINE
...
LILACS
LIS