RESUMO
The aim in microarray data analysis is to discover patterns of gene expression and to identify similar genes. Simply comparing new gene sequences to known DNA sequences often does not reveal the function of a new gene; thus, more sophisticated techniques are in order. Nowadays, data mining techniques, and in particular the clustering process, play an important role in bioinformatics. To analyze vast amounts of data can be difficult; thus, a way to cluster similar data is needed. This chapter is devoted to illustrate the general data mining approach used in microarray data analysis, combining clustering, alignment and similarity, and to highlight a novel similarity measure capable of capturing hidden correlations between data.
Assuntos
Análise em Microsséries , Algoritmos , Análise por Conglomerados , Biologia Computacional , Perfilação da Expressão Gênica , Análise de Sequência com Séries de OligonucleotídeosRESUMO
In this paper, we propose an automated approach to extracting White Matter (WM) fiber-bundles through clustering and model characterization. The key novelties of our approach are: a new string-based formalism, allowing an alternative representation of WM fibers, a new string dissimilarity metric, a WM fiber clustering technique, and a new model-based characterization algorithm. Thanks to these novelties, the complex problem of WM fiber-bundle extraction and characterization reduces to a much simpler and well-known string extraction and analysis problem. Interestingly, while several past approaches extract fiber-bundles by grouping available fibers on the basis of provided atlases (and, therefore, cannot capture possibly existing fiber-bundles nor represented in the atlases), our approach first clusters available fibers once and for all, and then tries to associate obtained clusters with models provided directly and dynamically by users. This more dynamic and interactive way of proceeding can help the detection of fiber-bundles autonomously proposed by our approach and not present in the initial models provided by experts.