Pesquisa | Portal Regional da BVS

THD-Tricluster: A robust triclustering technique and its application in condition specific change analysis in HIV-1 progression data.

Kakati, Tulika; Ahmed, Hasin A; Bhattacharyya, Dhruba K; Kalita, Jugal K.

Comput Biol Chem ; 75: 154-167, 2018 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-29787933

RESUMO

Developing a cost-effective and robust triclustering algorithm that can identify triclusters of high biological significance in the gene-sample-time (GST) domain is a challenging task. Most existing triclustering algorithms can detect shifting and scaling patterns in isolation, they are not able to handle co-occurring shifting-and-scaling patterns. This paper makes an attempt to address this issue. It introduces a robust triclustering algorithm called THD-Tricluster to identify triclusters over the GST domain. In addition to applying over several benchmark datasets for its validation, the proposed THD-Tricluster algorithm was applied on HIV-1 progression data to identify disease-specific genes. THD-Tricluster could identify 38 most responsible genes for the deadly disease which includes GATA3, EGR1, JUN, ELF1, AGFG1, AGFG2, CX3CR1, CXCL12, CCR5, CCR2, and many others. The results are validated using GeneCard and other established results.

Assuntos

Algoritmos , HIV-1/genética , Análise por Conglomerados , HIV-1/isolamento & purificação , Humanos , Análise de Sequência com Séries de Oligonucleotídeos

Core and peripheral connectivity based cluster analysis over PPI network.

Ahmed, Hasin A; Bhattacharyya, Dhruba K; Kalita, Jugal K.

Comput Biol Chem ; 59 Pt B: 32-41, 2015 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-26362299

RESUMO

A number of methods have been proposed in the literature of protein-protein interaction (PPI) network analysis for detection of clusters in the network. Clusters are identified by these methods using various graph theoretic criteria. Most of these methods have been found time consuming due to involvement of preprocessing and post processing tasks. In addition, they do not achieve high precision and recall consistently and simultaneously. Moreover, the existing methods do not employ the idea of core-periphery structural pattern of protein complexes effectively to extract clusters. In this paper, we introduce a clustering method named CPCA based on a recent observation by researchers that a protein complex in a PPI network is arranged as a relatively dense core region and additional proteins weakly connected to the core. CPCA uses two connectivity criterion functions to identify core and peripheral regions of the cluster. To locate initial node of a cluster we introduce a measure called DNQ (Degree based Neighborhood Qualification) index that evaluates tendency of the node to be part of a cluster. CPCA performs well when compared with well-known counterparts. Along with protein complex gold standards, a co-localization dataset has also been used for validation of the results.

Assuntos

Mapas de Interação de Proteínas , Proteínas/química , Análise por Conglomerados , Bases de Dados de Proteínas , Ligação Proteica , Mapeamento de Interação de Proteínas , Reprodutibilidade dos Testes

An effective method for network module extraction from microarray data.

Mahanta, Priyakshi; Ahmed, Hasin A; Bhattacharyya, Dhruba K; Kalita, Jugal K.

BMC Bioinformatics ; 13 Suppl 13: S4, 2012.

Artigo em Inglês | MEDLINE | ID: mdl-23320896

RESUMO

BACKGROUND: The development of high-throughput Microarray technologies has provided various opportunities to systematically characterize diverse types of computational biological networks. Co-expression network have become popular in the analysis of microarray data, such as for detecting functional gene modules. RESULTS: This paper presents a method to build a co-expression network (CEN) and to detect network modules from the built network. We use an effective gene expression similarity measure called NMRS (Normalized mean residue similarity) to construct the CEN. We have tested our method on five publicly available benchmark microarray datasets. The network modules extracted by our algorithm have been biologically validated in terms of Q value and p value. CONCLUSIONS: Our results show that the technique is capable of detecting biologically significant network modules from the co-expression network. Biologist can use this technique to find groups of genes with similar functionality based on their expression information.

Assuntos

Biologia Computacional/métodos , Interpretação Estatística de Dados , Perfilação da Expressão Gênica/estatística & dados numéricos , Redes Reguladoras de Genes , Análise de Sequência com Séries de Oligonucleotídeos/estatística & dados numéricos , Algoritmos , Bases de Dados Genéticas/estatística & dados numéricos , Expressão Gênica

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA