Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Language
Publication year range
1.
Methods Mol Biol ; 1613: 291-310, 2017.
Article in English | MEDLINE | ID: mdl-28849565

ABSTRACT

Analysis of gene co-expression networks is a powerful "data-driven" tool, invaluable for understanding cancer biology and mechanisms of tumor development. Yet, despite of completion of thousands of studies on cancer gene expression, there were few attempts to normalize and integrate co-expression data from scattered sources in a concise "meta-analysis" framework. Here we describe an integrated approach to cancer expression meta-analysis, which combines generation of "data-driven" co-expression networks with detailed statistical detection of promoter sequence motifs within the co-expression clusters. First, we applied Weighted Gene Co-Expression Network Analysis (WGCNA) workflow and Pearson's correlation to generate a comprehensive set of over 3000 co-expression clusters in 82 normalized microarray datasets from nine cancers of different origin. Next, we designed a genome-wide statistical approach to the detection of specific DNA sequence motifs based on similarities between the promoters of similarly expressed genes. The approach, realized as cisExpress software module, was specifically designed for analysis of very large data sets such as those generated by publicly accessible whole genome and transcriptome projects. cisExpress uses a task farming algorithm to exploit all available computational cores within a shared memory node.We discovered that although co-expression modules are populated with different sets of genes, they share distinct stable patterns of co-regulation based on promoter sequence analysis. The number of motifs per co-expression cluster varies widely in accordance with cancer tissue of origin, with the largest number in colon (68 motifs) and the lowest in ovary (18 motifs). The top scored motifs are typically shared between several tissues; they define sets of target genes responsible for certain functionality of cancerogenesis. Both the co-expression modules and a database of precalculated motifs are publically available and accessible for further studies.


Subject(s)
Computational Biology/methods , Gene Regulatory Networks , Neoplasms/genetics , Algorithms , Gene Expression Profiling/methods , Gene Expression Regulation, Neoplastic , Humans , Oligonucleotide Array Sequence Analysis/methods , Response Elements
2.
PLoS One ; 11(11): e0165059, 2016.
Article in English | MEDLINE | ID: mdl-27824868

ABSTRACT

Gene coexpression network analysis is a powerful "data-driven" approach essential for understanding cancer biology and mechanisms of tumor development. Yet, despite the completion of thousands of studies on cancer gene expression, there have been few attempts to normalize and integrate co-expression data from scattered sources in a concise "meta-analysis" framework. We generated such a resource by exploring gene coexpression networks in 82 microarray datasets from 9 major human cancer types. The analysis was conducted using an elaborate weighted gene coexpression network (WGCNA) methodology and identified over 3,000 robust gene coexpression modules. The modules covered a range of known tumor features, such as proliferation, extracellular matrix remodeling, hypoxia, inflammation, angiogenesis, tumor differentiation programs, specific signaling pathways, genomic alterations, and biomarkers of individual tumor subtypes. To prioritize genes with respect to those tumor features, we ranked genes within each module by connectivity, leading to identification of module-specific functionally prominent hub genes. To showcase the utility of this network information, we positioned known cancer drug targets within the coexpression networks and predicted that Anakinra, an anti-rheumatoid therapeutic agent, may be promising for development in colorectal cancer. We offer a comprehensive, normalized and well documented collection of >3000 gene coexpression modules in a variety of cancers as a rich data resource to facilitate further progress in cancer research.


Subject(s)
Gene Expression/genetics , Gene Regulatory Networks/genetics , Neoplasms/genetics , Biomarkers, Tumor/genetics , Cell Differentiation/genetics , Cell Proliferation/genetics , Data Mining/methods , Drug Repositioning/methods , Gene Expression Profiling/methods , Genomics/methods , Humans , Hypoxia/genetics , Inflammation/genetics , Signal Transduction/genetics
3.
PLoS One ; 9(8): e102909, 2014.
Article in English | MEDLINE | ID: mdl-25170892

ABSTRACT

Detailed analysis of disease-affected tissue provides insight into molecular mechanisms contributing to pathogenesis. Substantia nigra, striatum, and cortex are functionally connected with increasing degrees of alpha-synuclein pathology in Parkinson's disease. We undertook functional and causal pathway analysis of gene expression and proteomic alterations in these three regions, and the data revealed pathways that correlated with disease progression. In addition, microarray and RNAseq experiments revealed previously unidentified causal changes related to oligodendrocyte function and synaptic vesicle release, and these and other changes were reflected across all brain regions. Importantly, subsets of these changes were replicated in Parkinson's disease blood; suggesting peripheral tissue may provide important avenues for understanding and measuring disease status and progression. Proteomic assessment revealed alterations in mitochondria and vesicular transport proteins that preceded gene expression changes indicating defects in translation and/or protein turnover. Our combined approach of proteomics, RNAseq and microarray analyses provides a comprehensive view of the molecular changes that accompany functional loss and alpha-synuclein pathology in Parkinson's disease, and may be instrumental to understand, diagnose and follow Parkinson's disease progression.


Subject(s)
Brain/pathology , Parkinson Disease/metabolism , Parkinson Disease/pathology , Animals , Brain/metabolism , Disease Progression , Gene Expression Regulation , Humans , Microarray Analysis , Proteins/analysis , Proteins/genetics , Proteins/metabolism , Proteomics , Sequence Analysis, RNA , Signal Transduction , alpha-Synuclein/analysis , alpha-Synuclein/genetics , alpha-Synuclein/metabolism
4.
PLoS One ; 8(4): e60618, 2013.
Article in English | MEDLINE | ID: mdl-23593264

ABSTRACT

The discovery of novel drug targets is a significant challenge in drug development. Although the human genome comprises approximately 30,000 genes, proteins encoded by fewer than 400 are used as drug targets in the treatment of diseases. Therefore, novel drug targets are extremely valuable as the source for first in class drugs. On the other hand, many of the currently known drug targets are functionally pleiotropic and involved in multiple pathologies. Several of them are exploited for treating multiple diseases, which highlights the need for methods to reliably reposition drug targets to new indications. Network-based methods have been successfully applied to prioritize novel disease-associated genes. In recent years, several such algorithms have been developed, some focusing on local network properties only, and others taking the complete network topology into account. Common to all approaches is the understanding that novel disease-associated candidates are in close overall proximity to known disease genes. However, the relevance of these methods to the prediction of novel drug targets has not yet been assessed. Here, we present a network-based approach for the prediction of drug targets for a given disease. The method allows both repositioning drug targets known for other diseases to the given disease and the prediction of unexploited drug targets which are not used for treatment of any disease. Our approach takes as input a disease gene expression signature and a high-quality interaction network and outputs a prioritized list of drug targets. We demonstrate the high performance of our method and highlight the usefulness of the predictions in three case studies. We present novel drug targets for scleroderma and different types of cancer with their underlying biological processes. Furthermore, we demonstrate the ability of our method to identify non-suspected repositioning candidates using diabetes type 1 as an example.


Subject(s)
Computational Biology/methods , Drug Discovery/methods , Drug Repositioning , Algorithms , Cluster Analysis , Computer Simulation , Gene Expression Profiling , Gene Regulatory Networks , Humans , Molecular Targeted Therapy , ROC Curve , Reproducibility of Results
5.
PLoS One ; 7(4): e35618, 2012.
Article in English | MEDLINE | ID: mdl-22558177

ABSTRACT

Cilia are cell organelles that play important roles in cell motility, sensory and developmental functions and are involved in a range of human diseases, known as ciliopathies. Here, we search for novel human genes related to cilia using a strategy that exploits the previously reported tendency of cell type-specific genes to be coexpressed in the transcriptome of complex tissues. Gene coexpression networks were constructed using the noise-resistant WGCNA algorithm in 12 publicly available microarray datasets from human tissues rich in motile cilia: airways, fallopian tubes and brain. A cilia-related coexpression module was detected in 10 out of the 12 datasets. A consensus analysis of this module's gene composition recapitulated 297 known and predicted 74 novel cilia-related genes. 82% of the novel candidates were supported by tissue-specificity expression data from GEO and/or proteomic data from the Human Protein Atlas. The novel findings included a set of genes (DCDC2, DYX1C1, KIAA0319) related to a neurological disease dyslexia suggesting their potential involvement in ciliary functions. Furthermore, we searched for differences in gene composition of the ciliary module between the tissues. A multidrug-and-toxin extrusion transporter MATE2 (SLC47A2) was found as a brain-specific central gene in the ciliary module. We confirm the localization of MATE2 in cilia by immunofluorescence staining using MDCK cells as a model. While MATE2 has previously gained attention as a pharmacologically relevant transporter, its potential relation to cilia is suggested for the first time. Taken together, our large-scale analysis of gene coexpression networks identifies novel genes related to human cell cilia.


Subject(s)
Cilia/genetics , Dyslexia/genetics , Microtubule-Associated Proteins/genetics , Nerve Tissue Proteins/genetics , Nuclear Proteins/genetics , Organic Cation Transport Proteins/genetics , Proteomics/methods , Algorithms , Animals , Brain/cytology , Brain/metabolism , Cell Line , Cilia/metabolism , Cytoskeletal Proteins , Databases, Genetic , Dogs , Fallopian Tubes/cytology , Fallopian Tubes/metabolism , Female , Gene Expression Profiling , Gene Expression Regulation , Humans , Oligonucleotide Array Sequence Analysis , Organ Specificity , Respiratory System/cytology , Respiratory System/metabolism , Transcriptome
6.
Cancer Res ; 70(24): 10060-70, 2010 Dec 15.
Article in English | MEDLINE | ID: mdl-21159630

ABSTRACT

Gliomas are primary brain tumors with high mortality and heterogeneous biology that is insufficiently understood. In this study, we performed a systematic analysis of the intrinsic organization of complex glioma transcriptome to gain deeper knowledge of the tumor biology. Gene coexpression relationships were explored in 790 glioma samples from 5 published patient cohorts treated at different institutions. We identified 20 coexpression modules that were common to all the data sets and associated with proliferation, angiogenesis, hypoxia, immune response, genomic alterations, cell differentiation phenotypes, and other features inherent to glial tumors. A collection of high-quality signatures for the respective processes was obtained using cross-data set summarization of the modules' gene composition. Individual modules were found to be organized into higher order coexpression groups, the two largest of them associated with glioblastoma and oligodendroglioma, respectively. We identified a novel prognostic gene expression signature (185 genes) linked to a proastrocytic pattern of tumor cell differentiation. This "proastrocytic" signature was associated with long survival and defined a subgroup of the previously established "proneural" class of gliomas. A strong negative correlation between proastrocytic and proneural markers across differentiated tumors underscored the distinction between these subtypes of glioma. Interestingly, one further novel signature in glioma was identified that was associated with EGFR (epidermal growth factor receptor) gene amplification and suggested that EGF signaling in glioma may be a subject to regulation by Sprouty family proteins. In summary, this integrated analysis of the glioma transcriptome provided several novel insights into molecular heterogeneity and pathogenesis of glial tumors.


Subject(s)
Brain Neoplasms/genetics , Gene Expression Regulation, Neoplastic , Gene Regulatory Networks , Glioma/genetics , Algorithms , Astrocytoma/enzymology , Astrocytoma/genetics , Astrocytoma/pathology , Brain Neoplasms/enzymology , Brain Neoplasms/pathology , Cell Differentiation/genetics , ErbB Receptors/genetics , ErbB Receptors/metabolism , Gene Amplification , Gene Expression Profiling , Glioma/enzymology , Glioma/pathology , Humans , Oligodendroglioma/enzymology , Oligodendroglioma/genetics , Oligodendroglioma/pathology , Signal Transduction/genetics
7.
J Bioinform Comput Biol ; 6(4): 811-24, 2008 Aug.
Article in English | MEDLINE | ID: mdl-18763744

ABSTRACT

The identification of orthologs to a set of known genes is often the starting point for evolutionary studies focused on gene families of interest. To date, the existing orthology detection tools (COG, InParanoid, OrthoMCL, etc.) are aimed at genome-wide ortholog identification and lack flexibility for the purposes of case studies. We developed a program OrthoFocus, which employs an extended reciprocal best hit approach to quickly search for orthologs in a pair of genomes. A group of paralogs from the input genome is used as the start for the forward search and the criterion for the reverse search, which allows handling many-to-one and many-to-many relationships. By pairwise comparison of genomes with the input species genome, OrthoFocus enables quick identification of orthologs in multiple genomes and generates a multiple alignment of orthologs so that it can further be used in phylogenetic analysis. The program is available at http://www.lipidomics.ru/.


Subject(s)
Algorithms , Chromosome Mapping/methods , Sequence Alignment/methods , Sequence Analysis, DNA/methods , Software , Base Sequence , Molecular Sequence Data , Sequence Homology, Nucleic Acid
8.
Nucleic Acids Res ; 36(Web Server issue): W327-31, 2008 Jul 01.
Article in English | MEDLINE | ID: mdl-18463138

ABSTRACT

The major public microarray repositories Gene Expression Omnibus and ArrayExpress are growing rapidly. This enables meta-analysis studies, in which expression data from multiple individual studies are combined. To facilitate these types of studies, we developed Microarray Retriever for searching and retrieval of data from GEO and ArrayExpress. The tool allows access to the two repositories simultaneously, to search in the repositories using complex queries, to retrieve microarray data for published articles and to download data in one structured archive. The tool is available on the web at: http://www.lgtc.nl/MaRe/


Subject(s)
Databases, Genetic , Gene Expression Profiling/methods , Oligonucleotide Array Sequence Analysis/methods , Software , Internet , Meta-Analysis as Topic
SELECTION OF CITATIONS
SEARCH DETAIL
...