Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters










Database
Language
Publication year range
1.
Ann N Y Acad Sci ; 1158: 29-35, 2009 Mar.
Article in English | MEDLINE | ID: mdl-19348629

ABSTRACT

Thanks to the availability of high-throughput omics data, bioinformatics approaches are able to hypothesize thus-far undocumented genetic interactions. However, due to the amount of noise in these data, inferences based on a single data source are often unreliable. A popular approach to overcome this problem is to integrate different data sources. In this study, we describe DISTILLER, a novel framework for data integration that simultaneously analyzes microarray and motif information to find modules that consist of genes that are co-expressed in a subset of conditions, and their corresponding regulators. By applying our method on publicly available data, we evaluated the condition-specific transcriptional network of Escherichia coli. DISTILLER confirmed 62% of 736 interactions described in RegulonDB, and 278 novel interactions were predicted.


Subject(s)
Computational Biology/methods , Escherichia coli/genetics , Gene Expression Regulation, Bacterial , Gene Regulatory Networks , Algorithms , Databases, Genetic , Gene Expression Profiling , Models, Genetic , Oligonucleotide Array Sequence Analysis
2.
Genome Biol ; 10(3): R27, 2009.
Article in English | MEDLINE | ID: mdl-19265557

ABSTRACT

We present DISTILLER, a data integration framework for the inference of transcriptional module networks. Experimental validation of predicted targets for the well-studied fumarate nitrate reductase regulator showed the effectiveness of our approach in Escherichia coli. In addition, the condition dependency and modularity of the inferred transcriptional network was studied. Surprisingly, the level of regulatory complexity seemed lower than that which would be expected from RegulonDB, indicating that complex regulatory programs tend to decrease the degree of modularity.


Subject(s)
Computational Biology/methods , Escherichia coli/genetics , Gene Regulatory Networks , Regulon/genetics , Software , Chromatin Immunoprecipitation , Gene Expression Regulation, Bacterial , Transcription Factors
3.
BMC Bioinformatics ; 10 Suppl 1: S30, 2009 Jan 30.
Article in English | MEDLINE | ID: mdl-19208131

ABSTRACT

BACKGROUND: The detection of cis-regulatory modules (CRMs) that mediate transcriptional responses in eukaryotes remains a key challenge in the postgenomic era. A CRM is characterized by a set of co-occurring transcription factor binding sites (TFBS). In silico methods have been developed to search for CRMs by determining the combination of TFBS that are statistically overrepresented in a certain geneset. Most of these methods solve this combinatorial problem by relying on computational intensive optimization methods. As a result their usage is limited to finding CRMs in small datasets (containing a few genes only) and using binding sites for a restricted number of transcription factors (TFs) out of which the optimal module will be selected. RESULTS: We present an itemset mining based strategy for computationally detecting cis-regulatory modules (CRMs) in a set of genes. We tested our method by applying it on a large benchmark data set, derived from a ChIP-Chip analysis and compared its performance with other well known cis-regulatory module detection tools. CONCLUSION: We show that by exploiting the computational efficiency of an itemset mining approach and combining it with a well-designed statistical scoring scheme, we were able to prioritize the biologically valid CRMs in a large set of coregulated genes using binding sites for a large number of potential TFs as input.


Subject(s)
Algorithms , Regulatory Elements, Transcriptional , Software , Transcription Factors/metabolism , Binding Sites , Databases, Genetic , Models, Genetic
4.
Artif Life ; 14(1): 49-63, 2008.
Article in English | MEDLINE | ID: mdl-18171130

ABSTRACT

The development of structure-learning algorithms for gene regulatory networks depends heavily on the availability of synthetic data sets that contain both the original network and associated expression data. This article reports the application of SynTReN, an existing network generator that samples topologies from existing biological networks and uses Michaelis-Menten and Hill enzyme kinetics to simulate gene interactions. We illustrate the effects of different aspects of the expression data on the quality of the inferred network. The tested expression data parameters are network size, network topology, type and degree of noise, quantity of expression data, and interaction types between genes. This is done by applying three well-known inference algorithms to SynTReN data sets. The results show the power of synthetic data in revealing operational characteristics of inference algorithms that are unlikely to be discovered by means of biological microarray data only.


Subject(s)
Algorithms , Computer Simulation , Gene Regulatory Networks , Models, Biological , Transcription, Genetic , Artificial Intelligence , Databases, Genetic , Software
5.
Bioinformatics ; 23(19): 2573-80, 2007 Oct 01.
Article in English | MEDLINE | ID: mdl-17686800

ABSTRACT

MOTIVATION: Existing (bi)clustering methods for microarray data analysis often do not answer the specific questions of interest to a biologist. Such specific questions could be derived from other information sources, including expert prior knowledge. More specifically, given a set of seed genes which are believed to have a common function, we would like to recruit genes with similar expression profiles as the seed genes in a significant subset of experimental conditions. RESULTS: We introduce QDB, a novel Bayesian query-driven biclustering framework in which the prior distributions allow introducing knowledge from a set of seed genes (query) to guide the pattern search. In two well-known yeast compendia, we grow highly functionally enriched biclusters from small sets of seed genes using a resolution sweep approach. In addition, relevant conditions are identified and modularity of the biclusters is demonstrated, including the discovery of overlapping modules. Finally, our method deals with missing values naturally, performs well on artificial data from a recent biclustering benchmark study and has a number of conceptual advantages when compared to existing approaches for focused module search.


Subject(s)
Cluster Analysis , Gene Expression Profiling/methods , Multigene Family/physiology , Oligonucleotide Array Sequence Analysis/methods , Pattern Recognition, Automated/methods , Proteome/metabolism , Signal Transduction/physiology , Algorithms , Artificial Intelligence
6.
Genome Biol ; 7(5): R37, 2006.
Article in English | MEDLINE | ID: mdl-16677396

ABSTRACT

'ReMoDiscovery' is an intuitive algorithm to correlate regulatory programs with regulators and corresponding motifs to a set of co-expressed genes. It exploits in a concurrent way three independent data sources: ChIP-chip data, motif information and gene expression profiles. When compared to published module discovery algorithms, ReMoDiscovery is fast and easily tunable. We evaluated our method on yeast data, where it was shown to generate biologically meaningful findings and allowed the prediction of potential novel roles of transcriptional regulators.


Subject(s)
Algorithms , Chromatin Immunoprecipitation , Gene Expression Profiling , Gene Expression Regulation , Oligonucleotide Array Sequence Analysis , Amino Acids/metabolism , Cell Cycle , Galactose/metabolism , Regulatory Elements, Transcriptional , Ribosomes/metabolism , Software , Transcription Factors/metabolism , Yeasts/genetics , Yeasts/growth & development , Yeasts/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...