Automated discovery of tissue-targeting enhancers and transcription factors from binding motif and gene function data.
PLoS Comput Biol
; 10(1): e1003449, 2014 Jan.
Article
in En
| MEDLINE
| ID: mdl-24499934
Identifying enhancers regulating gene expression remains an important and challenging task. While recent sequencing-based methods provide epigenomic characteristics that correlate well with enhancer activity, it remains onerous to comprehensively identify all enhancers across development. Here we introduce a computational framework to identify tissue-specific enhancers evolving under purifying selection. First, we incorporate high-confidence binding site predictions with target gene functional enrichment analysis to identify transcription factors (TFs) likely functioning in a particular context. We then search the genome for clusters of binding sites for these TFs, overcoming previous constraints associated with biased manual curation of TFs or enhancers. Applying our method to the placenta, we find 33 known and implicate 17 novel TFs in placental function, and discover 2,216 putative placenta enhancers. Using luciferase reporter assays, 31/36 (86%) tested candidates drive activity in placental cells. Our predictions agree well with recent epigenomic data in human and mouse, yet over half our loci, including 7/8 (87%) tested regions, are novel. Finally, we establish that our method is generalizable by applying it to 5 additional tissues: heart, pancreas, blood vessel, bone marrow, and liver.
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Transcription Factors
/
Enhancer Elements, Genetic
Type of study:
Prognostic_studies
Limits:
Animals
/
Female
/
Humans
/
Pregnancy
Language:
En
Journal:
PLoS Comput Biol
Journal subject:
BIOLOGIA
/
INFORMATICA MEDICA
Year:
2014
Document type:
Article
Affiliation country:
United States
Country of publication:
United States