ABSTRACT
Physical interactions between distal regulatory elements have a key role in regulating gene expression, but the extent to which these interactions vary between cell types and contribute to cell-type-specific gene expression remains unclear. Here, to address these questions as part of phase III of the Encyclopedia of DNA Elements (ENCODE), we mapped cohesin-mediated chromatin loops, using chromatin interaction analysis by paired-end tag sequencing (ChIA-PET), and analysed gene expression in 24 diverse human cell types, including core ENCODE cell lines. Twenty-eight per cent of all chromatin loops vary across cell types; these variations modestly correlate with changes in gene expression and are effective at grouping cell types according to their tissue of origin. The connectivity of genes corresponds to different functional classes, with housekeeping genes having few contacts, and dosage-sensitive genes being more connected to enhancer elements. This atlas of chromatin loops complements the diverse maps of regulatory architecture that comprise the ENCODE Encyclopedia, and will help to support emerging analyses of genome structure and function.
Subject(s)
Cell Cycle Proteins/metabolism , Chromatin/chemistry , Chromatin/genetics , Chromosomal Proteins, Non-Histone/metabolism , Genome, Human/genetics , Molecular Sequence Annotation , Alternative Splicing/genetics , Cell Differentiation/genetics , Cell Line , Cells/metabolism , Chromatin/metabolism , Chromatin Immunoprecipitation Sequencing , Enhancer Elements, Genetic/genetics , Gene Expression Regulation , Humans , Molecular Conformation , Promoter Regions, Genetic/genetics , CohesinsABSTRACT
The CCCTC-binding factor (CTCF) is known to establish long-range DNA contacts that alter the three-dimensional architecture of chromatin, but how the presence of CTCF influences nearby gene expression is still poorly understood. Here, we analyze CTCF chromatin immunoprecipitation sequencing, RNA sequencing, and Hi-C data, together with genotypes from a healthy human cohort, and measure statistical associations between inter-individual variability in CTCF binding and alternative exon usage. We demonstrate that CTCF-mediated chromatin loops between promoters and intragenic regions are prevalent and that when exons are in physical proximity with their promoters, CTCF binding correlates with exon inclusion in spliced mRNA. Genome-wide, CTCF-bound exons are enriched for genes involved in signaling and cellular stress-response pathways. Structural analysis of three specific examples, checkpoint kinase 2 (CHK2), CDC-like kinase 3 (CLK3), and euchromatic histone-lysine N-methyltransferase (EHMT1), suggests that CTCF-mediated exon inclusion is likely to downregulate enzyme activity by disrupting annotated protein domains. In total, our study suggests that alternative exon usage is regulated by CTCF-dependent chromatin structure.