Search | VHL Regional Portal

Profiling the quantitative occupancy of myriad transcription factors across conditions by modeling chromatin accessibility data.

Luo, Kaixuan; Zhong, Jianling; Safi, Alexias; Hong, Linda K; Tewari, Alok K; Song, Lingyun; Reddy, Timothy E; Ma, Li; Crawford, Gregory E; Hartemink, Alexander J.

Genome Res ; 32(6): 1183-1198, 2022 06.

Article in English | MEDLINE | ID: mdl-35609992

ABSTRACT

Over a thousand different transcription factors (TFs) bind with varying occupancy across the human genome. Chromatin immunoprecipitation (ChIP) can assay occupancy genome-wide, but only one TF at a time, limiting our ability to comprehensively observe the TF occupancy landscape, let alone quantify how it changes across conditions. We developed TF occupancy profiler (TOP), a Bayesian hierarchical regression framework, to profile genome-wide quantitative occupancy of numerous TFs using data from a single chromatin accessibility experiment (DNase- or ATAC-seq). TOP is supervised, and its hierarchical structure allows it to predict the occupancy of any sequence-specific TF, even those never assayed with ChIP. We used TOP to profile the quantitative occupancy of hundreds of sequence-specific TFs at sites throughout the genome and examined how their occupancies changed in multiple contexts: in approximately 200 human cell types, through 12 h of exposure to different hormones, and across the genetic backgrounds of 70 individuals. TOP enables cost-effective exploration of quantitative changes in the landscape of TF binding.

Subject(s)

Chromatin , Transcription Factors , Bayes Theorem , Binding Sites/genetics , Chromatin/genetics , Genome, Human , Humans , Protein Binding , Transcription Factors/metabolism

Causal network inference from gene transcriptional time-series response to glucocorticoids.

Lu, Jonathan; Dumitrascu, Bianca; McDowell, Ian C; Jo, Brian; Barrera, Alejandro; Hong, Linda K; Leichter, Sarah M; Reddy, Timothy E; Engelhardt, Barbara E.

PLoS Comput Biol ; 17(1): e1008223, 2021 01.

Article in English | MEDLINE | ID: mdl-33513136

ABSTRACT

Gene regulatory network inference is essential to uncover complex relationships among gene pathways and inform downstream experiments, ultimately enabling regulatory network re-engineering. Network inference from transcriptional time-series data requires accurate, interpretable, and efficient determination of causal relationships among thousands of genes. Here, we develop Bootstrap Elastic net regression from Time Series (BETS), a statistical framework based on Granger causality for the recovery of a directed gene network from transcriptional time-series data. BETS uses elastic net regression and stability selection from bootstrapped samples to infer causal relationships among genes. BETS is highly parallelized, enabling efficient analysis of large transcriptional data sets. We show competitive accuracy on a community benchmark, the DREAM4 100-gene network inference challenge, where BETS is one of the fastest among methods of similar performance and additionally infers whether causal effects are activating or inhibitory. We apply BETS to transcriptional time-series data of differentially-expressed genes from A549 cells exposed to glucocorticoids over a period of 12 hours. We identify a network of 2768 genes and 31,945 directed edges (FDR ≤ 0.2). We validate inferred causal network edges using two external data sources: Overexpression experiments on the same glucocorticoid system, and genetic variants associated with inferred edges in primary lung tissue in the Genotype-Tissue Expression (GTEx) v6 project. BETS is available as an open source software package at https://github.com/lujonathanh/BETS.

Subject(s)

Glucocorticoids/pharmacology , Models, Statistical , Transcriptome/drug effects , A549 Cells , Algorithms , Computational Biology , Humans , Lung/chemistry , Lung/metabolism , Machine Learning , Software , Transcriptome/genetics

Glucocorticoid receptor recruits to enhancers and drives activation by motif-directed binding.

McDowell, Ian C; Barrera, Alejandro; D'Ippolito, Anthony M; Vockley, Christopher M; Hong, Linda K; Leichter, Sarah M; Bartelt, Luke C; Majoros, William H; Song, Lingyun; Safi, Alexias; Koçak, D Dewran; Gersbach, Charles A; Hartemink, Alexander J; Crawford, Gregory E; Engelhardt, Barbara E; Reddy, Timothy E.

Genome Res ; 28(9): 1272-1284, 2018 09.

Article in English | MEDLINE | ID: mdl-30097539

ABSTRACT

Glucocorticoids are potent steroid hormones that regulate immunity and metabolism by activating the transcription factor (TF) activity of glucocorticoid receptor (GR). Previous models have proposed that DNA binding motifs and sites of chromatin accessibility predetermine GR binding and activity. However, there are vast excesses of both features relative to the number of GR binding sites. Thus, these features alone are unlikely to account for the specificity of GR binding and activity. To identify genomic and epigenetic contributions to GR binding specificity and the downstream changes resultant from GR binding, we performed hundreds of genome-wide measurements of TF binding, epigenetic state, and gene expression across a 12-h time course of glucocorticoid exposure. We found that glucocorticoid treatment induces GR to bind to nearly all pre-established enhancers within minutes. However, GR binds to only a small fraction of the set of accessible sites that lack enhancer marks. Once GR is bound to enhancers, a combination of enhancer motif composition and interactions between enhancers then determines the strength and persistence of GR binding, which consequently correlates with dramatic shifts in enhancer activation. Over the course of several hours, highly coordinated changes in TF binding and histone modification occupancy occur specifically within enhancers, and these changes correlate with changes in the expression of nearby genes. Following GR binding, changes in the binding of other TFs precede changes in chromatin accessibility, suggesting that other TFs are also sensitive to genomic features beyond that of accessibility.

Subject(s)

Enhancer Elements, Genetic , Histone Code , Nucleotide Motifs , Receptors, Glucocorticoid/metabolism , Transcriptional Activation , Cell Line, Tumor , Epigenesis, Genetic , Humans , Protein Binding , Transcription Factors/metabolism

Pre-established Chromatin Interactions Mediate the Genomic Response to Glucocorticoids.

D'Ippolito, Anthony M; McDowell, Ian C; Barrera, Alejandro; Hong, Linda K; Leichter, Sarah M; Bartelt, Luke C; Vockley, Christopher M; Majoros, William H; Safi, Alexias; Song, Lingyun; Gersbach, Charles A; Crawford, Gregory E; Reddy, Timothy E.

Cell Syst ; 7(2): 146-160.e7, 2018 08 22.

Article in English | MEDLINE | ID: mdl-30031775

ABSTRACT

The glucocorticoid receptor (GR) is a hormone-inducible transcription factor involved in metabolic and anti-inflammatory gene expression responses. To investigate what controls interactions between GR binding sites and their target genes, we used in situ Hi-C to generate high-resolution, genome-wide maps of chromatin interactions before and after glucocorticoid treatment. We found that GR binding to the genome typically does not cause new chromatin interactions to target genes but instead acts through chromatin interactions that already exist prior to hormone treatment. Both glucocorticoid-induced and glucocorticoid-repressed genes increased interactions with distal GR binding sites. In addition, while glucocorticoid-induced genes increased interactions with transcriptionally active chromosome compartments, glucocorticoid-repressed genes increased interactions with transcriptionally silent compartments. Lastly, while the architectural DNA-binding proteins CTCF and RAD21 were bound to most chromatin interactions, we found that glucocorticoid-responsive chromatin interactions were depleted for CTCF binding but enriched for RAD21. Together, these findings offer new insights into the mechanisms underlying GC-mediated gene activation and repression.

Subject(s)

Chromatin/metabolism , Gene Expression Regulation , Glucocorticoids/metabolism , Receptors, Glucocorticoid/metabolism , Binding Sites , CCCTC-Binding Factor/metabolism , Cell Cycle Proteins , Cell Line , Chromatin/genetics , DNA-Binding Proteins , Genome, Human , Humans , Nuclear Proteins/metabolism , Phosphoproteins/metabolism , Protein Binding

Exome sequencing followed by large-scale genotyping fails to identify single rare variants of large effect in idiopathic generalized epilepsy.

Heinzen, Erin L; Depondt, Chantal; Cavalleri, Gianpiero L; Ruzzo, Elizabeth K; Walley, Nicole M; Need, Anna C; Ge, Dongliang; He, Min; Cirulli, Elizabeth T; Zhao, Qian; Cronin, Kenneth D; Gumbs, Curtis E; Campbell, C Ryan; Hong, Linda K; Maia, Jessica M; Shianna, Kevin V; McCormack, Mark; Radtke, Rodney A; O'Conner, Gerard D; Mikati, Mohamad A; Gallentine, William B; Husain, Aatif M; Sinha, Saurabh R; Chinthapalli, Krishna; Puranam, Ram S; McNamara, James O; Ottman, Ruth; Sisodiya, Sanjay M; Delanty, Norman; Goldstein, David B.

Am J Hum Genet ; 91(2): 293-302, 2012 Aug 10.

Article in English | MEDLINE | ID: mdl-22863189

ABSTRACT

Idiopathic generalized epilepsy (IGE) is a complex disease with high heritability, but little is known about its genetic architecture. Rare copy-number variants have been found to explain nearly 3% of individuals with IGE; however, it remains unclear whether variants with moderate effect size and frequencies below what are reliably detected with genome-wide association studies contribute significantly to disease risk. In this study, we compare the exome sequences of 118 individuals with IGE and 242 controls of European ancestry by using next-generation sequencing. The exome-sequenced epilepsy cases include study subjects with two forms of IGE, including juvenile myoclonic epilepsy (n = 93) and absence epilepsy (n = 25). However, our discovery strategy did not assume common genetic control between the subtypes of IGE considered. In the sequence data, as expected, no variants were significantly associated with the IGE phenotype or more specific IGE diagnoses. We then selected 3,897 candidate epilepsy-susceptibility variants from the sequence data and genotyped them in a larger set of 878 individuals with IGE and 1,830 controls. Again, no variant achieved statistical significance. However, 1,935 variants were observed exclusively in cases either as heterozygous or homozygous genotypes. It is likely that this set of variants includes real risk factors. The lack of significant association evidence of single variants with disease in this two-stage approach emphasizes the high genetic heterogeneity of epilepsy disorders, suggests that the impact of any individual single-nucleotide variant in this disease is small, and indicates that gene-based approaches might be more successful for future sequencing studies of epilepsy predisposition.

Subject(s)

Epilepsy, Generalized/genetics , Exome/genetics , Genetic Predisposition to Disease/genetics , Base Sequence , Genome-Wide Association Study , Genotype , Humans , Molecular Sequence Data , Sequence Alignment , Sequence Analysis, DNA , White People/genetics

The characterization of twenty sequenced human genomes.

Pelak, Kimberly; Shianna, Kevin V; Ge, Dongliang; Maia, Jessica M; Zhu, Mingfu; Smith, Jason P; Cirulli, Elizabeth T; Fellay, Jacques; Dickson, Samuel P; Gumbs, Curtis E; Heinzen, Erin L; Need, Anna C; Ruzzo, Elizabeth K; Singh, Abanish; Campbell, C Ryan; Hong, Linda K; Lornsen, Katharina A; McKenzie, Alexander M; Sobreira, Nara L M; Hoover-Fong, Julie E; Milner, Joshua D; Ottman, Ruth; Haynes, Barton F; Goedert, James J; Goldstein, David B.

PLoS Genet ; 6(9): e1001111, 2010 Sep 09.

Article in English | MEDLINE | ID: mdl-20838461

ABSTRACT

We present the analysis of twenty human genomes to evaluate the prospects for identifying rare functional variants that contribute to a phenotype of interest. We sequenced at high coverage ten "case" genomes from individuals with severe hemophilia A and ten "control" genomes. We summarize the number of genetic variants emerging from a study of this magnitude, and provide a proof of concept for the identification of rare and highly-penetrant functional variants by confirming that the cause of hemophilia A is easily recognizable in this data set. We also show that the number of novel single nucleotide variants (SNVs) discovered per genome seems to stabilize at about 144,000 new variants per genome, after the first 15 individuals have been sequenced. Finally, we find that, on average, each genome carries 165 homozygous protein-truncating or stop loss variants in genes representing a diverse set of pathways.

Subject(s)

Genome, Human/genetics , Sequence Analysis, DNA , Base Sequence , Case-Control Studies , DNA Copy Number Variations/genetics , Databases, Genetic , Exons/genetics , Factor VIII/genetics , Gene Duplication/genetics , Gene Knockout Techniques , Genetics, Population , Genotype , Hemophilia A/genetics , Humans , INDEL Mutation/genetics , Oligonucleotide Array Sequence Analysis , Open Reading Frames/genetics , Polymorphism, Genetic , Polymorphism, Single Nucleotide/genetics

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL