Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 11 de 11
Filter
Add more filters










Publication year range
1.
Mol Cell ; 84(10): 1842-1854.e7, 2024 May 16.
Article in English | MEDLINE | ID: mdl-38759624

ABSTRACT

Genomic context critically modulates regulatory function but is difficult to manipulate systematically. The murine insulin-like growth factor 2 (Igf2)/H19 locus is a paradigmatic model of enhancer selectivity, whereby CTCF occupancy at an imprinting control region directs downstream enhancers to activate either H19 or Igf2. We used synthetic regulatory genomics to repeatedly replace the native locus with 157-kb payloads, and we systematically dissected its architecture. Enhancer deletion and ectopic delivery revealed previously uncharacterized long-range regulatory dependencies at the native locus. Exchanging the H19 enhancer cluster with the Sox2 locus control region (LCR) showed that the H19 enhancers relied on their native surroundings while the Sox2 LCR functioned autonomously. Analysis of regulatory DNA actuation across cell types revealed that these enhancer clusters typify broader classes of context sensitivity genome wide. These results show that unexpected dependencies influence even well-studied loci, and our approach permits large-scale manipulation of complete loci to investigate the relationship between regulatory architecture and function.


Subject(s)
CCCTC-Binding Factor , Enhancer Elements, Genetic , Insulin-Like Growth Factor II , RNA, Long Noncoding , SOXB1 Transcription Factors , Animals , Mice , CCCTC-Binding Factor/metabolism , CCCTC-Binding Factor/genetics , Insulin-Like Growth Factor II/genetics , Insulin-Like Growth Factor II/metabolism , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism , SOXB1 Transcription Factors/genetics , SOXB1 Transcription Factors/metabolism , Locus Control Region/genetics , Genomic Imprinting , Genomics/methods
2.
bioRxiv ; 2024 Mar 12.
Article in English | MEDLINE | ID: mdl-37781588

ABSTRACT

Enhancer function is frequently investigated piecemeal using truncated reporter assays or single deletion analysis. Thus it remains unclear to what extent enhancer function at native loci relies on surrounding genomic context. Using the Big-IN technology for targeted integration of large DNAs, we analyzed the regulatory architecture of the murine Igf2/H19 locus, a paradigmatic model of enhancer selectivity. We assembled payloads containing a 157-kb functional Igf2/H19 locus and engineered mutations to genetically direct CTCF occupancy at the imprinting control region (ICR) that switches the target gene of the H19 enhancer cluster. Contrasting activity of payloads delivered at the endogenous Igf2/H19 locus or ectopically at Hprt revealed that the Igf2/H19 locus includes additional, previously unknown long-range regulatory elements. Exchanging components of the Igf2/H19 locus with the well-studied Sox2 locus showed that the H19 enhancer cluster functioned poorly out of context, and required its native surroundings to activate Sox2 expression. Conversely, the Sox2 locus control region (LCR) could activate both Igf2 and H19 outside its native context, but its activity was only partially modulated by CTCF occupancy at the ICR. Analysis of regulatory DNA actuation across different cell types revealed that, while the H19 enhancers are tightly coordinated within their native locus, the Sox2 LCR acts more independently. We show that these enhancer clusters typify broader classes of loci genome-wide. Our results show that unexpected dependencies may influence even the most studied functional elements, and our synthetic regulatory genomics approach permits large-scale manipulation of complete loci to investigate the relationship between locus architecture and function.

3.
Mol Cell ; 83(7): 1140-1152.e7, 2023 04 06.
Article in English | MEDLINE | ID: mdl-36931273

ABSTRACT

Sox2 expression in mouse embryonic stem cells (mESCs) depends on a distal cluster of DNase I hypersensitive sites (DHSs), but their individual contributions and degree of interdependence remain a mystery. We analyzed the endogenous Sox2 locus using Big-IN to scarlessly integrate large DNA payloads incorporating deletions, rearrangements, and inversions affecting single or multiple DHSs, as well as surgical alterations to transcription factor (TF) recognition sequences. Multiple mESC clones were derived for each payload, sequence-verified, and analyzed for Sox2 expression. We found that two DHSs comprising a handful of key TF recognition sequences were each sufficient for long-range activation of Sox2 expression. By contrast, three nearby DHSs were entirely context dependent, showing no activity alone but dramatically augmenting the activity of the autonomous DHSs. Our results highlight the role of context in modulating genomic regulatory element function, and our synthetic regulatory genomics approach provides a roadmap for the dissection of other genomic loci.


Subject(s)
Gene Expression Regulation , Regulatory Sequences, Nucleic Acid , Animals , Mice , Enhancer Elements, Genetic , Genomics , Regulatory Sequences, Nucleic Acid/genetics , Transcription Factors/genetics , Transcription Factors/metabolism , SOXB1 Transcription Factors/metabolism
4.
Hum Genet ; 141(8): 1431-1447, 2022 Aug.
Article in English | MEDLINE | ID: mdl-35147782

ABSTRACT

Drug development and biological discovery require effective strategies to map existing genetic associations to causal genes. To approach this problem, we selected 12 common diseases and quantitative traits for which highly powered genome-wide association studies (GWAS) were available. For each disease or trait, we systematically curated positive control gene sets from Mendelian forms of the disease and from targets of medicines used for disease treatment. We found that these positive control genes were highly enriched in proximity of GWAS-associated single-nucleotide variants (SNVs). We then performed quantitative assessment of the contribution of commonly used genomic features, including open chromatin maps, expression quantitative trait loci (eQTL), and chromatin conformation data. Using these features, we trained and validated an Effector Index (Ei), to map target genes for these 12 common diseases and traits. Ei demonstrated high predictive performance, both with cross-validation on the training set, and an independently derived set for type 2 diabetes. Key predictive features included coding or transcript-altering SNVs, distance to gene, and open chromatin-based metrics. This work outlines a simple, understandable approach to prioritize genes at GWAS loci for functional follow-up and drug development, and provides a systematic strategy for prioritization of GWAS target genes.


Subject(s)
Diabetes Mellitus, Type 2 , Genome-Wide Association Study , Chromatin/genetics , Diabetes Mellitus, Type 2/genetics , Genetic Predisposition to Disease , Humans , Polymorphism, Single Nucleotide , Quantitative Trait Loci
5.
Genome Res ; 32(3): 425-436, 2022 03.
Article in English | MEDLINE | ID: mdl-35082140

ABSTRACT

The specificity of interactions between genomic regulatory elements and potential target genes is influenced by the binding of insulator proteins such as CTCF, which can act as potent enhancer blockers when interposed between an enhancer and a promoter in a reporter assay. But not all CTCF sites genome-wide function as insulator elements, depending on cellular and genomic context. To dissect the influence of genomic context on enhancer blocker activity, we integrated reporter constructs with promoter-only, promoter and enhancer, and enhancer blocker configurations at hundreds of thousands of genomic sites using the Sleeping Beauty transposase. Deconvolution of reporter activity by genomic position reveals distinct expression patterns subject to genomic context, including a compartment of enhancer blocker reporter integrations with robust expression. The high density of integration sites permits quantitative delineation of characteristic genomic context sensitivity profiles and their decomposition into sensitivity to both local and distant DNase I hypersensitive sites. Furthermore, using a single-cell expression approach to test the effect of integrated reporters for differential expression of nearby endogenous genes reveals that CTCF insulator elements do not completely abrogate reporter effects on endogenous gene expression. Collectively, our results lend new insight into genomic regulatory compartmentalization and its influence on the determinants of promoter-enhancer specificity.


Subject(s)
Enhancer Elements, Genetic , Insulator Elements , CCCTC-Binding Factor/genetics , CCCTC-Binding Factor/metabolism , Genomics , Promoter Regions, Genetic
6.
Nat Commun ; 12(1): 2850, 2021 05 14.
Article in English | MEDLINE | ID: mdl-33990600

ABSTRACT

Functional assessment of disease-associated sequence variation at non-coding regulatory elements is complicated by their high degree of context sensitivity to both the local chromatin and nuclear environments. Allelic profiling of DNA accessibility across individuals has shown that only a select minority of sequence variation affects transcription factor (TF) occupancy, yet low sequence diversity in human populations means that no experimental assessment is available for the majority of disease-associated variants. Here we describe high-resolution in vivo maps of allelic DNA accessibility in liver, kidney, lung and B cells from 5 increasingly diverged strains of F1 hybrid mice. The high density of heterozygous sites in these hybrids enables precise quantification of effect size and cell-type specificity for hundreds of thousands of variants throughout the mouse genome. We show that chromatin-altering variants delineate characteristic sensitivity profiles for hundreds of TF motifs. We develop a compendium of TF-specific sensitivity profiles accounting for genomic context effects. Finally, we link maps of allelic accessibility to allelic transcript levels in the same samples. This work provides a foundation for quantitative prediction of cell-type specific effects of non-coding variation on TF activity, which will facilitate both fine-mapping and systems-level analyses of common disease-associated variation in human genomes.


Subject(s)
DNA/genetics , Alleles , Animals , Binding Sites/genetics , Chromatin/genetics , Chromatin/metabolism , Chromosome Mapping , DNA/metabolism , Female , Gene Expression Regulation , Genetic Variation , Genome, Human , Humans , Hybridization, Genetic , Male , Mice , Mice, 129 Strain , Mice, Inbred C3H , Mice, Inbred C57BL , Organ Specificity/genetics , Penetrance , Regulatory Sequences, Nucleic Acid , Transcription Factors/metabolism
7.
Proc Natl Acad Sci U S A ; 118(10)2021 03 09.
Article in English | MEDLINE | ID: mdl-33649239

ABSTRACT

Routine rewriting of loci associated with human traits and diseases would facilitate their functional analysis. However, existing DNA integration approaches are limited in terms of scalability and portability across genomic loci and cellular contexts. We describe Big-IN, a versatile platform for targeted integration of large DNAs into mammalian cells. CRISPR/Cas9-mediated targeting of a landing pad enables subsequent recombinase-mediated delivery of variant payloads and efficient positive/negative selection for correct clones in mammalian stem cells. We demonstrate integration of constructs up to 143 kb, and an approach for one-step scarless delivery. We developed a staged pipeline combining PCR genotyping and targeted capture sequencing for economical and comprehensive verification of engineered stem cells. Our approach should enable combinatorial interrogation of genomic functional elements and systematic locus-scale analysis of genome function.


Subject(s)
CRISPR-Cas Systems , Gene Editing , Genetic Loci , Genome, Human , Human Embryonic Stem Cells , Mouse Embryonic Stem Cells , Animals , Cell Line , Humans , Mice
8.
Genetics ; 218(1)2021 05 17.
Article in English | MEDLINE | ID: mdl-33742653

ABSTRACT

Design and large-scale synthesis of DNA has been applied to the functional study of viral and microbial genomes. New and expanded technology development is required to unlock the transformative potential of such bottom-up approaches to the study of larger mammalian genomes. Two major challenges include assembling and delivering long DNA sequences. Here, we describe a workflow for de novo DNA assembly and delivery that enables functional evaluation of mammalian genes on the length scale of 100 kilobase pairs (kb). The DNA assembly step is supported by an integrated robotic workcell. We demonstrate assembly of the 101 kb human HPRT1 gene in yeast from 3 kb building blocks, precision delivery of the resulting construct to mouse embryonic stem cells, and subsequent expression of the human protein from its full-length human gene in mouse cells. This workflow provides a framework for mammalian genome writing. We envision utility in producing designer variants of human genes linked to disease and their delivery and functional analysis in cell culture or animal models.


Subject(s)
Cloning, Molecular/methods , Genetic Engineering/methods , Animals , DNA/genetics , Gene Transfer Techniques/veterinary , Genetic Techniques/veterinary , Genome/genetics , Genomics/methods , Humans , Hypoxanthine Phosphoribosyltransferase/genetics , Mice , Sequence Analysis, DNA/methods , Workflow
9.
Genome Res ; 30(12): 1781-1788, 2020 12.
Article in English | MEDLINE | ID: mdl-33093069

ABSTRACT

Effective public response to a pandemic relies upon accurate measurement of the extent and dynamics of an outbreak. Viral genome sequencing has emerged as a powerful approach to link seemingly unrelated cases, and large-scale sequencing surveillance can inform on critical epidemiological parameters. Here, we report the analysis of 864 SARS-CoV-2 sequences from cases in the New York City metropolitan area during the COVID-19 outbreak in spring 2020. The majority of cases had no recent travel history or known exposure, and genetically linked cases were spread throughout the region. Comparison to global viral sequences showed that early transmission was most linked to cases from Europe. Our data are consistent with numerous seeds from multiple sources and a prolonged period of unrecognized community spreading. This work highlights the complementary role of genomic surveillance in addition to traditional epidemiological indicators.


Subject(s)
COVID-19 , Genome, Viral , Pandemics , Phylogeny , SARS-CoV-2/genetics , Whole Genome Sequencing , COVID-19/epidemiology , COVID-19/genetics , COVID-19/transmission , Female , Humans , Male , New York City
10.
medRxiv ; 2020 Aug 19.
Article in English | MEDLINE | ID: mdl-32511587

ABSTRACT

Effective public response to a pandemic relies upon accurate measurement of the extent and dynamics of an outbreak. Viral genome sequencing has emerged as a powerful approach to link seemingly unrelated cases, and large-scale sequencing surveillance can inform on critical epidemiological parameters. Here, we report the analysis of 864 SARS-CoV-2 sequences from cases in the New York City metropolitan area during the COVID-19 outbreak in Spring 2020. The majority of cases had no recent travel history or known exposure, and genetically linked cases were spread throughout the region. Comparison to global viral sequences showed that early transmission was most linked to cases from Europe. Our data are consistent with numerous seeds from multiple sources and a prolonged period of unrecognized community spreading. This work highlights the complementary role of genomic surveillance in addition to traditional epidemiological indicators.

11.
Cell Stem Cell ; 16(3): 275-88, 2015 Mar 05.
Article in English | MEDLINE | ID: mdl-25748933

ABSTRACT

The relationship between chromatin organization and transcriptional regulation is an area of intense investigation. We characterized the spatial relationships between alleles of the Oct4, Sox2, and Nanog genes in single cells during the earliest stages of mouse embryonic stem cell (ESC) differentiation and during embryonic development. We describe homologous pairing of the Oct4 alleles during ESC differentiation and embryogenesis, and we present evidence that pairing is correlated with the kinetics of ESC differentiation. Importantly, we identify critical DNA elements within the Oct4 promoter/enhancer region that mediate pairing of Oct4 alleles. Finally, we show that mutation of OCT4/SOX2 binding sites within this region abolishes inter-chromosomal interactions and affects accumulation of the repressive H3K9me2 modification at the Oct4 enhancer. Our findings demonstrate that chromatin organization and transcriptional programs are intimately connected in ESCs and that the dynamic positioning of the Oct4 alleles is associated with the transition from pluripotency to lineage specification.


Subject(s)
Alleles , Cell Differentiation/physiology , Embryonic Stem Cells/metabolism , Octamer Transcription Factor-3/metabolism , Pluripotent Stem Cells/metabolism , Response Elements/physiology , Animals , Embryonic Stem Cells/cytology , Homeodomain Proteins/genetics , Homeodomain Proteins/metabolism , Mice , Nanog Homeobox Protein , Octamer Transcription Factor-3/genetics , Pluripotent Stem Cells/cytology , SOXB1 Transcription Factors/genetics , SOXB1 Transcription Factors/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...