Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 2.428
Filter
1.
Sci Adv ; 10(21): eadj4452, 2024 May 24.
Article in English | MEDLINE | ID: mdl-38781344

ABSTRACT

Most genetic variants associated with psychiatric disorders are located in noncoding regions of the genome. To investigate their functional implications, we integrate epigenetic data from the PsychENCODE Consortium and other published sources to construct a comprehensive atlas of candidate brain cis-regulatory elements. Using deep learning, we model these elements' sequence syntax and predict how binding sites for lineage-specific transcription factors contribute to cell type-specific gene regulation in various types of glia and neurons. The elements' evolutionary history suggests that new regulatory information in the brain emerges primarily via smaller sequence mutations within conserved mammalian elements rather than entirely new human- or primate-specific sequences. However, primate-specific candidate elements, particularly those active during fetal brain development and in excitatory neurons and astrocytes, are implicated in the heritability of brain-related human traits. Additionally, we introduce PsychSCREEN, a web-based platform offering interactive visualization of PsychENCODE-generated genetic and epigenetic data from diverse brain cell types in individuals with psychiatric disorders and healthy controls.


Subject(s)
Brain , Epigenesis, Genetic , Regulatory Sequences, Nucleic Acid , Humans , Brain/metabolism , Regulatory Sequences, Nucleic Acid/genetics , Animals , Evolution, Molecular , Mental Disorders/genetics , Regulatory Elements, Transcriptional/genetics , Neurons/metabolism , Gene Expression Regulation , Transcription Factors/genetics , Transcription Factors/metabolism
2.
Science ; 384(6698): eadh4265, 2024 May 24.
Article in English | MEDLINE | ID: mdl-38781378

ABSTRACT

Nucleotide variants in cell type-specific gene regulatory elements in the human brain are risk factors for human disease. We measured chromatin accessibility in 1932 aliquots of sorted neurons and non-neurons from 616 human postmortem brains and identified 34,539 open chromatin regions with chromatin accessibility quantitative trait loci (caQTLs). Only 10.4% of caQTLs are shared between neurons and non-neurons, which supports cell type-specific genetic regulation of the brain regulome. Incorporating allele-specific chromatin accessibility improves statistical fine-mapping and refines molecular mechanisms that underlie disease risk. Using massively parallel reporter assays in induced excitatory neurons, we screened 19,893 brain QTLs and identified the functional impact of 476 regulatory variants. Combined, this comprehensive resource captures variation in the human brain regulome and provides insights into disease etiology.


Subject(s)
Brain Diseases , Brain , Chromatin , Gene Expression Regulation , Regulatory Elements, Transcriptional , Humans , Alleles , Brain/metabolism , Brain Diseases/genetics , Chromatin/metabolism , Neurons/metabolism , Quantitative Trait Loci , Male , Female
3.
Science ; 384(6698): eadh0559, 2024 May 24.
Article in English | MEDLINE | ID: mdl-38781390

ABSTRACT

Nucleotide changes in gene regulatory elements are important determinants of neuronal development and diseases. Using massively parallel reporter assays in primary human cells from mid-gestation cortex and cerebral organoids, we interrogated the cis-regulatory activity of 102,767 open chromatin regions, including thousands of sequences with cell type-specific accessibility and variants associated with brain gene regulation. In primary cells, we identified 46,802 active enhancer sequences and 164 variants that alter enhancer activity. Activity was comparable in organoids and primary cells, suggesting that organoids provide an adequate model for the developing cortex. Using deep learning we decoded the sequence basis and upstream regulators of enhancer activity. This work establishes a comprehensive catalog of functional gene regulatory elements and variants in human neuronal development.


Subject(s)
Cerebral Cortex , Neurogenesis , Organoids , Humans , Cerebral Cortex/embryology , Cerebral Cortex/metabolism , Chromatin/metabolism , Chromatin/genetics , Deep Learning , Enhancer Elements, Genetic , Gene Expression Regulation, Developmental , Neurogenesis/genetics , Neurons/metabolism , Organoids/metabolism , Regulatory Sequences, Nucleic Acid , Promoter Regions, Genetic , Regulatory Elements, Transcriptional
4.
Genome Res ; 34(4): 620-632, 2024 May 15.
Article in English | MEDLINE | ID: mdl-38631728

ABSTRACT

Differential gene expression in response to perturbations is mediated at least in part by changes in binding of transcription factors (TFs) and other proteins at specific genomic regions. Association of these cis-regulatory elements (CREs) with their target genes is a challenging task that is essential to address many biological and mechanistic questions. Many current approaches rely on chromatin conformation capture techniques or single-cell correlational methods to establish CRE-to-gene associations. These methods can be effective but have limitations, including resolution, gaps in detectable association distances, and cost. As an alternative, we have developed DegCre, a nonparametric method that evaluates correlations between measurements of perturbation-induced differential gene expression and differential regulatory signal at CREs to score possible CRE-to-gene associations. It has several unique features, including the ability to use any type of CRE activity measurement, yield probabilistic scores for CRE-to-gene pairs, and assess CRE-to-gene pairings across a wide range of sequence distances. We apply DegCre to six data sets, each using different perturbations and containing a variety of regulatory signal measurements, including chromatin openness, histone modifications, and TF occupancy. To test their efficacy, we compare DegCre associations to Hi-C loop calls and CRISPR-validated CRE-to-gene associations, establishing good performance by DegCre that is comparable or superior to competing methods. DegCre is a novel approach to the association of CREs to genes from a perturbation-differential perspective, with strengths that are complementary to existing approaches and allow for new insights into gene regulation.


Subject(s)
Chromatin , Transcription Factors , Humans , Transcription Factors/metabolism , Transcription Factors/genetics , Chromatin/metabolism , Chromatin/genetics , Gene Expression Regulation , Regulatory Sequences, Nucleic Acid , Regulatory Elements, Transcriptional
5.
Pac Symp Biocomput ; 29: 564-578, 2024.
Article in English | MEDLINE | ID: mdl-38160307

ABSTRACT

The problem of microdissection of heterogeneous tissue samples is of great interest for both fundamental biology and biomedical research. Until now, microdissection in the form of supervised deconvolution of mixed sequencing samples has been limited to assays measuring gene expression (RNA-seq) or chromatin accessibility (ATAC-seq). We present here the first attempt at solving the supervised deconvolution problem for run-on nascent sequencing data (GRO-seq and PRO-seq), a readout of active transcription. Then, we develop a novel filtering method suited to the mixed set of promoter and enhancer regions provided by nascent sequencing, and apply best-practice standards from the RNA-seq literature, using in-silico mixtures of cells. Using these methods, we find that enhancer RNAs are highly informative features for supervised deconvolution. In most cases, simple deconvolution methods perform better than more complex ones for solving the nascent deconvolution problem. Furthermore, undifferentiated cell types confound deconvolution of nascent sequencing data, likely as a consequence of transcriptional activity over the highly open chromatin regions of undifferentiated cell types. Our results suggest that while the problem of nascent deconvolution is generally tractable, stronger approaches integrating other sequencing protocols may be required to solve mixtures containing undifferentiated celltypes.


Subject(s)
Computational Biology , Regulatory Sequences, Nucleic Acid , Humans , Computational Biology/methods , Regulatory Elements, Transcriptional , Promoter Regions, Genetic , Chromatin/genetics , High-Throughput Nucleotide Sequencing
6.
Sci Rep ; 13(1): 19085, 2023 11 04.
Article in English | MEDLINE | ID: mdl-37925545

ABSTRACT

Enhancer RNAs (eRNAs) are non-coding RNAs produced by transcriptional enhancers that are highly correlated with their activity. Using a capped nascent RNA sequencing (PRO-cap) dataset in human lymphoblastoid cell lines across 67 individuals, we identified inter-individual variation in the expression of over 80 thousand transcribed transcriptional regulatory elements (tTREs), in both enhancers and promoters. Co-expression analysis of eRNAs from tTREs across individuals revealed how enhancers are associated with each other and with promoters. Mid- to long-range co-expression showed a distance-dependent decay that was modified by TF occupancy. In particular, we found a class of "bivalent" TFs, including Cohesin, that both facilitate and isolate the interaction between enhancers and/or promoters, depending on their topology. At short distances, we observed strand-specific correlations between nearby eRNAs in both convergent and divergent orientations. Our results support a cooperative model of convergent eRNAs, consistent with eRNAs facilitating adjacent enhancers rather than interfering with each other. Therefore, our approach to infer functional interactions from co-expression analyses provided novel insights into the principles of enhancer interactions as a function of distance, orientation, and binding landscapes of TFs.


Subject(s)
Enhancer Elements, Genetic , Transcription, Genetic , Humans , RNA/genetics , Regulatory Elements, Transcriptional , Promoter Regions, Genetic
7.
Genetics ; 225(2)2023 10 04.
Article in English | MEDLINE | ID: mdl-37551428

ABSTRACT

Transcriptional regulatory elements (TREs) are the primary nodes that control developmental gene regulatory networks. In embryo stages, larvae, and adult differentiated red spherule cells of the sea urchin Strongylocentrotus purpuratus, transcriptionally engaged TREs are detected by Precision Run-On Sequencing (PRO-seq), which maps genome-wide at base pair resolution the location of paused or elongating RNA polymerase II (Pol II). In parallel, TRE accessibility is estimated by the Assay for Transposase-Accessible Chromatin using Sequencing (ATAC-seq). Our analysis identifies surprisingly early and widespread TRE accessibility in 4-cell cleavage embryos that is not necessarily followed by concurrent or subsequent transcription. TRE transcriptional differences identified by PRO-seq provide more contrast among embryonic stages than ATAC-seq accessibility differences, in agreement with the apparent excess of accessible but inactive TREs during embryogenesis. Global TRE accessibility reaches a maximum around the 20-hour late blastula stage, which coincides with the consolidation of major embryo regionalizations and peak histone variant H2A.Z expression. A transcriptional potency model based on labile nucleosome TRE occupancy driven by DNA sequences and the prevalence of histone variants is proposed in order to explain the basal accessibility of transcriptionally inactive TREs during embryogenesis. However, our results would not reconcile well with labile nucleosome models based on simple A/T sequence enrichment. In addition, a large number of distal TREs become transcriptionally disengaged during developmental progression, in support of an early Pol II paused model for developmental gene regulation that eventually resolves in transcriptional activation or silencing. Thus, developmental potency in early embryos may be facilitated by incipient accessibility and transcriptional pause at TREs.


Subject(s)
Histones , Strongylocentrotus purpuratus , Animals , Histones/genetics , Strongylocentrotus purpuratus/genetics , Strongylocentrotus purpuratus/metabolism , Nucleosomes , RNA Polymerase II/genetics , RNA Polymerase II/metabolism , Chromatin/genetics , Sea Urchins/genetics , Sea Urchins/metabolism , Regulatory Elements, Transcriptional
8.
Science ; 380(6643): eabn7930, 2023 04 28.
Article in English | MEDLINE | ID: mdl-37104580

ABSTRACT

Understanding the regulatory landscape of the human genome is a long-standing objective of modern biology. Using the reference-free alignment across 241 mammalian genomes produced by the Zoonomia Consortium, we charted evolutionary trajectories for 0.92 million human candidate cis-regulatory elements (cCREs) and 15.6 million human transcription factor binding sites (TFBSs). We identified 439,461 cCREs and 2,024,062 TFBSs under evolutionary constraint. Genes near constrained elements perform fundamental cellular processes, whereas genes near primate-specific elements are involved in environmental interaction, including odor perception and immune response. About 20% of TFBSs are transposable element-derived and exhibit intricate patterns of gains and losses during primate evolution whereas sequence variants associated with complex traits are enriched in constrained TFBSs. Our annotations illuminate the regulatory functions of the human genome.


Subject(s)
Evolution, Molecular , Genome, Human , Mammals , Regulatory Elements, Transcriptional , Transcription Factors , Animals , Humans , Binding Sites , DNA Transposable Elements , Mammals/classification , Mammals/genetics , Primates/classification , Primates/genetics , Transcription Factors/genetics , Transcription Factors/metabolism , Phylogeny
9.
PLoS Genet ; 19(3): e1010468, 2023 03.
Article in English | MEDLINE | ID: mdl-36862752

ABSTRACT

The genomic sequence of the horse has been available since 2009, providing critical resources for discovering important genomic variants regarding both animal health and population structures. However, to fully understand the functional implications of these variants, detailed annotation of the horse genome is required. Due to the limited availability of functional data for the equine genome, as well as the technical limitations of short-read RNA-seq, existing annotation of the equine genome contains limited information about important aspects of gene regulation, such as alternate isoforms and regulatory elements, which are either not transcribed or transcribed at a very low level. To solve above problems, the Functional Annotation of the Animal Genomes (FAANG) project proposed a systemic approach to tissue collection, phenotyping, and data generation, adopting the blueprint laid out by the Encyclopedia of DNA Elements (ENCODE) project. Here we detail the first comprehensive overview of gene expression and regulation in the horse, presenting 39,625 novel transcripts, 84,613 candidate cis-regulatory elements (CRE) and their target genes, 332,115 open chromatin regions genome wide across a diverse set of tissues. We showed substantial concordance between chromatin accessibility, chromatin states in different genic features and gene expression. This comprehensive and expanded set of genomics resources will provide the equine research community ample opportunities for studies of complex traits in the horse.


Subject(s)
Genome , Horses , Transcriptome , Horses/genetics , Animals , Molecular Sequence Annotation , Organ Specificity , Chromatin , Regulatory Elements, Transcriptional , Transcription Initiation Site , Sequence Analysis, RNA , Gene Expression Regulation
10.
G3 (Bethesda) ; 13(4)2023 04 11.
Article in English | MEDLINE | ID: mdl-36753570

ABSTRACT

Transposable elements (TEs) are hypothesized to play important roles in shaping genome evolution following whole-genome duplications (WGDs), including rewiring of gene regulation. In a recent analysis, duplicate gene copies that had evolved higher expression in liver following the salmonid WGD ∼100 million years ago were associated with higher numbers of predicted TE-derived cis-regulatory elements (TE-CREs). Yet, the ability of these TE-CREs to recruit transcription factors (TFs) in vivo and impact gene expression remains unknown. Here, we evaluated the gene-regulatory functions of 11 TEs using luciferase promoter reporter assays in Atlantic salmon (Salmo salar) primary liver cells. Canonical Tc1-Mariner elements from intronic regions showed no or small repressive effects on transcription. However, other TE-CREs upstream of transcriptional start sites increased expression significantly. Our results question the hypothesis that TEs in the Tc1-Mariner superfamily, which were extremely active following WGD in salmonids, had a major impact on regulatory rewiring of gene duplicates, but highlights the potential of other TEs in post-WGD rewiring of gene regulation in the Atlantic salmon genome.


Subject(s)
Salmon , Animals , Salmon/genetics , Regulatory Elements, Transcriptional , Gene Expression Regulation , DNA Transposable Elements , Transcription, Genetic , Promoter Regions, Genetic
11.
J Biol Chem ; 299(4): 103060, 2023 04.
Article in English | MEDLINE | ID: mdl-36841482

ABSTRACT

The mitogen-activated protein kinase kinase kinase 18 (MAPKKK18) has been reported to play a role in abiotic stress priming in long-term abscisic acid (ABA) response including drought tolerance and leaf senescence. However, the upstream transcriptional regulators of MAPKKK18 remain to be determined. Here, we report ABA-responsive element binding factors (ABFs) as upstream transcription factors of MAPKKK18 expression. Mutants of abf2, abf3, abf4, and abf2abf3abf4 dramatically reduced the transcription of MAPKKK18. Our electrophoresis mobility shift assay and dual-luciferase reporter assay demonstrated that ABF2, ABF3, and ABF4 bound to ABA-responsive element cis-elements within the promoter of MAPKKK18 to transactivate its expression. Furthermore, enrichments of the promoter region of MAPKKK18 by ABF2, ABF3, and ABF4 were confirmed by in vivo chromatin immunoprecipitation coupled with quantitative PCR. In addition, we found that mutants of mapkkk18 exhibited obvious delayed leaf senescence. Moreover, a genetic study showed that overexpression of ABF2, ABF3, and ABF4 in the background of mapkkk18 mostly phenocopied the stay-green phenotype of mapkkk18 and, expression levels of five target genes of ABFs, that is, NYE1, NYE2, NYC1, PAO, and SAG29, were attenuated as a result of MAPKKK18 mutation. These findings demonstrate that ABF2, ABF3, and ABF4 act as transcription regulators of MAPKKK18 and also suggest that, at least in part, ABA acts in priming leaf senescence via ABF-induced expression of MAPKKK18.


Subject(s)
Abscisic Acid , Arabidopsis Proteins , Arabidopsis , Gene Expression Regulation, Plant , Plant Leaves , Plant Senescence , Regulatory Elements, Transcriptional , Abscisic Acid/metabolism , Abscisic Acid/pharmacology , Arabidopsis/metabolism , Arabidopsis Proteins/genetics , Arabidopsis Proteins/metabolism , Basic-Leucine Zipper Transcription Factors/metabolism , MAP Kinase Kinase Kinases/metabolism , Plant Senescence/genetics , Plant Senescence/physiology , Plants, Genetically Modified/metabolism , Transcription Factors/metabolism , Plant Leaves/genetics , Plant Leaves/physiology
12.
Genome Res ; 33(3): 314-331, 2023 03.
Article in English | MEDLINE | ID: mdl-36810156

ABSTRACT

Adipocytes contribute to metabolic disorders such as obesity, diabetes, and atherosclerosis. Prior characterizations of the transcriptional network driving adipogenesis have overlooked transiently acting transcription factors (TFs), genes, and regulatory elements that are essential for proper differentiation. Moreover, traditional gene regulatory networks provide neither mechanistic details about individual regulatory element-gene relationships nor temporal information needed to define a regulatory hierarchy that prioritizes key regulatory factors. To address these shortcomings, we integrate kinetic chromatin accessibility (ATAC-seq) and nascent transcription (PRO-seq) data to generate temporally resolved networks that describe TF binding events and resultant effects on target gene expression. Our data indicate which TF families cooperate with and antagonize each other to regulate adipogenesis. Compartment modeling of RNA polymerase density quantifies how individual TFs mechanistically contribute to distinct steps in transcription. The glucocorticoid receptor activates transcription by inducing RNA polymerase pause release, whereas SP and AP-1 factors affect RNA polymerase initiation. We identify Twist2 as a previously unappreciated effector of adipocyte differentiation. We find that TWIST2 acts as a negative regulator of 3T3-L1 and primary preadipocyte differentiation. We confirm that Twist2 knockout mice have compromised lipid storage within subcutaneous and brown adipose tissue. Previous phenotyping of Twist2 knockout mice and Setleis syndrome Twist2 -/- patients noted deficiencies in subcutaneous adipose tissue. This network inference framework is a powerful and general approach for interpreting complex biological phenomena and can be applied to a wide range of cellular processes.


Subject(s)
Adipocytes , Gene Regulatory Networks , Twist-Related Protein 1 , Animals , Mice , Cell Line , Adipocytes/cytology , Adipocytes/metabolism , Transcription Factors/metabolism , Adipogenesis , Transcription, Genetic , Regulatory Elements, Transcriptional , Twist-Related Protein 1/metabolism
13.
Cancer Discov ; 13(3): OF1, 2023 03 01.
Article in English | MEDLINE | ID: mdl-36607220

ABSTRACT

Recurrent repeat expansions were identified near known regulatory elements across seven cancer types.


Subject(s)
DNA Repeat Expansion , Neoplasms , Humans , Neoplasms/genetics , Regulatory Elements, Transcriptional
14.
Nature ; 613(7942): 96-102, 2023 01.
Article in English | MEDLINE | ID: mdl-36517591

ABSTRACT

Expansion of a single repetitive DNA sequence, termed a tandem repeat (TR), is known to cause more than 50 diseases1,2. However, repeat expansions are often not explored beyond neurological and neurodegenerative disorders. In some cancers, mutations accumulate in short tracts of TRs, a phenomenon termed microsatellite instability; however, larger repeat expansions have not been systematically analysed in cancer3-8. Here we identified TR expansions in 2,622 cancer genomes spanning 29 cancer types. In seven cancer types, we found 160 recurrent repeat expansions (rREs), most of which (155/160) were subtype specific. We found that rREs were non-uniformly distributed in the genome with enrichment near candidate cis-regulatory elements, suggesting a potential role in gene regulation. One rRE, a GAAA-repeat expansion, located near a regulatory element in the first intron of UGT2B7 was detected in 34% of renal cell carcinoma samples and was validated by long-read DNA sequencing. Moreover, in preliminary experiments, treating cells that harbour this rRE with a GAAA-targeting molecule led to a dose-dependent decrease in cell proliferation. Overall, our results suggest that rREs may be an important but unexplored source of genetic variation in human cancer, and we provide a comprehensive catalogue for further study.


Subject(s)
DNA Repeat Expansion , Genome, Human , Neoplasms , Humans , Base Sequence , DNA Repeat Expansion/genetics , Genome, Human/genetics , Neoplasms/classification , Neoplasms/genetics , Neoplasms/pathology , Sequence Analysis, DNA , Gene Expression Regulation , Regulatory Elements, Transcriptional/genetics , Introns/genetics , Carcinoma, Renal Cell/genetics , Carcinoma, Renal Cell/pathology , Cell Proliferation/drug effects , Reproducibility of Results
15.
Methods Mol Biol ; 2599: 215-226, 2023.
Article in English | MEDLINE | ID: mdl-36427152

ABSTRACT

Run-on and sequencing assays like GRO-seq, PRO-seq, and ChRO-seq allow for joint profiling of transcription activity of transcriptional regulatory elements (TREs), i.e., promoters and active enhancers, and target genes. Variation in biological conditions, such as treated vs. control, results in changes in the activity of transcription factors (TFs), which induces concerted changes in TREs and target genes. By modeling the differences between two biological conditions, we developed the computational pipeline known as tfTarget that predicts a set of putative TREs and target genes responding to each TF under the biological condition of interest. In this chapter, we demonstrate the use of the new web-based tfTarget in mapping transcription regulation using run-on sequencing data.


Subject(s)
Gene Expression Regulation , Regulatory Elements, Transcriptional , Transcription Factors/genetics , Promoter Regions, Genetic , Internet
16.
Cells ; 13(1)2023 12 19.
Article in English | MEDLINE | ID: mdl-38201209

ABSTRACT

Genetic predisposition to cardiac arrhythmias has been a field of intense investigation. Research initially focused on rare hereditary arrhythmias, but over the last two decades, the role of genetic variation (single nucleotide polymorphisms) in heart rate, rhythm, and arrhythmias has been taken into consideration as well. In particular, genome-wide association studies have identified hundreds of genomic loci associated with quantitative electrocardiographic traits, atrial fibrillation, and less common arrhythmias such as Brugada syndrome. A significant number of associated variants have been found to systematically localize in non-coding regulatory elements that control the tissue-specific and temporal transcription of genes encoding transcription factors, ion channels, and other proteins. However, the identification of causal variants and the mechanism underlying their impact on phenotype has proven difficult due to the complex tissue-specific, time-resolved, condition-dependent, and combinatorial function of regulatory elements, as well as their modest conservation across different model species. In this review, we discuss research efforts aimed at identifying and characterizing-trait-associated variant regulatory elements and the molecular mechanisms underlying their impact on heart rate or rhythm.


Subject(s)
Atrial Fibrillation , Brugada Syndrome , Humans , Genome-Wide Association Study , Regulatory Elements, Transcriptional , Atrial Fibrillation/genetics , Polymorphism, Single Nucleotide
17.
Sci Rep ; 12(1): 22410, 2022 12 27.
Article in English | MEDLINE | ID: mdl-36575359

ABSTRACT

Reprogramming Müller glia (MG) into functional cells is considered a promising therapeutic strategy to treat ocular diseases and vision loss. However, current AAV-based system for MG-tracing was reported to have high leakage in recent studies. Here, we focused on reducing the leakage of AAV-based labeling systems and found that different AAV serotypes showed a range of efficiency and specificity in labeling MG, leading us to optimize a human GFAP-Cre reporter system packaged in the AAV9 serotype with the woodchuck hepatitis virus post-transcriptional regulatory element (WPRE) removed. The leakage ratio of the AAV9-hGFAP-Cre-ΔWPRE decreased by an approximate 40-fold compared with the AAV9-hGFAP-Cre-WPRE labeling system. In addition, we validated the specificity of the AAV-ΔWPRE system for tracing MG reprogramming under Ptbp1-suppression and observed strict non-MG-conversion, similar to previous studies using genetic lineage tracking mouse models. Thus, the AAV9-hGFAP-Cre-ΔWPRE system showed high efficiency and specificity for MG labeling, providing a promising tool for tracing cell fate in vivo.


Subject(s)
Genetic Vectors , Neuroglia , Mice , Animals , Humans , Genetic Vectors/genetics , Regulatory Elements, Transcriptional , Serogroup , Dependovirus/genetics , Heterogeneous-Nuclear Ribonucleoproteins/genetics , Polypyrimidine Tract-Binding Protein
18.
BMC Genomics ; 23(1): 714, 2022 Oct 19.
Article in English | MEDLINE | ID: mdl-36261804

ABSTRACT

BACKGROUND: Mouse is probably the most important model organism to study mammal biology and human diseases. A better understanding of the mouse genome will help understand the human genome, biology and diseases. However, despite the recent progress, the characterization of the regulatory sequences in the mouse genome is still far from complete, limiting its use to understand the regulatory sequences in the human genome. RESULTS: Here, by integrating binding peaks in ~ 9,000 transcription factor (TF) ChIP-seq datasets that cover 79.9% of the mouse mappable genome using an efficient pipeline, we were able to partition these binding peak-covered genome regions into a cis-regulatory module (CRM) candidate (CRMC) set and a non-CRMC set. The CRMCs contain 912,197 putative CRMs and 38,554,729 TF binding sites (TFBSs) islands, covering 55.5% and 24.4% of the mappable genome, respectively. The CRMCs tend to be under strong evolutionary constraints, indicating that they are likely cis-regulatory; while the non-CRMCs are largely selectively neutral, indicating that they are unlikely cis-regulatory. Based on evolutionary profiles of the genome positions, we further estimated that 63.8% and 27.4% of the mouse genome might code for CRMs and TFBSs, respectively. CONCLUSIONS: Validation using experimental data suggests that at least most of the CRMCs are authentic. Thus, this unprecedentedly comprehensive map of CRMs and TFBSs can be a good resource to guide experimental studies of regulatory genomes in mice and humans.


Subject(s)
Genome, Human , Regulatory Elements, Transcriptional , Humans , Mice , Animals , Regulatory Elements, Transcriptional/genetics , Binding Sites/genetics , Protein Binding , Transcription Factors/genetics , Transcription Factors/metabolism , Mammals/genetics
19.
Database (Oxford) ; 20222022 04 22.
Article in English | MEDLINE | ID: mdl-35452518

ABSTRACT

More accurate and more complete predictions of cis-regulatory modules (CRMs) and constituent transcription factor (TF) binding sites (TFBSs) in genomes can facilitate characterizing functions of regulatory sequences. Here, we developed a database predicted cis-regulatory modules (PCRMS) (https://cci-bioinfo.uncc.edu) that stores highly accurate and unprecedentedly complete maps of predicted CRMs and TFBSs in the human and mouse genomes. The web interface allows the user to browse CRMs and TFBSs in an organism, find the closest CRMs to a gene, search CRMs around a gene and find all TFBSs of a TF. PCRMS can be a useful resource for the research community to characterize regulatory genomes. Database URL: https://cci-bioinfo.uncc.edu/.


Subject(s)
Regulatory Elements, Transcriptional , Transcription Factors , Animals , Binding Sites , Genome/genetics , Mice , Protein Binding , Regulatory Elements, Transcriptional/genetics , Transcription Factors/genetics , Transcription Factors/metabolism
20.
Development ; 149(7)2022 04 01.
Article in English | MEDLINE | ID: mdl-35394007

ABSTRACT

A long-standing biological question is how DNA cis-regulatory elements shape transcriptional patterns during metazoan development. Reporter constructs, cell culture assays and computational modeling have made major contributions to answering this question, but analysis of elements in their natural context is an important complement. Here, we mutate Notch-dependent LAG-1 binding sites (LBSs) in the endogenous Caenorhabditis elegans sygl-1 gene, which encodes a key stem cell regulator, and analyze the consequences on sygl-1 expression (nascent transcripts, mRNA, protein) and stem cell maintenance. Mutation of one LBS in a three-element cluster approximately halved both expression and stem cell pool size, whereas mutation of two LBSs essentially abolished them. Heterozygous LBS mutant clusters provided intermediate values. Our results lead to two major conclusions. First, both LBS number and configuration impact cluster activity: LBSs act additively in trans and synergistically in cis. Second, the SYGL-1 gradient promotes self-renewal above its functional threshold and triggers differentiation below the threshold. Our approach of coupling CRISPR/Cas9 LBS mutations with effects on both molecular and biological readouts establishes a powerful model for in vivo analyses of DNA cis-regulatory elements.


Subject(s)
Caenorhabditis elegans , Regulatory Elements, Transcriptional , Stem Cells , Animals , Caenorhabditis elegans/cytology , Caenorhabditis elegans/metabolism , Caenorhabditis elegans Proteins/genetics , Cell Self Renewal , DNA/metabolism , DNA-Binding Proteins/genetics , Receptors, Notch , Stem Cells/cytology
SELECTION OF CITATIONS
SEARCH DETAIL
...