Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
2.
Nucleic Acids Res ; 52(D1): D154-D163, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37971293

RESUMO

We present a major update of the HOCOMOCO collection that provides DNA binding specificity patterns of 949 human transcription factors and 720 mouse orthologs. To make this release, we performed motif discovery in peak sets that originated from 14 183 ChIP-Seq experiments and reads from 2554 HT-SELEX experiments yielding more than 400 thousand candidate motifs. The candidate motifs were annotated according to their similarity to known motifs and the hierarchy of DNA-binding domains of the respective transcription factors. Next, the motifs underwent human expert curation to stratify distinct motif subtypes and remove non-informative patterns and common artifacts. Finally, the curated subset of 100 thousand motifs was supplied to the automated benchmarking to select the best-performing motifs for each transcription factor. The resulting HOCOMOCO v12 core collection contains 1443 verified position weight matrices, including distinct subtypes of DNA binding motifs for particular transcription factors. In addition to the core collection, HOCOMOCO v12 provides motif sets optimized for the recognition of binding sites in vivo and in vitro, and for annotation of regulatory sequence variants. HOCOMOCO is available at https://hocomoco12.autosome.org and https://hocomoco.autosome.org.


Assuntos
Bases de Dados Genéticas , Regulação da Expressão Gênica , Domínios e Motivos de Interação entre Proteínas , Fatores de Transcrição , Animais , Humanos , Camundongos , Sítios de Ligação/genética , Motivos de Nucleotídeos , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Internet , Domínios e Motivos de Interação entre Proteínas/genética
3.
Sci Rep ; 13(1): 5238, 2023 03 31.
Artigo em Inglês | MEDLINE | ID: mdl-37002329

RESUMO

Thousands of RNA-binding proteins (RBPs) crosslink to cellular mRNA. Among these are numerous unconventional RBPs (ucRBPs)-proteins that associate with RNA but lack known RNA-binding domains (RBDs). The vast majority of ucRBPs have uncharacterized RNA-binding specificities. We analyzed 492 human ucRBPs for intrinsic RNA-binding in vitro and identified 23 that bind specific RNA sequences. Most (17/23), including 8 ribosomal proteins, were previously associated with RNA-related function. We identified the RBDs responsible for sequence-specific RNA-binding for several of these 23 ucRBPs and surveyed whether corresponding domains from homologous proteins also display RNA sequence specificity. CCHC-zf domains from seven human proteins recognized specific RNA motifs, indicating that this is a major class of RBD. For Nudix, HABP4, TPR, RanBP2-zf, and L7Ae domains, however, only isolated members or closely related homologs yielded motifs, consistent with RNA-binding as a derived function. The lack of sequence specificity for most ucRBPs is striking, and we suggest that many may function analogously to chromatin factors, which often crosslink efficiently to cellular DNA, presumably via indirect recruitment. Finally, we show that ucRBPs tend to be highly abundant proteins and suggest their identification in RNA interactome capture studies could also result from weak nonspecific interactions with RNA.


Assuntos
Proteínas de Ligação a RNA , RNA , Humanos , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo , RNA/metabolismo , Proteínas Ribossômicas/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Motivos de Ligação ao RNA/genética , Ligação Proteica , Fatores de Regulação Miogênica/metabolismo
4.
Nucleic Acids Res ; 50(19): e111, 2022 10 28.
Artigo em Inglês | MEDLINE | ID: mdl-36018788

RESUMO

Modelling both primary sequence and secondary structure preferences for RNA binding proteins (RBPs) remains an ongoing challenge. Current models use varied RNA structure representations and can be difficult to interpret and evaluate. To address these issues, we present a universal RNA motif-finding/scanning strategy, termed PRIESSTESS (Predictive RBP-RNA InterpretablE Sequence-Structure moTif regrESSion), that can be applied to diverse RNA binding datasets. PRIESSTESS identifies dozens of enriched RNA sequence and/or structure motifs that are subsequently reduced to a set of core motifs by logistic regression with LASSO regularization. Importantly, these core motifs are easily visualized and interpreted, and provide a measure of RBP secondary structure specificity. We used PRIESSTESS to interrogate new HTR-SELEX data for 23 RBPs with diverse RNA binding modes and captured known primary sequence and secondary structure preferences for each. Moreover, when applying PRIESSTESS to 144 RBPs across 202 RNA binding datasets, 75% showed an RNA secondary structure preference but only 10% had a preference besides unpaired bases, suggesting that most RBPs simply recognize the accessibility of primary sequences.


Assuntos
Algoritmos , Proteínas de Ligação a RNA , Sítios de Ligação , Proteínas de Ligação a RNA/metabolismo , Motivos de Nucleotídeos , RNA/química , Ligação Proteica
5.
Genes Dev ; 36(3-4): 225-240, 2022 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-35144965

RESUMO

The BEN domain is a recently recognized DNA binding module that is present in diverse metazoans and certain viruses. Several BEN domain factors are known as transcriptional repressors, but, overall, relatively little is known of how BEN factors identify their targets in humans. In particular, X-ray structures of BEN domain:DNA complexes are only known for Drosophila factors bearing a single BEN domain, which lack direct vertebrate orthologs. Here, we characterize several mammalian BEN domain (BD) factors, including from two NACC family BTB-BEN proteins and from BEND3, which has four BDs. In vitro selection data revealed sequence-specific binding activities of isolated BEN domains from all of these factors. We conducted detailed functional, genomic, and structural studies of BEND3. We show that BD4 is a major determinant for in vivo association and repression of endogenous BEND3 targets. We obtained a high-resolution structure of BEND3-BD4 bound to its preferred binding site, which reveals how BEND3 identifies cognate DNA targets and shows differences with one of its non-DNA-binding BEN domains (BD1). Finally, comparison with our previous invertebrate BEN structures, along with additional structural predictions using AlphaFold2 and RoseTTAFold, reveal distinct strategies for target DNA recognition by different types of BEN domain proteins. Together, these studies expand the DNA recognition activities of BEN factors and provide structural insights into sequence-specific DNA binding by mammalian BEN proteins.


Assuntos
Proteínas Repressoras , Fatores de Transcrição , Animais , Sítios de Ligação , Drosophila/metabolismo , Mamíferos , Ligação Proteica , Domínios Proteicos , Proteínas Repressoras/genética , Fatores de Transcrição/metabolismo
6.
Biochim Biophys Acta Gene Regul Mech ; 1864(11-12): 194765, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34673265

RESUMO

To control gene transcription, DNA-binding transcription factors recognise specific sequence motifs in gene regulatory regions. A complete and reliable GO annotation of all DNA-binding transcription factors is key to investigating the delicate balance of gene regulation in response to environmental and developmental stimuli. The need for such information is demonstrated by the many lists of transcription factors that have been produced over the past decade. The COST Action Gene Regulation Ensemble Effort for the Knowledge Commons (GREEKC) Consortium brought together experts in the field of transcription with the aim of providing high quality and interoperable gene regulatory data. The Gene Ontology (GO) Consortium provides strict definitions for gene product function, including factors that regulate transcription. The collaboration between the GREEKC and GO Consortia has enabled the application of those definitions to produce a new curated catalogue of over 1400 human DNA-binding transcription factors, that can be accessed at https://www.ebi.ac.uk/QuickGO/targetset/dbTF. This catalogue has facilitated an improvement in the GO annotation of human DNA-binding transcription factors and led to the GO annotation of almost sixty thousand DNA-binding transcription factors in over a hundred species. Thus, this work will aid researchers investigating the regulation of transcription in both biomedical and basic science.


Assuntos
DNA/metabolismo , Ontologia Genética , Anotação de Sequência Molecular , Fatores de Transcrição/classificação , Bases de Dados Genéticas , Humanos , Fatores de Transcrição/metabolismo
7.
Genome Res ; 30(7): 962-973, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32703884

RESUMO

RNA-binding proteins (RBPs) regulate RNA metabolism at multiple levels by affecting splicing of nascent transcripts, RNA folding, base modification, transport, localization, translation, and stability. Despite their central role in RNA function, the RNA-binding specificities of most RBPs remain unknown or incompletely defined. To address this, we have assembled a genome-scale collection of RBPs and their RNA-binding domains (RBDs) and assessed their specificities using high-throughput RNA-SELEX (HTR-SELEX). Approximately 70% of RBPs for which we obtained a motif bound to short linear sequences, whereas ∼30% preferred structured motifs folding into stem-loops. We also found that many RBPs can bind to multiple distinctly different motifs. Analysis of the matches of the motifs in human genomic sequences suggested novel roles for many RBPs. We found that three cytoplasmic proteins-ZC3H12A, ZC3H12B, and ZC3H12C-bound to motifs resembling the splice donor sequence, suggesting that these proteins are involved in degradation of cytoplasmic viral and/or unspliced transcripts. Structural analysis revealed that the RNA motif was not bound by the conventional C3H1 RNA-binding domain of ZC3H12B. Instead, the RNA motif was bound by the ZC3H12B's PilT N terminus (PIN) RNase domain, revealing a potential mechanism by which unconventional RBDs containing active sites or molecule-binding pockets could interact with short, structured RNA molecules. Our collection containing 145 high-resolution binding specificity models for 86 RBPs is the largest systematic resource for the analysis of human RBPs and will greatly facilitate future analysis of the various biological roles of this important class of proteins.


Assuntos
Proteínas de Ligação a RNA/química , Proteínas de Ligação a RNA/metabolismo , RNA/química , RNA/metabolismo , Sequência de Bases , Genoma Humano , Humanos , Conformação de Ácido Nucleico , Motivos de Nucleotídeos , Ligação Proteica , Domínios Proteicos , Multimerização Proteica , Ribonucleases/química , Ribonucleases/metabolismo , Técnica de Seleção de Aptâmeros
8.
Biochem J ; 477(7): 1345-1362, 2020 04 17.
Artigo em Inglês | MEDLINE | ID: mdl-32207815

RESUMO

We report the identification and characterization of a bacteriophage λ-encoded protein, NinH. Sequence homology suggests similarity between NinH and Fis, a bacterial nucleoid-associated protein (NAP) involved in numerous DNA topology manipulations, including chromosome condensation, transcriptional regulation and phage site-specific recombination. We find that NinH functions as a homodimer and is able to bind and bend double-stranded DNA in vitro. Furthermore, NinH shows a preference for a 15 bp signature sequence related to the degenerate consensus favored by Fis. Structural studies reinforced the proposed similarity to Fis and supported the identification of residues involved in DNA binding which were demonstrated experimentally. Overexpression of NinH proved toxic and this correlated with its capacity to associate with DNA. NinH is the first example of a phage-encoded Fis-like NAP that likely influences phage excision-integration reactions or bacterial gene expression.


Assuntos
Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Bacteriófago lambda/genética , Bacteriófago lambda/metabolismo , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Proteínas Virais/genética , Proteínas Virais/metabolismo , Proteínas de Bactérias/química , Sequência de Bases , Sítios de Ligação , Simulação por Computador , DNA/metabolismo , DNA Viral/metabolismo , Proteínas de Ligação a DNA/química , Escherichia coli/genética , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/genética , Fator Proteico para Inversão de Estimulação/química , Fator Proteico para Inversão de Estimulação/genética , Expressão Gênica , Proteínas Mutantes/metabolismo , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Multimerização Proteica/genética , Proteínas Virais/química
10.
Nat Biotechnol ; 36(6): 521-529, 2018 07.
Artigo em Inglês | MEDLINE | ID: mdl-29786094

RESUMO

No existing method to characterize transcription factor (TF) binding to DNA allows genome-wide measurement of all TF-binding activity in cells. Here we present a massively parallel protein activity assay, active TF identification (ATI), that measures the DNA-binding activity of all TFs in cell or tissue extracts. ATI is based on electrophoretic separation of protein-bound DNA sequences from a highly complex DNA library and subsequent mass-spectrometric identification of the DNA-bound proteins. We applied ATI to four mouse tissues and mouse embryonic stem cells and found that, in a given tissue or cell type, a small set of TFs, which bound to only ∼10 distinct motifs, displayed strong DNA-binding activity. Some of these TFs were found in all cell types, whereas others were specific TFs known to determine cell fate in the analyzed tissue or cell type. We also show that a small number of TFs determined the accessible chromatin landscape of a cell, suggesting that gene regulatory logic may be simpler than previously appreciated.


Assuntos
Cromatina/metabolismo , Fatores de Transcrição/metabolismo , Animais , Sequência de Bases , Sítios de Ligação/genética , Biotecnologia , Diferenciação Celular , Cromatina/genética , DNA/genética , DNA/metabolismo , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Camundongos , Células-Tronco Embrionárias Murinas/citologia , Células-Tronco Embrionárias Murinas/metabolismo , Ligação Proteica , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Schizosaccharomyces/genética , Schizosaccharomyces/metabolismo , Especificidade da Espécie , Distribuição Tecidual
11.
Elife ; 72018 04 11.
Artigo em Inglês | MEDLINE | ID: mdl-29638214

RESUMO

Most transcription factors (TFs) can bind to a population of sequences closely related to a single optimal site. However, some TFs can bind to two distinct sequences that represent two local optima in the Gibbs free energy of binding (ΔG). To determine the molecular mechanism behind this effect, we solved the structures of human HOXB13 and CDX2 bound to their two optimal DNA sequences, CAATAAA and TCGTAAA. Thermodynamic analyses by isothermal titration calorimetry revealed that both sites were bound with similar ΔG. However, the interaction with the CAA sequence was driven by change in enthalpy (ΔH), whereas the TCG site was bound with similar affinity due to smaller loss of entropy (ΔS). This thermodynamic mechanism that leads to at least two local optima likely affects many macromolecular interactions, as ΔG depends on two partially independent variables ΔH and ΔS according to the central equation of thermodynamics, ΔG = ΔH - TΔS.


Assuntos
Fator de Transcrição CDX2/metabolismo , DNA/metabolismo , Entropia , Proteínas de Homeodomínio/metabolismo , Termodinâmica , Fator de Transcrição CDX2/química , Fator de Transcrição CDX2/genética , DNA/química , DNA/genética , Proteínas de Homeodomínio/química , Proteínas de Homeodomínio/genética , Humanos , Modelos Moleculares , Conformação de Ácido Nucleico , Ligação Proteica , Conformação Proteica , Especificidade por Substrato
12.
Nucleic Acids Res ; 46(8): e44, 2018 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-29385521

RESUMO

In some dimeric cases of transcription factor (TF) binding, the specificity of dimeric motifs has been observed to differ notably from what would be expected were the two factors to bind to DNA independently of each other. Current motif discovery methods are unable to learn monomeric and dimeric motifs in modular fashion such that deviations from the expected motif would become explicit and the noise from dimeric occurrences would not corrupt monomeric models. We propose a novel modeling technique and an expectation maximization algorithm, implemented as software tool MODER, for discovering monomeric TF binding motifs and their dimeric combinations. Given training data and seeds for monomeric motifs, the algorithm learns in the same probabilistic framework a mixture model which represents monomeric motifs as standard position-specific probability matrices (PPMs), and dimeric motifs as pairs of monomeric PPMs, with associated orientation and spacing preferences. For dimers the model represents deviations from pure modular model of two independent monomers, thus making co-operative binding effects explicit. MODER can analyze in reasonable time tens of Mbps of training data. We validated the tool on HT-SELEX and ChIP-seq data. Our findings include some TFs whose expected model has palindromic symmetry but the observed model is directional.


Assuntos
DNA/química , DNA/metabolismo , Fatores de Transcrição/metabolismo , Algoritmos , Sequência de Bases , Sítios de Ligação , Imunoprecipitação da Cromatina , Biologia Computacional/métodos , Aprendizado de Máquina , Modelos Estatísticos , Motivos de Nucleotídeos , Probabilidade , Técnica de Seleção de Aptâmeros , Software
13.
Cell ; 172(4): 650-665, 2018 02 08.
Artigo em Inglês | MEDLINE | ID: mdl-29425488

RESUMO

Transcription factors (TFs) recognize specific DNA sequences to control chromatin and transcription, forming a complex system that guides expression of the genome. Despite keen interest in understanding how TFs control gene expression, it remains challenging to determine how the precise genomic binding sites of TFs are specified and how TF binding ultimately relates to regulation of transcription. This review considers how TFs are identified and functionally characterized, principally through the lens of a catalog of over 1,600 likely human TFs and binding motifs for two-thirds of them. Major classes of human TFs differ markedly in their evolutionary trajectories and expression patterns, underscoring distinct functions. TFs likewise underlie many different aspects of human physiology, disease, and variation, highlighting the importance of continued effort to understand TF-mediated gene regulation.


Assuntos
Evolução Molecular , Regulação da Expressão Gênica , Elementos de Resposta , Fatores de Transcrição , Motivos de Aminoácidos , Humanos , Fatores de Transcrição/química , Fatores de Transcrição/classificação , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo
14.
Science ; 356(6337)2017 05 05.
Artigo em Inglês | MEDLINE | ID: mdl-28473536

RESUMO

The majority of CpG dinucleotides in the human genome are methylated at cytosine bases. However, active gene regulatory elements are generally hypomethylated relative to their flanking regions, and the binding of some transcription factors (TFs) is diminished by methylation of their target sequences. By analysis of 542 human TFs with methylation-sensitive SELEX (systematic evolution of ligands by exponential enrichment), we found that there are also many TFs that prefer CpG-methylated sequences. Most of these are in the extended homeodomain family. Structural analysis showed that homeodomain specificity for methylcytosine depends on direct hydrophobic interactions with the methylcytosine 5-methyl group. This study provides a systematic examination of the effect of an epigenetic DNA modification on human TF binding specificity and reveals that many developmentally important proteins display preference for mCpG-containing sequences.


Assuntos
Citosina/química , Metilação de DNA , Fosfatos de Dinucleosídeos/química , Epigênese Genética , Fatores de Transcrição/química , Ilhas de CpG , DNA/química , Genoma Humano , Humanos , Ligação Proteica , Domínios Proteicos , Técnica de Seleção de Aptâmeros , Fatores de Transcrição/classificação
15.
Nature ; 544(7649): 245-249, 2017 04 13.
Artigo em Inglês | MEDLINE | ID: mdl-28379941

RESUMO

Normal differentiation and induced reprogramming require the activation of target cell programs and silencing of donor cell programs. In reprogramming, the same factors are often used to reprogram many different donor cell types. As most developmental repressors, such as RE1-silencing transcription factor (REST) and Groucho (also known as TLE), are considered lineage-specific repressors, it remains unclear how identical combinations of transcription factors can silence so many different donor programs. Distinct lineage repressors would have to be induced in different donor cell types. Here, by studying the reprogramming of mouse fibroblasts to neurons, we found that the pan neuron-specific transcription factor Myt1-like (Myt1l) exerts its pro-neuronal function by direct repression of many different somatic lineage programs except the neuronal program. The repressive function of Myt1l is mediated via recruitment of a complex containing Sin3b by binding to a previously uncharacterized N-terminal domain. In agreement with its repressive function, the genomic binding sites of Myt1l are similar in neurons and fibroblasts and are preferentially in an open chromatin configuration. The Notch signalling pathway is repressed by Myt1l through silencing of several members, including Hes1. Acute knockdown of Myt1l in the developing mouse brain mimicked a Notch gain-of-function phenotype, suggesting that Myt1l allows newborn neurons to escape Notch activation during normal development. Depletion of Myt1l in primary postmitotic neurons de-repressed non-neuronal programs and impaired neuronal gene expression and function, indicating that many somatic lineage programs are actively and persistently repressed by Myt1l to maintain neuronal identity. It is now tempting to speculate that similar 'many-but-one' lineage repressors exist for other cell fates; such repressors, in combination with lineage-specific activators, would be prime candidates for use in reprogramming additional cell types.


Assuntos
Linhagem da Célula/genética , Reprogramação Celular/genética , Inativação Gênica , Proteínas do Tecido Nervoso/metabolismo , Neurogênese/genética , Neurônios/citologia , Neurônios/metabolismo , Proteínas Repressoras/metabolismo , Fatores de Transcrição/metabolismo , Animais , Animais Recém-Nascidos , Encéfalo/citologia , Encéfalo/embriologia , Encéfalo/metabolismo , Células Cultivadas , Cromatina/genética , Cromatina/metabolismo , Fibroblastos/citologia , Fibroblastos/metabolismo , Humanos , Camundongos , Proteínas do Tecido Nervoso/deficiência , Especificidade de Órgãos/genética , Domínios Proteicos , Receptores Notch/deficiência , Proteínas Repressoras/química , Proteínas Repressoras/deficiência , Transdução de Sinais , Fatores de Transcrição HES-1/deficiência , Fatores de Transcrição/deficiência
16.
Mol Syst Biol ; 13(2): 910, 2017 02 06.
Artigo em Inglês | MEDLINE | ID: mdl-28167566

RESUMO

Transcription factors (TFs) achieve DNA-binding specificity through contacts with functional groups of bases (base readout) and readout of structural properties of the double helix (shape readout). Currently, it remains unclear whether DNA shape readout is utilized by only a few selected TF families, or whether this mechanism is used extensively by most TF families. We resequenced data from previously published HT-SELEX experiments, the most extensive mammalian TF-DNA binding data available to date. Using these data, we demonstrated the contributions of DNA shape readout across diverse TF families and its importance in core motif-flanking regions. Statistical machine-learning models combined with feature-selection techniques helped to reveal the nucleotide position-dependent DNA shape readout in TF-binding sites and the TF family-specific position dependence. Based on these results, we proposed novel DNA shape logos to visualize the DNA shape preferences of TFs. Overall, this work suggests a way of obtaining mechanistic insights into TF-DNA binding without relying on experimentally solved all-atom structures.


Assuntos
DNA/química , Análise de Sequência de DNA/métodos , Fatores de Transcrição/metabolismo , Animais , Sítios de Ligação , DNA/genética , DNA/metabolismo , Bases de Dados Genéticas , Humanos , Aprendizado de Máquina , Mamíferos/genética , Camundongos , Conformação de Ácido Nucleico , Fatores de Transcrição/genética
17.
Genome Res ; 26(12): 1742-1752, 2016 12.
Artigo em Inglês | MEDLINE | ID: mdl-27852650

RESUMO

C2H2 zinc finger proteins represent the largest and most enigmatic class of human transcription factors. Their C2H2-ZF arrays are highly variable, indicating that most will have unique DNA binding motifs. However, most of the binding motifs have not been directly determined. In addition, little is known about whether or how these proteins regulate transcription. Most of the ∼700 human C2H2-ZF proteins also contain at least one KRAB, SCAN, BTB, or SET domain, suggesting that they may have common interacting partners and/or effector functions. Here, we report a multifaceted functional analysis of 131 human C2H2-ZF proteins, encompassing DNA binding sites, interacting proteins, and transcriptional response to genetic perturbation. We confirm the expected diversity in DNA binding motifs and genomic binding sites, and provide motif models for 78 previously uncharacterized C2H2-ZF proteins, most of which are unique. Surprisingly, the diversity in protein-protein interactions is nearly as high as diversity in DNA binding motifs: Most C2H2-ZF proteins interact with a unique spectrum of co-activators and co-repressors. Thus, multiparameter diversification likely underlies the evolutionary success of this large class of human proteins.


Assuntos
DNA/metabolismo , Fatores de Transcrição/química , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Sítios de Ligação , Dedos de Zinco CYS2-HIS2 , Evolução Molecular , Regulação da Expressão Gênica , Células HEK293 , Humanos , Ligação Proteica , Mapas de Interação de Proteínas , Análise de Sequência de DNA , Análise de Sequência de RNA
18.
Nat Commun ; 6: 10050, 2015 Dec 03.
Artigo em Inglês | MEDLINE | ID: mdl-26632596

RESUMO

The mammalian cell cycle is controlled by the E2F family of transcription factors. Typical E2Fs bind to DNA as heterodimers with the related dimerization partner (DP) proteins, whereas the atypical E2Fs, E2F7 and E2F8 contain two DNA-binding domains (DBDs) and act as repressors. To understand the mechanism of repression, we have resolved the structure of E2F8 in complex with DNA at atomic resolution. We find that the first and second DBDs of E2F8 resemble the DBDs of typical E2F and DP proteins, respectively. Using molecular dynamics simulations, biochemical affinity measurements and chromatin immunoprecipitation, we further show that both atypical and typical E2Fs bind to similar DNA sequences in vitro and in vivo. Our results represent the first crystal structure of an E2F protein with two DBDs, and reveal the mechanism by which atypical E2Fs can repress canonical E2F target genes and exert their negative influence on cell cycle progression.


Assuntos
Proteínas de Ligação a DNA/química , DNA/metabolismo , Fatores de Transcrição E2F/química , Família Multigênica , Cristalografia por Raios X , DNA/genética , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Fatores de Transcrição E2F/genética , Fatores de Transcrição E2F/metabolismo , Humanos , Simulação de Dinâmica Molecular , Estrutura Terciária de Proteína , Especificidade da Espécie
19.
Nature ; 527(7578): 384-8, 2015 Nov 19.
Artigo em Inglês | MEDLINE | ID: mdl-26550823

RESUMO

Gene expression is regulated by transcription factors (TFs), proteins that recognize short DNA sequence motifs. Such sequences are very common in the human genome, and an important determinant of the specificity of gene expression is the cooperative binding of multiple TFs to closely located motifs. However, interactions between DNA-bound TFs have not been systematically characterized. To identify TF pairs that bind cooperatively to DNA, and to characterize their spacing and orientation preferences, we have performed consecutive affinity-purification systematic evolution of ligands by exponential enrichment (CAP-SELEX) analysis of 9,400 TF-TF-DNA interactions. This analysis revealed 315 TF-TF interactions recognizing 618 heterodimeric motifs, most of which have not been previously described. The observed cooperativity occurred promiscuously between TFs from diverse structural families. Structural analysis of the TF pairs, including a novel crystal structure of MEIS1 and DLX3 bound to their identified recognition site, revealed that the interactions between the TFs were predominantly mediated by DNA. Most TF pair sites identified involved a large overlap between individual TF recognition motifs, and resulted in recognition of composite sites that were markedly different from the individual TF's motifs. Together, our results indicate that the DNA molecule commonly plays an active role in cooperative interactions that define the gene regulatory lexicon.


Assuntos
DNA/genética , DNA/metabolismo , Especificidade por Substrato , Fatores de Transcrição/metabolismo , Sequência de Aminoácidos , Sequência de Bases , Sítios de Ligação/genética , Cristalografia por Raios X , Regulação da Expressão Gênica/genética , Humanos , Dados de Sequência Molecular , Motivos de Nucleotídeos/genética , Reprodutibilidade dos Testes , Especificidade por Substrato/genética
20.
Elife ; 42015 Mar 17.
Artigo em Inglês | MEDLINE | ID: mdl-25779349

RESUMO

Divergent morphology of species has largely been ascribed to genetic differences in the tissue-specific expression of proteins, which could be achieved by divergence in cis-regulatory elements or by altering the binding specificity of transcription factors (TFs). The relative importance of the latter has been difficult to assess, as previous systematic analyses of TF binding specificity have been performed using different methods in different species. To address this, we determined the binding specificities of 242 Drosophila TFs, and compared them to human and mouse data. This analysis revealed that TF binding specificities are highly conserved between Drosophila and mammals, and that for orthologous TFs, the similarity extends even to the level of very subtle dinucleotide binding preferences. The few human TFs with divergent specificities function in cell types not found in fruit flies, suggesting that evolution of TF specificities contributes to emergence of novel types of differentiated cells.


Assuntos
Evolução Biológica , Fatores de Transcrição/metabolismo , Sequência de Aminoácidos , Animais , Sítios de Ligação , Drosophila , Duplicação Gênica , Humanos , Camundongos , Filogenia , Técnica de Seleção de Aptâmeros , Homologia de Sequência de Aminoácidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...