Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 27
Filter
Add more filters










Publication year range
1.
bioRxiv ; 2023 Dec 02.
Article in English | MEDLINE | ID: mdl-38076840

ABSTRACT

Spermatogenesis is a unidirectional differentiation process that generates haploid sperm, but how the gene expression program that directs this process is established is largely unknown. Here we determine the high-resolution 3D chromatin architecture of male germ cells during spermatogenesis and show that CTCF-mediated 3D chromatin predetermines the gene expression program required for spermatogenesis. In undifferentiated spermatogonia, CTCF-mediated chromatin contacts on autosomes pre-establish meiosis-specific super-enhancers (SE). These meiotic SE recruit the master transcription factor A-MYB in meiotic spermatocytes, which strengthens their 3D contacts and instructs a burst of meiotic gene expression. We also find that at the mitosis-to-meiosis transition, the germline-specific Polycomb protein SCML2 resolves chromatin loops that are specific to mitotic spermatogonia. Moreover, SCML2 and A-MYB establish the unique 3D chromatin organization of sex chromosomes during meiotic sex chromosome inactivation. We propose that CTCF-mediated 3D chromatin organization enforces epigenetic priming that directs unidirectional differentiation, thereby determining the cellular identity of the male germline.

2.
Commun Biol ; 6(1): 1110, 2023 11 02.
Article in English | MEDLINE | ID: mdl-37919399

ABSTRACT

The noisy and high-dimensional nature of biological data has spawned advanced clustering algorithms that are tailored for specific biological datatypes. However, the performance of such methods varies greatly between datasets and they require post hoc tuning of cryptic hyperparameters. We present k minimal distance (KMD) clustering, a general-purpose method based on a generalization of single and average linkage hierarchical clustering. We introduce a generalized silhouette-like function to eliminate the cryptic hyperparameter k, and use sampling to enable application to million-object datasets. Rigorous comparisons to general and specialized clustering methods on simulated, mass cytometry and scRNA-seq datasets show consistent high performance of KMD clustering across all datasets.


Subject(s)
Algorithms , Sequence Analysis, RNA/methods , Cluster Analysis
3.
Nucleic Acids Res ; 51(3): 1103-1119, 2023 02 22.
Article in English | MEDLINE | ID: mdl-36629266

ABSTRACT

The Hi-C method has revolutionized the study of genome organization, yet interpretation of Hi-C interaction frequency maps remains a major challenge. Genomic compartments are a checkered Hi-C interaction pattern suggested to represent the partitioning of the genome into two self-interacting states associated with active and inactive chromatin. Based on a few elementary mechanistic assumptions, we derive a generative probabilistic model of genomic compartments, called deGeco. Testing our model, we find it can explain observed Hi-C interaction maps in a highly robust manner, allowing accurate inference of interaction probability maps from extremely sparse data without any training of parameters. Taking advantage of the interpretability of the model parameters, we then test hypotheses regarding the nature of genomic compartments. We find clear evidence of multiple states, and that these states self-interact with different affinities. We also find that the interaction rules of chromatin states differ considerably within and between chromosomes. Inspecting the molecular underpinnings of a four-state model, we show that a simple classifier can use histone marks to predict the underlying states with 87% accuracy. Finally, we observe instances of mixed-state loci and analyze these loci in single-cell Hi-C maps, finding that mixing of states occurs mainly at the cell level.


Subject(s)
Chromatin , Genome , Genomics/methods , Chromosomes , Probability
4.
Mol Cell ; 77(4): 688-708, 2020 02 20.
Article in English | MEDLINE | ID: mdl-32001106

ABSTRACT

Rapidly developing technologies have recently fueled an exciting era of discovery in the field of chromosome structure and nuclear organization. In addition to chromosome conformation capture (3C) methods, new alternative techniques have emerged to study genome architecture and biological processes in the nucleus, often in single or living cells. This sets an unprecedented stage for exploring the mechanisms that link chromosome structure and biological function. Here we review popular as well as emerging approaches to study chromosome organization, focusing on the contribution of complementary methodologies to our understanding of structures revealed by 3C methods and their biological implications, and discuss the next technical and conceptual frontiers.


Subject(s)
Chromosomes, Mammalian/chemistry , Animals , Cell Nucleus/genetics , DNA Repair , DNA Replication Timing , Genetic Techniques , Models, Genetic , Single-Cell Analysis , Transcription, Genetic
5.
Nat Struct Mol Biol ; 26(3): 175-184, 2019 03.
Article in English | MEDLINE | ID: mdl-30778237

ABSTRACT

Germ cells manifest a unique gene expression program and regain totipotency in the zygote. Here, we perform Hi-C analysis to examine 3D chromatin organization in male germ cells during spermatogenesis. We show that the highly compartmentalized 3D chromatin organization characteristic of interphase nuclei is attenuated in meiotic prophase. Meiotic prophase is predominated by short-range intrachromosomal interactions that represent a condensed form akin to that of mitotic chromosomes. However, unlike mitotic chromosomes, meiotic chromosomes display weak genomic compartmentalization, weak topologically associating domains, and localized point interactions in prophase. In postmeiotic round spermatids, genomic compartmentalization increases and gives rise to the strong compartmentalization seen in mature sperm. The X chromosome lacks domain organization during meiotic sex-chromosome inactivation. We propose that male meiosis occurs amid global reprogramming of 3D chromatin organization and that strengthening of chromatin compartmentalization takes place in spermiogenesis to prepare the next generation of life.


Subject(s)
Chromatin Assembly and Disassembly/physiology , Meiosis/physiology , Spermatids/growth & development , Spermatocytes/growth & development , Spermatogenesis/physiology , Animals , Chromatin/metabolism , Chromosomes/metabolism , Interphase/physiology , Male , Meiotic Prophase I/physiology , Mice , Mice, Inbred C57BL , Protein Domains/physiology
6.
Nature ; 563(7729): 121-125, 2018 11.
Article in English | MEDLINE | ID: mdl-30333624

ABSTRACT

Many evolutionarily distant pathogenic organisms have evolved similar survival strategies to evade the immune responses of their hosts. These include antigenic variation, through which an infecting organism prevents clearance by periodically altering the identity of proteins that are visible to the immune system of the host1. Antigenic variation requires large reservoirs of immunologically diverse antigen genes, which are often generated through homologous recombination, as well as mechanisms to ensure the expression of one or very few antigens at any given time. Both homologous recombination and gene expression are affected by three-dimensional genome architecture and local DNA accessibility2,3. Factors that link three-dimensional genome architecture, local chromatin conformation and antigenic variation have, to our knowledge, not yet been identified in any organism. One of the major obstacles to studying the role of genome architecture in antigenic variation has been the highly repetitive nature and heterozygosity of antigen-gene arrays, which has precluded complete genome assembly in many pathogens. Here we report the de novo haplotype-specific assembly and scaffolding of the long antigen-gene arrays of the model protozoan parasite Trypanosoma brucei, using long-read sequencing technology and conserved features of chromosome folding4. Genome-wide chromosome conformation capture (Hi-C) reveals a distinct partitioning of the genome, with antigen-encoding subtelomeric regions that are folded into distinct, highly compact compartments. In addition, we performed a range of analyses-Hi-C, fluorescence in situ hybridization, assays for transposase-accessible chromatin using sequencing and single-cell RNA sequencing-that showed that deletion of the histone variants H3.V and H4.V increases antigen-gene clustering, DNA accessibility across sites of antigen expression and switching of the expressed antigen isoform, via homologous recombination. Our analyses identify histone variants as a molecular link between global genome architecture, local chromatin conformation and antigenic variation.


Subject(s)
Antigenic Variation/genetics , Chromatin/genetics , Chromatin/metabolism , DNA, Protozoan/metabolism , Genome/genetics , Trypanosoma brucei brucei/genetics , Trypanosoma brucei brucei/immunology , DNA, Protozoan/genetics , Haplotypes/genetics , Histones/deficiency , Histones/genetics , Multigene Family/genetics , Protein Isoforms/biosynthesis , Protein Isoforms/genetics , Variant Surface Glycoproteins, Trypanosoma/biosynthesis , Variant Surface Glycoproteins, Trypanosoma/genetics
7.
Genome Res ; 28(10): 1455-1466, 2018 10.
Article in English | MEDLINE | ID: mdl-30166406

ABSTRACT

Mitosis encompasses key molecular changes including chromatin condensation, nuclear envelope breakdown, and reduced transcription levels. Immediately after mitosis, the interphase chromatin structure is reestablished and transcription resumes. The reestablishment of the interphase chromatin is probably achieved by "bookmarking," i.e., the retention of at least partial information during mitosis. To gain a deeper understanding of the contribution of histone modifications to the mitotic bookmarking process, we merged proteomics, immunofluorescence, and ChIP-seq approaches. We focused on key histone modifications and employed HeLa-S3 cells as a model system. Generally, in spite of the general hypoacetylation observed during mitosis, we observed a global concordance between the genomic organization of histone modifications in interphase and mitosis, suggesting that the epigenomic landscape may serve as a component of the mitotic bookmarking process. Next, we investigated the nucleosome that enters nucleosome depleted regions (NDRs) during mitosis. We observed that in ∼60% of the NDRs, the entering nucleosome is distinct from the surrounding highly acetylated nucleosomes and appears to have either low levels of acetylation or high levels of phosphorylation in adjacent residues (since adjacent phosphorylation may interfere with the ability to detect acetylation). Inhibition of histone deacetylases (HDACs) by the small molecule TSA reverts this pattern, suggesting that these nucleosomes are specifically deacetylated during mitosis. Altogether, by merging multiple approaches, our study provides evidence to support a model where histone modifications may play a role in mitotic bookmarking and uncovers new insights into the deposition of nucleosomes during mitosis.


Subject(s)
Histones/metabolism , Mitosis , Nucleosomes/genetics , Acetylation/drug effects , Chromatin Immunoprecipitation , HeLa Cells , Histone Code , Histone Deacetylase Inhibitors/pharmacology , Histone Deacetylases/metabolism , Humans , Nucleosomes/drug effects , Nucleosomes/metabolism , Phosphorylation , Proteomics
8.
Methods ; 142: 89-99, 2018 06 01.
Article in English | MEDLINE | ID: mdl-29684640

ABSTRACT

Assembly of reference-quality genomes from next-generation sequencing data is a key challenge in genomics. Recently, we and others have shown that Hi-C data can be used to address several outstanding challenges in the field of genome assembly. This principle has since been developed in academia and industry, and has been used in the assembly of several major genomes. In this paper, we explore the central principles underlying Hi-C-based assembly approaches, by quantitatively defining and characterizing three invariant Hi-C interaction patterns on which these approaches can build: Intrachromosomal interaction enrichment, distance-dependent interaction decay and local interaction smoothness. Specifically, we evaluate to what degree each invariant pattern holds on a single locus level in different species, cell types and Hi-C map resolutions. We find that these patterns are generally consistent across species and cell types but are affected by sequencing depth, and that matrix balancing improves consistency of loci with all three invariant patterns. Finally, we overview current Hi-C-based assembly approaches in light of these invariant patterns and demonstrate how local interaction smoothness can be used to easily detect scaffolding errors in extremely sparse Hi-C maps. We suggest that simultaneously considering all three invariant patterns may lead to better Hi-C-based genome assembly methods.


Subject(s)
Chromosome Mapping/methods , High-Throughput Nucleotide Sequencing/methods , Metagenomics/methods , Models, Genetic , Molecular Sequence Annotation/methods , Animals , Chromosome Mapping/instrumentation , DNA/chemistry , DNA/genetics , Genome/genetics , High-Throughput Nucleotide Sequencing/instrumentation , Humans , Imaging, Three-Dimensional/instrumentation , Imaging, Three-Dimensional/methods , Metagenomics/instrumentation , Models, Statistical , Molecular Imaging/instrumentation , Molecular Imaging/methods , Nucleic Acid Conformation , Sequence Analysis, DNA/instrumentation , Sequence Analysis, DNA/methods
9.
Nature ; 535(7613): 575-9, 2016 07 28.
Article in English | MEDLINE | ID: mdl-27437574

ABSTRACT

X-chromosome inactivation (XCI) involves major reorganization of the X chromosome as it becomes silent and heterochromatic. During female mammalian development, XCI is triggered by upregulation of the non-coding Xist RNA from one of the two X chromosomes. Xist coats the chromosome in cis and induces silencing of almost all genes via its A-repeat region, although some genes (constitutive escapees) avoid silencing in most cell types, and others (facultative escapees) escape XCI only in specific contexts. A role for Xist in organizing the inactive X (Xi) chromosome has been proposed. Recent chromosome conformation capture approaches have revealed global loss of local structure on the Xi chromosome and formation of large mega-domains, separated by a region containing the DXZ4 macrosatellite. However, the molecular architecture of the Xi chromosome, in both the silent and expressed regions,remains unclear. Here we investigate the structure, chromatin accessibility and expression status of the mouse Xi chromosome in highly polymorphic clonal neural progenitors (NPCs) and embryonic stem cells. We demonstrate a crucial role for Xist and the DXZ4-containing boundary in shaping Xi chromosome structure using allele-specific genome-wide chromosome conformation capture (Hi-C) analysis, an assay for transposase-accessible chromatin with high throughput sequencing (ATAC-seq) and RNA sequencing. Deletion of the boundary disrupts mega-domain formation, and induction of Xist RNA initiates formation of the boundary and the loss of DNA accessibility. We also show that in NPCs, the Xi chromosome lacks active/inactive compartments and topologically associating domains (TADs), except around genes that escape XCI. Escapee gene clusters display TAD-like structures and retain DNA accessibility at promoter-proximal and CTCF-binding sites. Furthermore, altered patterns of facultative escape genes indifferent neural progenitor clones are associated with the presence of different TAD-like structures after XCI. These findings suggest a key role for transcription and CTCF in the formation of TADs in the context of the Xi chromosome in neural progenitors.


Subject(s)
Chromosomes, Mammalian/metabolism , X Chromosome Inactivation , X Chromosome/metabolism , Alleles , Animals , Binding Sites , CCCTC-Binding Factor , Chromatin/chemistry , Chromatin/genetics , Chromatin/metabolism , Chromosomes, Mammalian/chemistry , Chromosomes, Mammalian/genetics , Embryonic Stem Cells/metabolism , Female , Gene Silencing , Male , Mice , Neural Stem Cells/metabolism , Promoter Regions, Genetic/genetics , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism , Repressor Proteins/metabolism , Sequence Analysis , Transcription, Genetic , X Chromosome/chemistry , X Chromosome/genetics , X Chromosome Inactivation/genetics
10.
Genome Res ; 25(11): 1727-38, 2015 Nov.
Article in English | MEDLINE | ID: mdl-26330564

ABSTRACT

A conserved hallmark of eukaryotic chromatin architecture is the distinctive array of well-positioned nucleosomes downstream from transcription start sites (TSS). Recent studies indicate that trans-acting factors establish this stereotypical array. Here, we present the first genome-wide in vitro and in vivo nucleosome maps for the ciliate Tetrahymena thermophila. In contrast with previous studies in yeast, we find that the stereotypical nucleosome array is preserved in the in vitro reconstituted map, which is governed only by the DNA sequence preferences of nucleosomes. Remarkably, this average in vitro pattern arises from the presence of subsets of nucleosomes, rather than the whole array, in individual Tetrahymena genes. Variation in GC content contributes to the positioning of these sequence-directed nucleosomes and affects codon usage and amino acid composition in genes. Given that the AT-rich Tetrahymena genome is intrinsically unfavorable for nucleosome formation, we propose that these "seed" nucleosomes--together with trans-acting factors--may facilitate the establishment of nucleosome arrays within genes in vivo, while minimizing changes to the underlying coding sequences.


Subject(s)
Genome, Protozoan , Nucleosomes/genetics , Open Reading Frames , Tetrahymena thermophila/genetics , Chromosome Mapping , DNA, Protozoan/genetics , Genetic Association Studies , Sequence Analysis, DNA , Transcription, Genetic
11.
Methods ; 72: 65-75, 2015 Jan 15.
Article in English | MEDLINE | ID: mdl-25448293

ABSTRACT

Over the last decade, development and application of a set of molecular genomic approaches based on the chromosome conformation capture method (3C), combined with increasingly powerful imaging approaches, have enabled high resolution and genome-wide analysis of the spatial organization of chromosomes. The aim of this paper is to provide guidelines for analyzing and interpreting data obtained with genome-wide 3C methods such as Hi-C and 3C-seq that rely on deep sequencing to detect and quantify pairwise chromatin interactions.


Subject(s)
Chromatin/metabolism , Genomics/methods , Nucleic Acid Conformation , Chromatin/chemistry , Chromosome Mapping/methods , Data Interpretation, Statistical , Datasets as Topic , Genome, Human , High-Throughput Nucleotide Sequencing , Humans , Molecular Conformation
12.
Nat Biotechnol ; 31(12): 1143-7, 2013 Dec.
Article in English | MEDLINE | ID: mdl-24270850

ABSTRACT

Despite advances in DNA sequencing technology, assembly of complex genomes remains a major challenge, particularly for genomes sequenced using short reads, which yield highly fragmented assemblies. Here we show that genome-wide in vivo chromatin interaction frequency data, which are measurable with chromosome conformation capture-based experiments, can be used as genomic distance proxies to accurately position individual contigs without requiring any sequence overlap. We also use these data to construct approximate genome scaffolds de novo. Applying our approach to incomplete regions of the human genome, we predict the positions of 65 previously unplaced contigs, in agreement with alternative methods in 26/31 cases attempted in common. Our approach can theoretically bridge any gap size and should be applicable to any species for which global chromatin interaction data can be generated.


Subject(s)
Algorithms , Contig Mapping/methods , DNA/genetics , Data Interpretation, Statistical , Gene Frequency/genetics , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods
13.
Cell Cycle ; 11(4): 658-67, 2012 Feb 15.
Article in English | MEDLINE | ID: mdl-22373526

ABSTRACT

We recently reported the identification and characterization of DNA replication origins (Oris) in metazoan cell lines. Here, we describe additional bioinformatic analyses showing that the previously identified GC-rich sequence elements form origin G-rich repeated elements (OGREs) that are present in 67% to 90% of the DNA replication origins from Drosophila to human cells, respectively. Our analyses also show that initiation of DNA synthesis takes place precisely at 160 bp (Drosophila) and 280 bp (mouse) from the OGRE. We also found that in most CpG islands, an OGRE is positioned in opposite orientation on each of the two DNA strands and detected two sites of initiation of DNA synthesis upstream or downstream of each OGRE. Conversely, Oris not associated with CpG islands have a single initiation site. OGRE density along chromosomes correlated with previously published replication timing data. Ori sequences centered on the OGRE are also predicted to have high intrinsic nucleosome occupancy. Finally, OGREs predict G-quadruplex structures at Oris that might be structural elements controlling the choice or activation of replication origins.


Subject(s)
DNA Replication/genetics , Replication Origin/genetics , Animals , CpG Islands/genetics , Drosophila , G-Quadruplexes , Humans , Mice , Models, Biological
14.
Genome Biol ; 11(11): 140, 2010.
Article in English | MEDLINE | ID: mdl-21118582

ABSTRACT

We propose definitions and procedures for comparing nucleosome maps and discuss current agreement and disagreement on the effect of histone sequence preferences on nucleosome organization in vivo.


Subject(s)
Histones/chemistry , Histones/genetics , Nucleosomes/chemistry , Nucleosomes/classification , Chromatin/metabolism , Models, Biological , Sequence Analysis, DNA , Terminology as Topic
16.
PLoS One ; 5(2): e9129, 2010 Feb 09.
Article in English | MEDLINE | ID: mdl-20161746

ABSTRACT

Active eukaryotic regulatory sites are characterized by open chromatin, and yeast promoters and transcription factor binding sites (TFBSs) typically have low intrinsic nucleosome occupancy. Here, we show that in contrast to yeast, DNA at human promoters, enhancers, and TFBSs generally encodes high intrinsic nucleosome occupancy. In most cases we examined, these elements also have high experimentally measured nucleosome occupancy in vivo. These regions typically have high G+C content, which correlates positively with intrinsic nucleosome occupancy, and are depleted for nucleosome-excluding poly-A sequences. We propose that high nucleosome preference is directly encoded at regulatory sequences in the human genome to restrict access to regulatory information that will ultimately be utilized in only a subset of differentiated cells.


Subject(s)
Nucleosomes/metabolism , Regulatory Sequences, Nucleic Acid/genetics , Transcription Factors/metabolism , Base Composition , Base Sequence , Binding Sites/genetics , CD4-Positive T-Lymphocytes/cytology , CD4-Positive T-Lymphocytes/metabolism , Cells, Cultured , CpG Islands/genetics , Enhancer Elements, Genetic/genetics , Fibroblasts/cytology , Fibroblasts/metabolism , HeLa Cells , Humans , Jurkat Cells , Promoter Regions, Genetic/genetics , Protein Binding
17.
Nat Genet ; 41(4): 438-45, 2009 Apr.
Article in English | MEDLINE | ID: mdl-19252487

ABSTRACT

Eukaryotic transcription occurs within a chromatin environment, whose organization has an important regulatory function and is partly encoded in cis by the DNA sequence itself. Here, we examine whether evolutionary changes in gene expression are linked to changes in the DNA-encoded nucleosome organization of promoters. We find that in aerobic yeast species, where cellular respiration genes are active under typical growth conditions, the promoter sequences of these genes encode a relatively open (nucleosome-depleted) chromatin organization. This nucleosome-depleted organization requires only DNA sequence information, is independent of any cofactors and of transcription, and is a general property of growth-related genes. In contrast, in anaerobic yeast species, where cellular respiration genes are relatively inactive under typical growth conditions, respiration gene promoters encode relatively closed (nucleosome-occupied) chromatin organizations. Our results suggest a previously unidentified genetic mechanism underlying phenotypic diversity, consisting of DNA sequence changes that directly alter the DNA-encoded nucleosome organization of promoters.


Subject(s)
DNA, Fungal/genetics , Gene Expression Regulation, Fungal , Genetic Variation , Nucleosomes/genetics , Yeasts/genetics , Candida albicans/genetics , Environment , Fungal Proteins/genetics , Models, Genetic , Nucleosomes/ultrastructure , Oligonucleotide Array Sequence Analysis , Promoter Regions, Genetic , Ribosomal Proteins/genetics , Saccharomyces cerevisiae/genetics
18.
Nature ; 458(7236): 362-6, 2009 Mar 19.
Article in English | MEDLINE | ID: mdl-19092803

ABSTRACT

Nucleosome organization is critical for gene regulation. In living cells this organization is determined by multiple factors, including the action of chromatin remodellers, competition with site-specific DNA-binding proteins, and the DNA sequence preferences of the nucleosomes themselves. However, it has been difficult to estimate the relative importance of each of these mechanisms in vivo, because in vivo nucleosome maps reflect the combined action of all influencing factors. Here we determine the importance of nucleosome DNA sequence preferences experimentally by measuring the genome-wide occupancy of nucleosomes assembled on purified yeast genomic DNA. The resulting map, in which nucleosome occupancy is governed only by the intrinsic sequence preferences of nucleosomes, is similar to in vivo nucleosome maps generated in three different growth conditions. In vitro, nucleosome depletion is evident at many transcription factor binding sites and around gene start and end sites, indicating that nucleosome depletion at these sites in vivo is partly encoded in the genome. We confirm these results with a micrococcal nuclease-independent experiment that measures the relative affinity of nucleosomes for approximately 40,000 double-stranded 150-base-pair oligonucleotides. Using our in vitro data, we devise a computational model of nucleosome sequence preferences that is significantly correlated with in vivo nucleosome occupancy in Caenorhabditis elegans. Our results indicate that the intrinsic DNA sequence preferences of nucleosomes have a central role in determining the organization of nucleosomes in vivo.


Subject(s)
Eukaryotic Cells/metabolism , Genome, Fungal/genetics , Nucleosomes/genetics , Saccharomyces cerevisiae/genetics , Animals , Base Sequence , Caenorhabditis elegans/genetics , Chickens , Computational Biology , Computer Simulation , Micrococcal Nuclease/metabolism , Nucleosomes/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , Saccharomyces cerevisiae/growth & development , Sequence Analysis, DNA , Transcription Factors/metabolism
19.
PLoS Comput Biol ; 4(11): e1000216, 2008 Nov.
Article in English | MEDLINE | ID: mdl-18989395

ABSTRACT

The detailed positions of nucleosomes profoundly impact gene regulation and are partly encoded by the genomic DNA sequence. However, less is known about the functional consequences of this encoding. Here, we address this question using a genome-wide map of approximately 380,000 yeast nucleosomes that we sequenced in their entirety. Utilizing the high resolution of our map, we refine our understanding of how nucleosome organizations are encoded by the DNA sequence and demonstrate that the genomic sequence is highly predictive of the in vivo nucleosome organization, even across new nucleosome-bound sequences that we isolated from fly and human. We find that Poly(dA:dT) tracts are an important component of these nucleosome positioning signals and that their nucleosome-disfavoring action results in large nucleosome depletion over them and over their flanking regions and enhances the accessibility of transcription factors to their cognate sites. Our results suggest that the yeast genome may utilize these nucleosome positioning signals to regulate gene expression with different transcriptional noise and activation kinetics and DNA replication with different origin efficiency. These distinct functions may be achieved by encoding both relatively closed (nucleosome-covered) chromatin organizations over some factor binding sites, where factors must compete with nucleosomes for DNA access, and relatively open (nucleosome-depleted) organizations over other factor sites, where factors bind without competition.


Subject(s)
DNA, Fungal/genetics , Locus Control Region , Nucleosomes/genetics , Saccharomyces cerevisiae/genetics , Transcription, Genetic/genetics , Animals , Base Sequence/genetics , Binding Sites/genetics , Chromatin Assembly and Disassembly/genetics , Drosophila melanogaster/genetics , Gene Expression Regulation, Fungal/genetics , HeLa Cells , Humans , Saccharomyces cerevisiae/metabolism , Transcription Factors/metabolism
20.
J Mol Biol ; 369(2): 553-66, 2007 Jun 01.
Article in English | MEDLINE | ID: mdl-17433819

ABSTRACT

Most animal toxins are short proteins that appear in venom and vary in sequence, structure and function. A common characteristic of many such toxins is their apparent structural stability. Sporadic instances of endogenous toxin-like proteins that function in non-venom context have been reported. We have utilized machine learning methodology, based on sequence-derived features and guided by the notion of structural stability, in order to conduct a large-scale search for toxin and toxin-like proteins. Application of the method to insect and mammalian sequences revealed novel families of toxin-like proteins. One of these proteins shows significant similarity to ion channel inhibitors that are expressed in cone snail and assassin bug venom, and is surprisingly expressed in the bee brain. A toxicity assay in which the protein was injected to fish induced a strong yet reversible paralytic effect. We suggest that the protein may function as an endogenous modulator of voltage-gated Ca(2+) channels. Additionally, we have identified a novel mammalian cluster of toxin-like proteins that are expressed in the testis. We suggest that these proteins might be involved in regulation of nicotinic acetylcholine receptors that affect the acrosome reaction and sperm motility. Finally, we highlight a possible evolutionary link between venom toxins and antibacterial proteins. We expect our methodology to enhance the discovery of additional novel protein families.


Subject(s)
Computer Simulation , Peptides/genetics , Toxins, Biological/chemistry , Toxins, Biological/genetics , Amino Acid Sequence , Animals , Antimicrobial Cationic Peptides/chemistry , Antimicrobial Cationic Peptides/genetics , Apamin/chemistry , Apamin/genetics , Base Sequence , Bees , Humans , Insect Proteins/chemistry , Insect Proteins/genetics , Insecta , Mice , Molecular Sequence Data , Neuropeptides/chemistry , Peptides/chemistry , Peptides/classification , Protein Conformation , Reproducibility of Results , Sequence Alignment , Toxins, Biological/classification
SELECTION OF CITATIONS
SEARCH DETAIL
...