Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 15 de 15
Filter
Add more filters










Publication year range
1.
Mol Ecol ; 33(9): e17339, 2024 May.
Article in English | MEDLINE | ID: mdl-38556927

ABSTRACT

Copy number variation is a common contributor to phenotypic diversity, yet its involvement in ecological adaptation is not easily discerned. Instances of parallelly evolving populations of the same species in a similar environment marked by strong selective pressures present opportunities to study the role of copy number variants (CNVs) in adaptation. By identifying CNVs that repeatedly occur in multiple populations of the derived ecotype and are not (or are rarely) present in the populations of the ancestral ecotype, the association of such CNVs with adaptation to the novel environment can be inferred. We used this paradigm to identify CNVs associated with recurrent adaptation of the Mexican tetra (Astyanax mexicanus) to cave environment. Using a read-depth approach, we detected CNVs from previously re-sequenced genomes of 44 individuals belonging to two ancestral surfaces and three derived cave populations. We identified 102 genes and 292 genomic regions that repeatedly diverge in copy number between the two ecotypes and occupy 0.8% of the reference genome. Functional analysis revealed their association with processes previously recognized to be relevant for adaptation, such as vision, immunity, oxygen consumption, metabolism, and neural function and we propose that these variants have been selected for in the cave or surface waters. The majority of the ecotype-divergent CNVs are multiallelic and display copy number increases in cavefish compared to surface fish. Our findings suggest that multiallelic CNVs - including gene duplications - and divergence in copy number provide a fast route to produce novel phenotypes associated with adaptation to subterranean life.


Subject(s)
Caves , Characidae , DNA Copy Number Variations , DNA Copy Number Variations/genetics , Animals , Characidae/genetics , Genetics, Population , Adaptation, Physiological/genetics , Ecotype , Mexico
2.
Front Genet ; 13: 1060898, 2022.
Article in English | MEDLINE | ID: mdl-36523759

ABSTRACT

The field of population genomics has seen a surge of studies on genomic structural variation over the past two decades. These studies witnessed that structural variation is taxonomically ubiquitous and represent a dominant form of genetic variation within species. Recent advances in technology, especially the development of long-read sequencing platforms, have enabled the discovery of structural variants (SVs) in previously inaccessible genomic regions which unlocked additional structural variation for population studies and revealed that more SVs contribute to evolution than previously perceived. An increasing number of studies suggest that SVs of all types and sizes may have a large effect on phenotype and consequently major impact on rapid adaptation, population divergence, and speciation. However, the functional effect of the vast majority of SVs is unknown and the field generally lacks evidence on the phenotypic consequences of most SVs that are suggested to have adaptive potential. Non-human genomes are heavily under-represented in population-scale studies of SVs. We argue that more research on other species is needed to objectively estimate the contribution of SVs to evolution. We discuss technical challenges associated with SV detection and outline the most recent advances towards more representative reference genomes, which opens a new era in population-scale studies of structural variation.

3.
Epigenetics Chromatin ; 14(1): 47, 2021 10 18.
Article in English | MEDLINE | ID: mdl-34663449

ABSTRACT

BACKGROUND: Trimethylation of histone H3 on lysine 9 (H3K9me3) at satellite DNA sequences has been primarily studied at (peri)centromeric regions, where its level shows differences associated with various processes such as development and malignant transformation. However, the dynamics of H3K9me3 at distal satellite DNA repeats has not been thoroughly investigated. RESULTS: We exploit the sets of publicly available data derived from chromatin immunoprecipitation combined with massively parallel DNA sequencing (ChIP-Seq), produced by the The Encyclopedia of DNA Elements (ENCODE) project, to analyze H3K9me3 at assembled satellite DNA repeats in genomes of human cell lines and during mouse fetal development. We show that annotated satellite elements are generally enriched for H3K9me3, but its level in cancer cell lines is on average lower than in normal cell lines. We find 407 satellite DNA instances with differential H3K9me3 enrichment between cancer and normal cells including a large 115-kb cluster of GSATII elements on chromosome 12. Differentially enriched regions are not limited to satellite DNA instances, but instead encompass a wider region of flanking sequences. We found no correlation between the levels of H3K9me3 and noncoding RNA at corresponding satellite DNA loci. The analysis of data derived from multiple tissues identified 864 instances of satellite DNA sequences in the mouse reference genome that are differentially enriched between fetal developmental stages. CONCLUSIONS: Our study reveals significant differences in H3K9me3 level at a subset of satellite repeats between biological states and as such contributes to understanding of the role of satellite DNA repeats in epigenetic regulation during development and carcinogenesis.


Subject(s)
DNA, Satellite , Epigenesis, Genetic , Animals , Cell Line , DNA, Satellite/genetics , Fetal Development , Histones/genetics , Histones/metabolism , Humans , Mice
4.
Genome Biol Evol ; 13(10)2021 10 01.
Article in English | MEDLINE | ID: mdl-34581786

ABSTRACT

Comparison of the androgen-binding protein (Abp) gene regions of six Mus genomes provides insights into the evolutionary history of this large murid rodent gene family. We identified 206 unique Abp sequences and mapped their physical relationships. At least 48 are duplicated and thus present in more than two identical copies. All six taxa have substantially elevated LINE1 densities in Abp regions compared with flanking regions, similar to levels in mouse and rat genomes, although nonallelic homologous recombination seems to have only occurred in Mus musculus domesticus. Phylogenetic and structural relationships support the hypothesis that the extensive Abp expansion began in an ancestor of the genus Mus. We also found duplicated Abpa27's in two taxa, suggesting that previously reported selection on a27 alleles may have actually detected selection on haplotypes wherein different paralogs were lost in each. Other studies reported that a27 gene and species trees were incongruent, likely because of homoplasy. However, L1MC3 phylogenies, supposed to be homoplasy-free compared with coding regions, support our paralog hypothesis because the L1MC3 phylogeny was congruent with the a27 topology. This paralog hypothesis provides an alternative explanation for the origin of the a27 gene that is suggested to be fixed in the three different subspecies of Mus musculus and to mediate sexual selection and incipient reinforcement between at least two of them. Finally, we ask why there are so many Abp genes, especially given the high frequency of pseudogenes and suggest that relaxed selection operates over a large part of the gene clusters.


Subject(s)
Androgen-Binding Protein , Evolution, Molecular , Alleles , Amino Acid Sequence , Androgen-Binding Protein/genetics , Animals , Mice , Muridae/genetics , Phylogeny , Rats
5.
Prog Mol Subcell Biol ; 60: 145-167, 2021.
Article in English | MEDLINE | ID: mdl-34386875

ABSTRACT

Satellite DNAs are tandemly repeated sequences organized in large clusters within (peri)centromeric and/or subtelomeric heterochromatin. However, in many species, satellite DNAs are not restricted to heterochromatin but are also dispersed as short arrays within euchromatin. Such genomic organization together with transcriptional activity seems to be a prerequisite for the gene-modulatory effect of satellite DNAs which was first demonstrated in the beetle Tribolium castaneum upon heat stress. Namely, enrichment of a silent histone mark at euchromatic repeats of a major beetle satellite DNA results in epigenetic silencing of neighboring genes. In addition, human satellite III transcripts induced by heat shock contribute to genome-wide gene silencing, providing protection against stress-induced cell death. Gene silencing mediated by satellite RNA was also shown to be fundamental for the early embryonic development of the mosquito Aedes aegypti. Apart from a physiological role during embryogenesis and heat stress response, activation of satellite DNAs in terms of transcription and proliferation can have an evolutionary impact. Spreading of satellite repeats throughout euchromatin promotes the variation of epigenetic landscapes and gene expression diversity, contributing to the evolution of gene regulatory networks and to genome adaptation in fluctuating environmental conditions.


Subject(s)
DNA, Satellite , Tribolium , Animals , DNA, Satellite/genetics , Euchromatin , Gene Expression Regulation , Heterochromatin , Humans , Tribolium/genetics
6.
Genome Biol Evol ; 12(11): 2125-2138, 2020 11 03.
Article in English | MEDLINE | ID: mdl-33078196

ABSTRACT

Major human alpha satellite DNA repeats are preferentially assembled within (peri)centromeric regions but are also dispersed within euchromatin in the form of clustered or short single repeat arrays. To study the evolutionary history of single euchromatic human alpha satellite repeats (ARs), we analyzed their orthologous loci across the primate genomes. The continuous insertion of euchromatic ARs throughout the evolutionary history of primates starting with the ancestors of Simiformes (45-60 Ma) and continuing up to the ancestors of Homo is revealed. Once inserted, the euchromatic ARs were stably transmitted to the descendant species, some exhibiting copy number variation, whereas their sequence divergence followed the species phylogeny. Many euchromatic ARs have sequence characteristics of (peri)centromeric alpha repeats suggesting heterochromatin as a source of dispersed euchromatic ARs. The majority of euchromatic ARs are inserted in the vicinity of other repetitive elements such as L1, Alu, and ERV or are embedded within them. Irrespective of the insertion context, each AR insertion seems to be unique and once inserted, ARs do not seem to be subsequently spread to new genomic locations. In spite of association with (retro)transposable elements, there is no indication that such elements play a role in ARs proliferation. The presence of short duplications at most of ARs insertion sites suggests site-directed recombination between homologous motifs in ARs and in the target genomic sequence, probably mediated by extrachromosomal circular DNA, as a mechanism of spreading within euchromatin.


Subject(s)
DNA, Satellite , Euchromatin , Evolution, Molecular , Genome, Human , Animals , Humans , Phylogeny , Primates/genetics , Synteny
7.
Genes (Basel) ; 11(6)2020 06 18.
Article in English | MEDLINE | ID: mdl-32570830

ABSTRACT

Satellite DNAs are tandemly repeated sequences preferentially assembled into large arrays within constitutive heterochromatin and their transcription is often activated by stress conditions, particularly by heat stress. Bioinformatic analyses of sequenced genomes however reveal single repeats or short arrays of satellite DNAs dispersed in the vicinity of genes within euchromatin. Here, we analyze transcription of a major human alpha satellite DNA upon heat stress and follow the dynamics of "silent" H3K9me3 and "active" H3K4me2/3 histone marks at dispersed euchromatic and tandemly arranged heterochromatic alpha repeats. The results show H3K9me3 enrichment at alpha repeats upon heat stress, which correlates with the dynamics of alpha satellite DNA transcription activation, while no change in H3K4me2/3 level is detected. Spreading of H3K9me3 up to 1-2 kb from the insertion sites of the euchromatic alpha repeats is detected, revealing the alpha repeats as modulators of local chromatin structure. In addition, expression of genes containing alpha repeats within introns as well as of genes closest to the intergenic alpha repeats is downregulated upon heat stress. Further studies are necessary to reveal the possible contribution of H3K9me3 enriched alpha repeats, in particular those located within introns, to the silencing of their associated genes.


Subject(s)
DNA, Satellite/genetics , Heat-Shock Response/genetics , Histones/genetics , Repetitive Sequences, Nucleic Acid/genetics , Euchromatin/genetics , Gene Silencing , Heterochromatin/genetics , Humans , Introns/genetics
8.
G3 (Bethesda) ; 8(3): 875-886, 2018 03 02.
Article in English | MEDLINE | ID: mdl-29311112

ABSTRACT

Satellite DNAs are tandemly repeated sequences clustered within heterochromatin. However, in some cases, such as the major TCAST1 satellite DNA from the beetle Tribolium castaneum, they are found partially dispersed within euchromatin. Such organization together with transcriptional activity enables TCAST1 to modulate the activity of neighboring genes. In order to explore if other T. castaneum repetitive families have features that could provide them with a possible gene-modulatory role, we compare here the structure, organization, dispersion profiles, and transcription activity of 10 distinct TCAST repetitive families including TCAST1. The genome organization of TCAST families exhibit either satellite-like or transposon-like characteristics. In addition to heterochromatin localization, bioinformatic searches of the assembled genome have revealed dispersion of all families within euchromatin, preferentially in the form of single repeats. Dispersed TCAST repeats are mutually correlated in distribution and are grouped in distinct regions of euchromatin. The repeats are associated with genes, are enriched in introns relative to intergenic regions, and very rarely overlap exons. In spite of the different mechanisms of repeat proliferation, such as transposition and homologous recombination, all TCAST families share a similar frequency of spreading as well as dispersion and gene association profiles. Additionally, TCAST families are transcribed and their transcription is significantly activated by heat stress. A possibility that such common features of TCAST families might be related to their potential gene-modulatory role is discussed.


Subject(s)
Euchromatin/genetics , Repetitive Sequences, Nucleic Acid , Tribolium/genetics , Animals , Chromosomes, Insect , DNA, Satellite , Databases, Genetic , Gene Dosage , Gene Expression Regulation , Genome, Insect , Genomics/methods , Phylogeny , Tandem Repeat Sequences , Transcription, Genetic , Transcriptional Activation
9.
Genome Biol Evol ; 9(6)2017 06 01.
Article in English | MEDLINE | ID: mdl-28575204

ABSTRACT

The Androgen-binding protein ( Abp ) gene region of the mouse genome contains 64 genes, some encoding pheromones that influence assortative mating between mice from different subspecies. Using CNVnator and quantitative PCR, we explored copy number variation in this gene family in natural populations of Mus musculus domesticus ( Mmd ) and Mus musculus musculus ( Mmm ), two subspecies of house mice that form a narrow hybrid zone in Central Europe. We found that copy number variation in the center of the Abp gene region is very common in wild Mmd , primarily representing the presence/absence of the final duplications described for the mouse genome. Clustering of Mmd individuals based on this variation did not reflect their geographical origin, suggesting no population divergence in the Abp gene cluster. However, copy number variation patterns differ substantially between Mmd and other mouse taxa. Large blocks of Abp genes are absent in Mmm , Mus musculus castaneus and an outgroup, Mus spretus , although with differences in variation and breakpoint locations. Our analysis calls into question the reliance on a reference genome for interpreting the detailed organization of genes in taxa more distant from the Mmd reference genome. The polymorphic nature of the gene family expansion in all four taxa suggests that the number of Abp genes, especially in the central gene region, is not critical to the survival and reproduction of the mouse. However, Abp haplotypes of variable length may serve as a source of raw genetic material for new signals influencing reproductive communication and thus speciation of mice.


Subject(s)
Androgen-Binding Protein/genetics , DNA Copy Number Variations , Genetic Speciation , Mice/classification , Mice/genetics , Animals , Base Sequence , Gene Duplication , Mice, Inbred C57BL , Mice, Inbred Strains , Polymerase Chain Reaction
10.
Sci Data ; 3: 160075, 2016 Sep 13.
Article in English | MEDLINE | ID: mdl-27622383

ABSTRACT

Wild populations of the house mouse (Mus musculus) represent the raw genetic material for the classical inbred strains in biomedical research and are a major model system for evolutionary biology. We provide whole genome sequencing data of individuals representing natural populations of M. m. domesticus (24 individuals from 3 populations), M. m. helgolandicus (3 individuals), M. m. musculus (22 individuals from 3 populations) and M. spretus (8 individuals from one population). We use a single pipeline to map and call variants for these individuals and also include 10 additional individuals of M. m. castaneus for which genomic data are publically available. In addition, RNAseq data were obtained from 10 tissues of up to eight adult individuals from each of the three M. m. domesticus populations for which genomic data were collected. Data and analyses are presented via tracks viewable in the UCSC or IGV genome browsers. We also provide information on available outbred stocks and instructions on how to keep them in the laboratory.


Subject(s)
Genome , Genomics , Animals , Biological Evolution , Mice
11.
Genome Res ; 25(8): 1114-24, 2015 Aug.
Article in English | MEDLINE | ID: mdl-26149421

ABSTRACT

Copy number variation represents a major source of genetic divergence, yet the evolutionary dynamics of genic copy number variation in natural populations during differentiation and adaptation remain unclear. We applied a read depth approach to genome resequencing data to detect copy number variants (CNVs) ≥1 kb in wild-caught mice belonging to four populations of Mus musculus domesticus. We complemented the bioinformatics analyses with experimental validation using droplet digital PCR. The specific focus of our analysis is CNVs that include complete genes, as these CNVs could be expected to contribute most directly to evolutionary divergence. In total, 1863 transcription units appear to be completely encompassed within CNVs in at least one individual when compared to the reference assembly. Further, 179 of these CNVs show population-specific copy number differences, and 325 are subject to complete deletion in multiple individuals. Among the most copy-number variable genes are three highly conserved genes that encode the splicing factor CWC22, the spindle protein SFI1, and the Holliday junction recognition protein HJURP. These genes exhibit population-specific expansion patterns that suggest involvement in local adaptations. We found that genes that overlap with large segmental duplications are generally more copy-number variable. These genes encode proteins that are relevant for environmental and behavioral interactions, such as vomeronasal and olfactory receptors, as well as major urinary proteins and several proteins of unknown function. The overall analysis shows that genic CNVs contribute more to population differentiation in mice than in humans and may promote and speed up population divergence.


Subject(s)
Cell Cycle Proteins/genetics , DNA Copy Number Variations , DNA-Binding Proteins/genetics , Mice/genetics , Nuclear Proteins/genetics , Adaptation, Biological , Animals , Cell Cycle Proteins/metabolism , Conserved Sequence , DNA-Binding Proteins/metabolism , Evolution, Molecular , Genetics, Population , Genome , Genomics/methods , Mice/classification , Nuclear Proteins/metabolism , RNA-Binding Proteins , Selection, Genetic
12.
RNA Biol ; 9(5): 587-95, 2012 May.
Article in English | MEDLINE | ID: mdl-22647527

ABSTRACT

Conversion of environmental signals into epigenetic information is thought to occur widely but has been poorly studied as yet. It is proposed that changes in the expression of molecules involved in chromatin modifications might play a role in this process. Here we study the expression of abundant satellite DNA TCAST that makes up 35% of genome of the red flour beetle Tribolium castaneum and is located within the constitutive pericentromeric heterochromatin. RNA polymerase II promotes the transcription of TCAST satellite DNA from both strands, and long primary transcripts are rapidly processed into 21-30 nt siRNAs. Expression of TCAST satellite DNA-associated siRNAs is developmentally regulated, the most intense being at specific stages of embryogenesis. Moreover, the expression is strongly induced following heat shock and is accompanied by increase in repressive epigenetic modifications of histones at TCAST regions. Upon recovery from heat stress, the expression of satellite DNA-associated siRNAs as well as histone modifications is quickly restored. Our results indicate that satellite DNA-associated siRNAs, transiently activated after heat shock, affect epigenetic state of constitutive heterochromatin in Tribolium. It can be hypothesized that transient remodeling of heterochromatin is part of a physiological gene expression program activated under stress conditions in insects.


Subject(s)
DNA, Satellite/genetics , Heat-Shock Response/genetics , RNA, Small Interfering/genetics , Tribolium/genetics , Animals , Base Sequence , Cell Nucleus/metabolism , Chromosome Mapping , Epigenesis, Genetic , Genes, Insect , Heterochromatin/genetics , Heterochromatin/metabolism , Histones/metabolism , Insect Proteins/metabolism , Molecular Sequence Data , RNA Processing, Post-Transcriptional , RNA, Small Interfering/metabolism , RNA, Small Interfering/physiology , Transcription Initiation Site , Tribolium/cytology , Tribolium/physiology
13.
Prog Mol Subcell Biol ; 51: 161-78, 2011.
Article in English | MEDLINE | ID: mdl-21287138

ABSTRACT

The very complex life cycle and extreme diversity of insect life forms require a carefully regulated network of biological processes to switch on and off the right genes at the right time. Chromatin condensation is an important regulatory mechanism of gene silencing as well as gene activation for the hundreds of functional protein genes harbored in heterochromatic regions of different insect species. Being the major heterochromatin constituents, satellite DNAs (satDNAs) serve important roles in heterochromatin regulation in insects in general. Their expression occurs in all developmental stages, being the highest during embryogenesis. satDNA transcripts range from small RNAs, corresponding in size to siRNAs, and piwiRNAs, to large, a few kb long RNAs. The long transcripts are preferentially nonpolyadenylated and remain in the nucleus. The actively regulated expression of satDNAs by cis or trans elements as well as by environmental stress, rather than constitutive transcription, speaks in favor of their involvement in differentiation, development, and environmental response.


Subject(s)
DNA, Satellite , Heterochromatin , Animals , Cell Nucleus/metabolism , Chromatin , Gene Silencing , Insecta/genetics , RNA, Small Interfering/genetics
14.
PLoS One ; 3(2): e1594, 2008 Feb 13.
Article in English | MEDLINE | ID: mdl-18270581

ABSTRACT

Transcripts of centromeric satellite DNAs are known to play a role in heterochromatin formation as well as in establishment of the kinetochore. However, little is known about basic mechanisms of satellite DNA expression within constitutive heterochromatin and its regulation. Here we present comprehensive analysis of transcription of abundant centromeric satellite DNA, PRAT from beetle Palorus ratzeburgii (Coleoptera). This satellite is characterized by preservation and extreme sequence conservation among evolutionarily distant insect species. PRAT is expressed in all three developmental stages: larvae, pupae and adults at similar level. Transcripts are abundant comprising 0.033% of total RNA and are heterogeneous in size ranging from 0.5 kb up to more than 5 kb. Transcription proceeds from both strands but with 10 fold different expression intensity and transcripts are not processed into siRNAs. Most of the transcripts (80%) are not polyadenylated and remain in the nucleus while a small portion is exported to the cytoplasm. Multiple, irregularly distributed transcription initiation sites as well as termination sites have been mapped within the PRAT sequence using primer extension and RLM-RACE. The presence of cap structure as well as poly(A) tails in a portion of the transcripts indicate RNA polymerase II-dependent transcription and a putative polymerase II promoter site overlaps the most conserved part of the PRAT sequence. The treatment of larvae with alpha-amanitin decreases the level of PRAT transcripts at concentrations that selectively inhibit pol II activity. In conclusion, stable, RNA polymerase II dependant transcripts of abundant centromeric satellite DNA, not regulated by RNAi, have been identified and characterized. This study offers a basic understanding of expression of highly abundant heterochromatic DNA which in beetle species constitutes up to 50% of the genome.


Subject(s)
Centromere/genetics , Coleoptera/genetics , DNA, Satellite/genetics , RNA Polymerase II/physiology , Transcription, Genetic , Animals , Codon, Terminator , Life Cycle Stages , RNA, Messenger/analysis , RNA, Messenger/genetics , Transcription Initiation Site
15.
Semin Cancer Biol ; 18(2): 123-30, 2008 Apr.
Article in English | MEDLINE | ID: mdl-18291669

ABSTRACT

Abnormal chromosome content known as aneuploidy is the most common characteristic of human solid tumours. The molecular roots of aneuploidy lie in defective centromere/kinetochore assembly and function leading to improper chromosome segregation. These defects can be caused by mutations and/or by altered expression of diverse kinetochore proteins. In addition to proteins, non-coding RNA deriving from centromeric repeats plays an active role, mostly through the RNAi pathway, in the formation of pericentromeric and centromeric heterochromatin, both of them important for proper centromere function. We propose that stoichiometric expression of major kinetochore components such as non-coding centromeric RNA and proteins is crucial for centromere/kinetochore assembly and function. Slight changes in expression of non-coding RNA or mutations in the RNA metabolic pathways induce chromosome instability, mis-segregation and aneuploidy, facilitating finally tumourigenesis.


Subject(s)
Aneuploidy , Heterochromatin/physiology , Neoplasms/genetics , RNA, Untranslated/physiology , Animals , Centromere/physiology , Chromosome Segregation/physiology , Genomic Instability/genetics , Heterochromatin/metabolism , Humans , Models, Biological
SELECTION OF CITATIONS
SEARCH DETAIL
...