Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 49
Filter
Add more filters










Publication year range
1.
Int J Mol Sci ; 24(3)2023 Feb 03.
Article in English | MEDLINE | ID: mdl-36769294

ABSTRACT

A large part of the genome is known to be transcribed as non-coding DNA including some tandem repeats (satellites) such as telomeric/centromeric satellites in different species. However, there has been no detailed study on the eventual transcription of the interspersed satellites found in many species. In the present paper, we studied the transcription of the abundant DNA satellites in the nematode Caenorhabditis elegans using available RNA-Seq results. We found that many of them have been transcribed, but usually in an irregular manner; different regions of a satellite have been transcribed with variable efficiency. Satellites with a similar repeat sequence also have a different transcription pattern depending on their position in the genome. We also describe the peculiar features of satellites associated with Helitron transposons in C. elegans. Our demonstration that some satellite RNAs are transcribed adds a new family of non-coding RNAs, a new element in the world of RNA interference, with new paths for the control of mRNA translation. This is a field that requires further investigation and will provide a deeper understanding of gene expression and control.


Subject(s)
Caenorhabditis elegans , DNA, Satellite , Animals , DNA, Satellite/genetics , Caenorhabditis elegans/genetics , Repetitive Sequences, Nucleic Acid , DNA , Centromere
2.
Genes (Basel) ; 12(11)2021 10 20.
Article in English | MEDLINE | ID: mdl-34828257

ABSTRACT

It has been shown in recent years that many repeated sequences in the genome are expressed as RNA transcripts, although the role of such RNAs is poorly understood. Some isolated and tandem repeats (satellites) have been found to be transcribed, such as mammalian Alu sequences and telomeric/centromeric satellites in different species. However, there is no detailed study on the eventual transcription of the interspersed satellites found in many species. Therefore, we decided to study for the first time the transcription of the abundant DNA satellites in the bacterium Bacillus coagulans and in the nematode Caenorhabditis elegans. We have updated the data for C. elegans satellites using the latest version of the genome. We analyzed the transcription of satellites in both species in available RNA-seq results and found that they are widely transcribed. Our demonstration that satellite RNAs are transcribed adds a new family of non-coding RNAs. This is a field that requires further investigation and will provide a deeper understanding of gene expression and control.


Subject(s)
Bacillus coagulans/genetics , Caenorhabditis elegans/genetics , DNA, Satellite/genetics , Animals , Bacteria/genetics , Eukaryota/genetics , Gene Expression Regulation , Genome/genetics , RNA, Untranslated/genetics , Repetitive Sequences, Nucleic Acid/genetics , Sequence Analysis, DNA , Sequence Analysis, RNA , Transcription, Genetic
3.
Int J Mol Sci ; 22(10)2021 May 20.
Article in English | MEDLINE | ID: mdl-34065296

ABSTRACT

Little is known about DNA tandem repeats across prokaryotes. We have recently described an enigmatic group of tandem repeats in bacterial genomes with a constant repeat size but variable sequence. These findings strongly suggest that tandem repeat size in some bacteria is under strong selective constraints. Here, we extend these studies and describe tandem repeats in a large set of Bacillus. Some species have very few repeats, while other species have a large number. Most tandem repeats have repeats with a constant size (either 52 or 20-21 nt), but a variable sequence. We characterize in detail these intriguing tandem repeats. Individual species have several families of tandem repeats with the same repeat length and different sequence. This result is in strong contrast with eukaryotes, where tandem repeats of many sizes are found in any species. We discuss the possibility that they are transcribed as small RNA molecules. They may also be involved in the stabilization of the nucleoid through interaction with proteins. We also show that the distribution of tandem repeats in different species has a taxonomic significance. The data we present for all tandem repeats and their families in these bacterial species will be useful for further genomic studies.


Subject(s)
Bacillus/genetics , Tandem Repeat Sequences/genetics , Bacteria/genetics , Eukaryota/genetics , Genome, Bacterial/genetics , Genomics/methods , Prokaryotic Cells/physiology , Species Specificity
4.
Biopolymers ; 112(4): e23397, 2021 Apr.
Article in English | MEDLINE | ID: mdl-32898299

ABSTRACT

We present a new structure of a DNA dodecamer obtained in the presence of Ni2+ ions. The DNA forms Ni-guanine cross-links between neighboring molecules. Our results show that an adequate dosage of Ni2+ may help to form well-defined DNA nanostructures. We also compare our structure with other dodecamers which present unique features and also crystallize in trigonal unit cells, strongly influenced by the counterions associated with DNA. In all cases, the DNA duplexes form parallel pseudo-helical columns in the crystal, similar to DNA-protamine and native DNA fibers.


Subject(s)
DNA/chemistry , Guanine/chemistry , Nickel/chemistry , Cations, Divalent , Crystallography, X-Ray , Humans , Models, Molecular , Nucleic Acid Conformation
5.
J Bacteriol ; 202(21)2020 10 08.
Article in English | MEDLINE | ID: mdl-32839174

ABSTRACT

DNA tandem repeats, or satellites, are well described in eukaryotic species, but little is known about their prevalence across prokaryotes. Here, we performed the most complete characterization to date of satellites in bacteria. We identified 121,638 satellites from 12,233 fully sequenced and assembled bacterial genomes with a very uneven distribution. We also determined the families of satellites which have a related sequence. There are 85 genomes that are particularly satellite rich and contain several families of satellites of yet unknown function. Interestingly, we only found two main types of noncoding satellites, depending on their repeat sizes, 22/44 or 52 nucleotides (nt). An intriguing feature is the constant size of the repeats in the genomes of different species, whereas their sequences show no conservation. Individual species also have several families of satellites with the same repeat length and different sequences. This result is in marked contrast with previous findings in eukaryotes, where noncoding satellites of many sizes are found in any species investigated. We describe in greater detail these noncoding satellites in the spirochete Leptospira interrogans and in several bacilli. These satellites undoubtedly play a specific role in the species which have acquired them. We discuss the possibility that they represent binding sites for transcription factors not previously described or that they are involved in the stabilization of the nucleoid through interaction with proteins.IMPORTANCE We found an enigmatic group of noncoding satellites in 85 bacterial genomes with a constant repeat size but variable sequence. This pattern of DNA organization is unique and had not been previously described in bacteria. These findings strongly suggest that satellite size in some bacteria is under strong selective constraints and thus that satellites are very likely to play a fundamental role. We also provide a list and properties of all satellites in 12,233 genomes, which may be used for further genomic analysis.


Subject(s)
Bacteria/genetics , DNA, Bacterial/genetics , DNA, Satellite/genetics , Genome, Bacterial , Leptospira interrogans/genetics , Databases, Genetic , Genomics
6.
BMC Evol Biol ; 19(1): 181, 2019 09 18.
Article in English | MEDLINE | ID: mdl-31533616

ABSTRACT

BACKGROUND: Satellites or tandem repeats are very abundant in many eukaryotic genomes. Occasionally they have been reported to be present in some prokaryotes, but to our knowledge there is no general comparative study on their occurrence. For this reason we present here an overview of the distribution and properties of satellites in a set of representative species. Our results provide novel insights into the evolutionary relationship between eukaryotes, Archaea and Bacteria. RESULTS: We have searched all possible satellites present in the NCBI reference group of genomes in Archaea (142 species) and in Bacteria (119 species), detecting 2735 satellites in Archaea and 1067 in Bacteria. We have found that the distribution of satellites is very variable in different organisms. The archaeal Methanosarcina class stands out for the large amount of satellites in their genomes. Satellites from a few species have similar characteristics to those in eukaryotes, but most species have very few satellites: only 21 species in Archaea and 18 in Bacteria have more than 4 satellites/Mb. The distribution of satellites in these species is reminiscent of what is found in eukaryotes, but we find two significant differences: most satellites have a short length and many of them correspond to segments of genes coding for amino acid repeats. Transposition of non-coding satellites throughout the genome occurs rarely: only in the bacteria Leptospira interrogans and the archaea Methanocella conradii we have detected satellite families of transposed satellites with long repeats. CONCLUSIONS: Our results demonstrate that the presence of satellites in the genome is not an exclusive feature of eukaryotes. We have described a few prokaryotes which do contain satellites. We present a discussion on their eventual evolutionary significance.


Subject(s)
DNA, Satellite/genetics , Prokaryotic Cells/metabolism , Archaea/genetics , Bacteria/genetics , Microsatellite Repeats/genetics , Phylogeny
7.
Genes (Basel) ; 9(10)2018 Oct 16.
Article in English | MEDLINE | ID: mdl-30332836

ABSTRACT

Repetitive genome regions have been difficult to sequence, mainly because of the comparatively small size of the fragments used in assembly. Satellites or tandem repeats are very abundant in nematodes and offer an excellent playground to evaluate different assembly methods. Here, we compare the structure of satellites found in three different assemblies of the Caenorhabditis elegans genome: the original sequence obtained by Sanger sequencing, an assembly based on PacBio technology, and an assembly using Nanopore sequencing reads. In general, satellites were found in equivalent genomic regions, but the new long-read methods (PacBio and Nanopore) tended to result in longer assembled satellites. Important differences exist between the assemblies resulting from the two long-read technologies, such as the sizes of long satellites. Our results also suggest that the lengths of some annotated genes with internal repeats which were assembled using Sanger sequencing are likely to be incorrect.

8.
Genes (Basel) ; 8(12)2017 Nov 28.
Article in English | MEDLINE | ID: mdl-29182550

ABSTRACT

The availability of the genome sequence of the unisexual (male-female) Caenorhabditis nigoni offers an opportunity to compare its non-coding features with the related hermaphroditic species Caenorhabditis briggsae; to understand the evolutionary dynamics of their tandem repeat sequences (satellites), as a result of evolution from the unisexual ancestor. We take advantage of the previously developed SATFIND program to build satellite families defined by a consensus sequence. The relative number of satellites (satellites/Mb) in C. nigoni is 24.6% larger than in C. briggsae. Some satellites in C. nigoni have developed from a proto-repeat present in the ancestor species and are conserved as an isolated sequence in C. briggsae. We also identify unique satellites which occur only once and joint satellite families with a related sequence in both species. Some of these families are only found in C. nigoni, which indicates a recent appearance; they contain conserved adjacent 5' and 3' regions, which may favor transposition. Our results show that the number, length and turnover of satellites are restricted in the hermaphrodite C. briggsae when compared with the unisexual C. nigoni. We hypothesize that this results from differences in unequal recombination during meiotic chromosome pairing, which limits satellite turnover in hermaphrodites.

9.
BMC Evol Biol ; 15: 218, 2015 Oct 05.
Article in English | MEDLINE | ID: mdl-26438045

ABSTRACT

BACKGROUND: The high density of tandem repeat sequences (satellites) in nematode genomes and the availability of genome sequences from several species in the group offer a unique opportunity to better understand the evolutionary dynamics and the functional role of these sequences. We take advantage of the previously developed SATFIND program to study the satellites in four Caenorhabditis species and investigate these questions. METHODS: The identification and comparison of satellites is carried out in three steps. First we find all the satellites present in each species with the SATFIND program. Each satellite is defined by its length, number of repeats, and repeat sequence. Only satellites with at least ten repeats are considered. In the second step we build satellite families with a newly developed alignment program. Satellite families are defined by a consensus sequence and the number of satellites in the family. Finally we compare the consensus sequence of satellite families in different species. RESULTS: We give a catalog of individual satellites in each species. We have also identified satellite families with a related sequence and compare them in different species. We analyze the turnover of satellites: they increased in size through duplications of fragments of 100-300 bases. It appears that in many cases they have undergone an explosive expansion. In C. elegans we have identified a subset of large satellites that have strong affinity for the centromere protein CENP-A. We have also compared our results with those obtained from other species, including one nematode and three mammals. CONCLUSIONS: Most satellite families found in Caenorhabditis are species-specific; in particular those with long repeats. A subset of these satellites may facilitate the formation of kinetochores in mitosis. Other satellite families in C. elegans are either related to Helitron transposons or to meiotic pairing centers.


Subject(s)
Caenorhabditis/classification , Caenorhabditis/genetics , DNA, Helminth/genetics , Animals , Autoantigens/genetics , Biological Evolution , Caenorhabditis elegans/genetics , Centromere , Centromere Protein A , Chromosomal Proteins, Non-Histone/genetics , DNA, Satellite/genetics , Repetitive Sequences, Nucleic Acid , Species Specificity
10.
PLoS One ; 10(3): e0120241, 2015.
Article in English | MEDLINE | ID: mdl-25781995

ABSTRACT

The traditional Watson-Crick base pairs in DNA may occasionally adopt a Hoogsteen conformation, with a different organization of hydrogen bonds. Previous crystal structures have shown that the Hoogsteen conformation is favored in alternating AT sequences of DNA. Here we present new data for a different sequence, d(ATTAAT)2, which is also found in the Hoogsteen conformation. Thus we demonstrate that other all-AT sequences of DNA with a different sequence may be found in the Hoogsteen conformation. We conclude that any all-AT sequence might acquire this conformation under appropriate conditions. We also compare the detailed features of DNA in either the Hoogsteen or Watson-Crick conformations.


Subject(s)
Base Pairing , DNA/chemistry , Hydrogen Bonding
11.
Biopolymers ; 103(3): 123-33, 2015 Mar.
Article in English | MEDLINE | ID: mdl-25257185

ABSTRACT

In this work, we explore the influence of different solvents and ions on the crystallization behavior of an all-AT dodecamer d(AATAAATTTATT)2 In all cases, the oligonucleotides are found as continuous columns of stacked duplexes. The spatial organization of such columns is variable; consequently we have obtained seven different crystal forms. The duplexes can be made to crystallize in either parallel or crossed columns. Such versatility in the formation of a variety of crystal forms is characteristic for this sequence. It had not been previously reported for any other sequence. In all cases, the oligonucleotide duplexes have been found to crystallize in the B form. The crystallization conditions determine the organization of the crystal, although no clear local interactions have been detected. Mg(2+) and Ni(2+) can be used in order to obtain compact crossed structures. DNA-DNA interactions in the crystals of our all-AT duplexes present crossovers which are different from those previously reported for mixed sequence oligonucleotides. Our results demonstrate that changes in the ionic atmosphere and the crystallization solvent have a strong influence on the DNA-DNA interactions. Similar ionic changes will certainly influence the biological activity of DNA. Modulation of the crystal structure by ions should also be explored in DNA crystal engineering. Liquid crystals with a peculiar macroscopic shape have also been observed.


Subject(s)
Crystallography, X-Ray/methods , DNA/chemistry , AT Rich Sequence/genetics , Crystallization , DNA/genetics , Liquid Crystals , Nucleic Acid Conformation
12.
Acta Crystallogr D Biol Crystallogr ; 70(Pt 6): 1614-21, 2014 Jun.
Article in English | MEDLINE | ID: mdl-24914972

ABSTRACT

The DNA of several pathogens is very rich in AT base pairs. Typical examples include the malaria parasite Plasmodium falciparum and the causative agents of trichomoniasis and trypanosomiases. This fact has prompted studies of drugs which interact with the minor groove of DNA, some of which are used in medical practice. Previous studies have been performed almost exclusively with the AATT sequence. New features should be uncovered through the study of different DNA sequences. In this paper, the crystal structure of the complex of the DNA duplex d(AAAATTTT)2 with the dicationic drug 4,4'-bis(imidazolinylamino)diphenylamine (CD27) is presented. The drug binds to the minor groove of DNA as expected, but it shows two new features that have not previously been described: (i) the drugs protrude from the DNA and interact with neighbouring molecules, so that they may act as cross-linking agents, and (ii) the drugs completely cover the whole minor groove of DNA and displace bound water. Thus, they may prevent the access to DNA of proteins such as AT-hook proteins. These features are also expected for other minor-groove binding drugs when associated with all-AT DNA. These findings allow a better understanding of this family of compounds and will help in the development of new, more effective drugs. New data on the biological interaction of CD27 with the causative agent of trichomoniasis, Trichomonas vaginalis, are also reported.


Subject(s)
DNA/chemistry , Diphenylamine/analogs & derivatives , Imidazolines/chemistry , Crystallography, X-Ray , Diphenylamine/chemistry , Diphenylamine/pharmacology , Imidazolines/pharmacology , Nucleic Acid Conformation , Trichomonas vaginalis/drug effects
13.
PLoS One ; 8(4): e62221, 2013.
Article in English | MEDLINE | ID: mdl-23638010

ABSTRACT

Centromere sequences in the genome are associated with the formation of kinetochores, where spindle microtubules grow in mitosis. Centromere sequences usually have long tandem repeats (satellites). In holocentric nematodes it is not clear how kinetochores are formed during mitosis; they are distributed throughout the chromosomes. For this reason it appeared of interest to study the satellites in nematodes in order to determine if they offer any clue on how kinetochores are assembled in these species. We have studied the satellites in the genome of six nematode species. We found that the presence of satellites depends on whether the nematode chromosomes are holocentric or monocentric. It turns out that holocentric nematodes are unique because they have a large number of satellites scattered throughout their genome. Their number, length and composition are different in each species: they apparently have very little evolutionary conservation. In contrast, no scattered satellites are found in the monocentric nematode Trichinella spiralis. It appears that the absence/presence of scattered satellites in the genome distinguishes monocentric from holocentric nematodes. We conclude that the presence of satellites is related to the holocentric nature of the chromosomes of most nematodes. Satellites may stabilize a higher order structure of chromatin and facilitate the formation of kinetochores. We also present a new program, SATFIND, which is suited to find satellite sequences.


Subject(s)
DNA, Satellite , Genome, Helminth , Trichinella spiralis/genetics , Animals , Base Composition , Base Sequence , Centromere/genetics , Genome Size , Nucleotide Motifs , Phylogeny , Trichinella spiralis/classification
14.
PLoS One ; 7(5): e37120, 2012.
Article in English | MEDLINE | ID: mdl-22615915

ABSTRACT

We present here for the first time the crystal structure of an AT-hook domain. We show the structure of an AT-hook of the ubiquitous nuclear protein HMGA1, combined with the oligonucleotide d(CGAATTAATTCG)(2), which has two potential AATT interacting groups. Interaction with only one of them is found. The structure presents analogies and significant differences with previous NMR studies: the AT-hook forms hydrogen bonds between main-chain NH groups and thymines in the minor groove, DNA is bent and the minor groove is widened.


Subject(s)
DNA/chemistry , HMGA Proteins/chemistry , Binding Sites , Crystallography, X-Ray/methods , DNA/metabolism , HMGA Proteins/metabolism , Hydrogen Bonding , Magnetic Resonance Spectroscopy/methods , Models, Molecular , Oligonucleotides/metabolism , Protein Structure, Tertiary
15.
J Theor Biol ; 283(1): 28-34, 2011 Aug 21.
Article in English | MEDLINE | ID: mdl-21635904

ABSTRACT

There are general features of chromosome dynamics, such as homologue recognition in early meiosis, which are expected to involve related sequence motifs in non-coding DNA, with a similar distribution in different species. A search for such motifs is presented here. It has been carried out with the CONREPP programme. It has been found that short alternating AT sequences (10-20 bases) have a similar distribution in most eukaryotic organisms, with some exceptions related to unique meiotic features. All other microsatellite and repeat sequences vary significantly in different organisms. It is concluded that the unique structural features and uniform distribution of alternating AT sequences indicate that they may facilitate homologous chromosome pairing in the early preleptotene stage of meiosis. They may also play a role in the compaction of DNA in mitotic chromosomes.


Subject(s)
Eukaryota/genetics , Meiosis/genetics , Microsatellite Repeats/genetics , Animals , Base Sequence , Caenorhabditis elegans/genetics , Chromosome Pairing/genetics , Databases, Nucleic Acid , Genome , Humans , Molecular Sequence Data , Nucleosomes/genetics , Saccharomyces cerevisiae Proteins/genetics , Species Specificity
16.
Gene ; 473(2): 76-81, 2011 Mar 01.
Article in English | MEDLINE | ID: mdl-21167919

ABSTRACT

A very simple new program is presented (G-SQUARES). It is useful in order to visualize the composition and basic structural features of whole genomes and selected chromosome regions. The frequency of all dimer and tetramer sequences is reported. Overall structural features are calculated, such as the tendency for alternation. A direct visual comparison among different sequences is easily available. Furthermore, the features which are visualized indicate further studies which should be carried out. Examples are presented on Alu sequences, CpG islands, whole eukaryotic and bacterial genomes.


Subject(s)
Alu Elements , CpG Islands , Genome , Software , Base Sequence , Genome, Bacterial , Molecular Structure , Sequence Analysis, DNA
17.
Acta Crystallogr Sect F Struct Biol Cryst Commun ; 66(Pt 9): 1019-22, 2010 Sep 01.
Article in English | MEDLINE | ID: mdl-20823516

ABSTRACT

The crystal structure of the telomeric sequence d(UBrAGG) interacting with an anthraquinone derivative has been solved by MAD. In all previously studied complexes of intercalating drugs, the drug is usually sandwiched between two DNA base pairs. Instead, the present structure looks like a crystal of stacked anthraquinone molecules in which isolated base pairs are intercalated. Unusual base pairs are present in the structure, such as G.G and A.UBr reverse Watson-Crick base pairs.


Subject(s)
Anthraquinones/chemistry , DNA/chemistry , Nucleic Acid Conformation , Crystallography, X-Ray , Models, Molecular , Molecular Conformation , Stereoisomerism
19.
Acta Crystallogr D Biol Crystallogr ; 66(Pt 3): 251-7, 2010 Mar.
Article in English | MEDLINE | ID: mdl-20179336

ABSTRACT

The coiled-coil structure formed by the complex of the DNA duplex d(ATATATATAT)(2) with pentamidine is presented. The duplex was found to have a mixed structure containing Watson-Crick and Hoogsteen base pairs. The drug stabilizes the coiled coil through the formation of cross-links between neighbouring duplexes. The central part of the drug is found in the minor groove as expected, whereas the charged terminal amidine groups protrude and interact with phosphates from neighbouring molecules. The formation of cross-links may be related to the biological effects of pentamidine, which is used as an antiprotozoal agent in trypanosomiasis, leishmaniasis and pneumonias associated with AIDS. The DNA sequence that was used is highly abundant in most eukaryotic genomes. However, very few data are available on DNA sequences which only contain A.T base pairs.


Subject(s)
Base Pairing , DNA/chemistry , Pentamidine/chemistry , Base Sequence , Crystallography, X-Ray , Models, Molecular
20.
Nucleic Acids Res ; 38(4): 1172-81, 2010 Mar.
Article in English | MEDLINE | ID: mdl-19966278

ABSTRACT

The purpose of this work is to determine the most frequent short sequences in non-coding DNA. They may play a role in maintaining the structure and function of eukaryotic chromosomes. We present a simple method for the detection and analysis of such sequences in several genomes, including Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens. We also study two chromosomes of man and mouse with a length similar to the whole genomes of the other species. We provide a list of the most common sequences of 9-14 bases in each genome. As expected, they are present in human Alu sequences. Our programs may also give a graph and a list of their position in the genome. Detection of clusters is also possible. In most cases, these sequences contain few alternating regions. Their intrinsic structure and their influence on nucleosome formation are not known. In particular, we have found new features of short sequences in C. elegans, which are distributed in heterogeneous clusters. They appear as punctuation marks in the chromosomes. Such clusters are not found in either A. thaliana or D. melanogaster. We discuss the possibility that they play a role in centromere function and homolog recognition in meiosis.


Subject(s)
DNA, Intergenic/chemistry , Alu Elements , Animals , Arabidopsis/genetics , Base Sequence , Caenorhabditis elegans/genetics , Genomics , Humans , Mice , Microsatellite Repeats , Short Interspersed Nucleotide Elements
SELECTION OF CITATIONS
SEARCH DETAIL
...