Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 15 de 15
Filter
Add more filters










Publication year range
1.
Cell Mol Life Sci ; 61(6): 721-6, 2004 Mar.
Article in English | MEDLINE | ID: mdl-15052414

ABSTRACT

The localization of HIV-1 proviruses in compositional DNA fractions from 27 AIDS patients during the chronic phase of the disease with depletion of CD4+ and different levels of viremia showed the following. (1) At low viremia, proviruses are predominantly localized in the GC-richest isochores, which are characterized by an open chromatin structure; this result mimics findings on HIV-1 integration in early infected cells in culture. (2) At higher viremia, an increased distribution of proviruses in GC-poor isochores (which match the GC poorness of HIV-1) was found; this suggests a selection of cells in which the 'isopycnic' localization leads to a higher expression of proviruses and, in turn, to higher viremia. (3) At the highest viremia, integrations in GC-rich isochores are often predominant again, but generally not at the same level as in (1); this may be the consequence of new integrations from the extremely abundant RNA copies.


Subject(s)
Acquired Immunodeficiency Syndrome/genetics , Chromosomes, Human , Genome, Human , HIV-1/genetics , Viremia/genetics , Virus Integration , Base Composition , CD4 Antigens , Chromatin/genetics , DNA, Viral/analysis , DNA, Viral/genetics , GC Rich Sequence , Humans , Isochores , Proviruses/genetics , RNA, Viral/genetics
2.
Gene ; 276(1-2): 39-45, 2001 Oct 03.
Article in English | MEDLINE | ID: mdl-11591470

ABSTRACT

Alus and LINEs (LINE1) are widespread classes of repeats that are very unevenly distributed in the human genome. The majority of GC-poor LINEs reside in the GC-poor isochores whereas GC-rich Alus are mostly present in GC-rich isochores. The discovery that LINES and Alus share similar target site duplication and a common AT-rich insertion site specificity raised the question as to why these two families of repeats show such a different distribution in the genome. This problem was investigated here by studying the isochore distributions of subfamilies of LINES and Alus characterized by different degrees of divergence from the consensus sequences, and of Alus, LINEs and pseudogenes located on chromosomes 21 and 22. Young Alus are more frequent in the GC-poor part of the genome than old Alus. This suggests that the gradual accumulation of Alus in GC-rich isochores has occurred because of their higher stability in compositionally matching chromosomal regions. Densities of Alus and LINEs increase and decrease, respectively, with increasing GC levels, except for the telomeric regions of the analyzed chromosomes. In addition to LINEs, processed pseudogenes are also more frequent in GC-poor isochores. Finally, the present results on Alu and LINE stability/exclusion predict significant losses of Alu DNA from the GC-poor isochores during evolution, a phenomenon apparently due to negative selection against sequences that differ from the isochore composition.


Subject(s)
Alu Elements/genetics , Genome, Human , Long Interspersed Nucleotide Elements/genetics , Base Composition , Chromosomes, Human, Pair 21/genetics , Chromosomes, Human, Pair 22/genetics , DNA/genetics , GC Rich Sequence/genetics , Humans , Mutagenesis, Insertional
3.
Gene ; 247(1-2): 287-92, 2000 Apr 18.
Article in English | MEDLINE | ID: mdl-10773468

ABSTRACT

In the present work we show that in the Drosophila genome (which covers a 37-51% GC range at a DNA size of approx.50kb) a linear correlation holds between GC (or GC(3)50kb) genomic sequences embedding them. This correlation allows us to position the two compositional distributions of (a) coding sequences, and (b) of long DNA segments relative to each other and to calculate gene concentration across the compositional range of the Drosophila genome. Using this approach, we show that gene concentration increases with increasing GC of the regions embedding the genes, reaching a 7-fold higher level in the GC-richest regions compared with the GC-poorest regions. The gene distribution of the Drosophila genome is, therefore, similar to (although less striking than) that of the human genome, whereas it is very different from those of the Arabidopsis genome, which has about the same size as the Drosophila genome.


Subject(s)
Drosophila melanogaster/genetics , Genes, Insect/genetics , Genome , Animals , Base Composition , Codon , DNA/genetics , Databases, Factual , GC Rich Sequence , Genome, Human , Humans
4.
J Mol Evol ; 50(1): 45-55, 2000 Jan.
Article in English | MEDLINE | ID: mdl-10654259

ABSTRACT

Mycobacterium tuberculosis and Mycobacterium leprae are the ethiological agents of tuberculosis and leprosy, respectively. After performing extensive comparisons between genes from these two GC-rich bacterial species, we were able to construct a set of 275 homologous genes. Since these two bacterial species also have a very low growth rate, translational selection could not be so determinant in their codon preferences as it is in other fast-growing bacteria. Indeed, principal-components analysis of codon usage from this set of homologous genes revealed that the codon choices in M. tuberculosis and M. leprae are correlated not only with compositional constraints and translational selection, but also with the degree of amino acid conservation and the hydrophobicity of the encoded proteins. Finally, significant correlations were found between GC3 and synonymous distances as well as between synonymous and nonsynonymous distances.


Subject(s)
Codon/genetics , Mycobacterium leprae/genetics , Mycobacterium tuberculosis/genetics , Amino Acid Sequence , Base Pairing , Conserved Sequence , Evolution, Molecular , Gene Expression Regulation, Bacterial , Genetic Variation , Nucleotides/genetics
5.
Gene ; 238(1): 3-14, 1999 Sep 30.
Article in English | MEDLINE | ID: mdl-10570978

ABSTRACT

The "universal correlation" (D'Onofrio, G., Bernardi, G., 1992. A universal compositional correlation among codon positions. Gene 110, 81-88.) that holds between and or ( values are the average values of the coding sequences of each genome analyzed) at both the inter- and intra-genomic level, was re-analyzed on a vastly larger dataset. The results showed a slight, but significant, difference in the vs. correlations exhibited by prokaryotes and eukaryotes. This finding prompted an analysis of the correlation between and the amino acid frequencies in the encoded proteins, which has shown that positive correlations exist between values of coding sequences and the hydropathy of the corresponding proteins. These correlations are due to the fact that hydrophobic and amphypathic amino acids increase, whereas hydrophilic amino acids decrease with increasing values. Hydropathy values of prokaryotic proteins are systematically higher than those of eukaryotes, but the slopes of the regression lines are identical. The lower hydrophobicity of eukaryotic proteins is due to differences in the amino acid composition. In particular, the twofold higher cysteine (and disulfide bond) level of eukaryotic proteins compared to prokaryotic proteins most probably compensates for their lower hydrophobicity. This supports the viewpoint that hydrophobicity plays a structural and functional role as far as protein stability is concerned.


Subject(s)
Base Composition , Proteins/chemistry , Amino Acids/chemistry , Base Sequence , Codon , Genome , Proteins/genetics
6.
Gene ; 238(1): 15-21, 1999 Sep 30.
Article in English | MEDLINE | ID: mdl-10570979

ABSTRACT

A compositional transition was previously detected by comparing orthologous coding sequences from cold- and warm-blooded vertebrates (see Bernardi, G., Hughes, S., Mouchiroud, D., 1997. The major compositional transitions in the vertebrate genome. J. Mol. Evol. 44, S44-S51 for a review). The transition is characterized by higher GC levels (GC is the molar ratio of guanine+cytosine in DNA) and, especially, by higher GC3 levels (GC3 is the GC level of third codon positions) in coding sequences from warm-blooded vertebrates. This transition essentially affects GC-rich genes, although the nucleotide substitution rate is of the same order of magnitude in both GC-poor and GC-rich genes. In order to understand the evolutionary basis of the changes, we have compared the hydrophobicity of orthologous proteins from Xenopus and human. Although the differences are small in proteins encoded by coding sequences ranging from 0 to 65% in GC3, they are large in the proteins encoded by sequences characterized by GC3 values higher than 65%. The latter proteins are more hydrophobic in human than in Xenopus.


Subject(s)
Proteins/chemistry , Amino Acid Sequence , Animals , Cattle , Humans , Molecular Sequence Data , Water/chemistry , Xenopus
7.
Gene ; 238(1): 23-31, 1999 Sep 30.
Article in English | MEDLINE | ID: mdl-10570980

ABSTRACT

We investigated the relationships between the nucleotide substitution rates and the predicted secondary structures in the three states representation (alpha-helix, beta-sheet, and coil). The analysis was carried out on 34 alignments, each of which comprised sequences belonging to at least four different mammalian orders. The rates of synonymous substitution were found to be significantly different in regions predicted to be alpha-helix, beta-sheet, or coil. Likewise, the nonsynonymous rates also differ, although expectedly at a lower extent, in the three types of secondary structure, suggesting that different selective constraints associated with the different structures are affecting in a similar way the synonymous and nonsynonymous rates. Moreover, the base composition of the third codon positions is different in coding sequence regions corresponding to different secondary structures of proteins.


Subject(s)
Base Composition , Codon , Mutation , Protein Structure, Secondary , Animals , Mammals
8.
J Mol Evol ; 49(3): 330-42, 1999 Sep.
Article in English | MEDLINE | ID: mdl-10473774

ABSTRACT

In this work, we have investigated the relationships between synonymous and nonsynonymous rates and base composition in coding sequences from Gramineae to analyze the factors underlying the variation in substitutional rates. We have shown that in these genes the rates of nucleotide divergence, both synonymous and nonsynonymous, are, to some extent, dependent on each other and on the base composition. In the first place, the variation in nonsynonymous rate is related to the GC level at the second codon position (the higher the GC(2) level, the higher the amino acid replacement rate). The correlation is especially strong with T(2), the coefficients being significant in the three data sets analyzed. This correlation between nonsynonymous rate and base composition at the second codon position is also detectable at the intragenic level, which implies that the factors that tend to increase the intergenic variance in nonsynonymous rates also affect the intragenic variance. On the other hand, we have shown that the synonymous rate is strongly correlated with the GC(3) level. This correlation is observed both across genes and at the intragenic level. Similarly, the nonsynonymous rate is also affected at the intragenic level by GC(3) level, like the silent rate. In fact, synonymous and nonsynonymous rates exhibit a parallel behavior in relation to GC(3) level, indicating that the intragenic patterns of both silent and amino acid divergence rates are influenced in a similar way by the intragenic variation of GC(3). This result, taken together with the fact that the number of genes displaying intragenic correlation coefficients between synonymous and nonsynonymous rates is not very high, but higher than random expectation (in the three data sets analyzed), strongly suggests that the processes of silent and amino acid replacement divergence are, at least in part, driven by common evolutionary forces in genes from Gramineae.


Subject(s)
Edible Grain/genetics , Evolution, Molecular , Genes, Plant , Genetic Variation , Base Composition , Enzymes/genetics , Hordeum/genetics , Oryza/genetics , Phylogeny , Plant Proteins/genetics , Triticum/genetics , Zea mays/genetics
9.
Ann N Y Acad Sci ; 870: 81-94, 1999 May 18.
Article in English | MEDLINE | ID: mdl-10415475

ABSTRACT

The discovery that the vertebrate genomes of warm-blooded vertebrates are mosaics of isochores, long DNA segments homogeneous in base composition, yet belonging to families covering a broad spectrum of GC levels, has led to two major observations. The first is that gene density is strikingly non-uniform in the genome of all vertebrates, gene concentration increasing with increasing GC levels. (Although the genomes of cold-blooded vertebrates are characterized by smaller compositional heterogeneities than those of warm-blooded vertebrates and high GC levels are not attained, their gene distribution is basically similar to that of warm-blooded vertebrates.) The second observation is that the GC-richest and gene-richest isochores underwent a compositional transition (characterized by a strong increase in GC level) between cold- and warm-blooded vertebrates. Evidence to be discussed favors the idea that this compositional transition and the ensuing highly heterogeneous compositional pattern was due to, and was maintained by, natural selection.


Subject(s)
Evolution, Molecular , Vertebrates/genetics , Animals , Humans
10.
J Mol Evol ; 49(1): 27-35, 1999 Jul.
Article in English | MEDLINE | ID: mdl-10368431

ABSTRACT

We have analyzed the patterns of synonymous codon preferences of the nuclear genes of Plasmodium falciparum, a unicellular parasite characterized by an extremely GC-poor genome. When all genes are considered, codon usage is strongly biased toward A and T in third codon positions, as expected, but multivariate statistical analysis detects a major trend among genes. At one end genes display codon choices determined mainly by the extreme genome composition of this parasite, and very probably their expression level is low. At the other end a few genes exhibit an increased relative usage of a particular subset of codons, many of which are C-ending. Since the majority of these few genes is putatively highly expressed, we postulate that the increased C-ending codons are translationally optimal. In conclusion, while codon usage of the majority of P. falciparum genes is determined mainly by compositional constraints, a small number of genes exhibit translational selection.


Subject(s)
Codon/genetics , Plasmodium falciparum/genetics , Selection, Genetic , Animals , Genome, Protozoan , Models, Genetic , Protein Biosynthesis
11.
J Mol Evol ; 46(1): 37-44, 1998 Jan.
Article in English | MEDLINE | ID: mdl-9419223

ABSTRACT

Previous investigations indicated that synonymous and nonsynonymous substitution rates are correlated in mammalian genes. In the present work, this correlation has been studied at the intragenic level using a dataset of 48 orthologous genes from species belonging to at least four different mammalian orders. The results obtained show that the intragenic variability in synonymous rates is correlated with that of nonsynonymous rates. Moreover, the variation in GC level (and especially of C level) of silent positions along each gene is correlated with the variation in synonymous rate. These results reinforce the previous conclusions that synonymous and nonsynonymous rates as well as GC levels of silent positions are to some extent under common selective constraints.


Subject(s)
Mammals/genetics , Models, Genetic , Animals , Biological Evolution , Codon , Genes
12.
J Mol Evol ; 46(1): 45-53, 1998 Jan.
Article in English | MEDLINE | ID: mdl-9419224

ABSTRACT

In this work, we investigated (1) the compositional distributions of all available nuclear coding sequences (and of their three codon positions) of six dicots and four Gramineae; this considerably expanded our knowledge about the differences previously seen between these two groups of plants; (2) the compositional correlations of homologous genes from dicots and from Gramineae, as well as from both groups; all correlations were characterized by very good coefficients, with slopes close to unity in the former two cases and very high in the last; (3) the compositional transition that accompanied the emergence of Gramineae from an ancestral monocot; (4) the compositional correlations between exons and introns, which were very good in Gramineae, but only poor to good in dicots; and (5) the compositional profiles of homologous genes from angiosperms, which were characterized by a series of peaks (exons) and valleys (introns) separated by 15-20% GC. The conservative and transitional modes of compositional evolution in plant genes and their general implications are discussed.

13.
Gene ; 224(1-2): 123-7, 1998 Dec 11.
Article in English | MEDLINE | ID: mdl-9931467

ABSTRACT

A computer analysis of 946 human DNA sequences larger than 50kb and representing about 118Mb of DNA has led to the following observations. (i) Positive correlations hold between CpG levels and the GC levels of isochores and coding sequences, as expected from previous results. (ii) The correlation between CpG levels and the GC levels of pseudogenes is characterized by lower CpG values (at comparable GC levels) and by a lower slope compared with the correlation with coding sequences; this finding suggests that an extensive methylation followed by deamination has taken place on CpG doublets from inactive genes leading to a further CpG shortage. (iii) The frequency of CpG islands in long human sequences increases with increasing GC and almost parallels gene frequency. (iv) The frequency of Alu sequences also increases with increasing GC, but attains a maximum in H2 isochores, in agreement with previous experimental data. (v) The ratio 5mC/CpG (namely, the methylation level over available sites) decreases with increasing GC levels of isochores. This decrease is due only to a small extent to the increase of (unmethylated) CpG islands in GC-rich isochores, and takes place in spite of the increase of strongly methylated Alu sequences in GC-rich isochores; this stresses the much lower relative methylation (5mC/CpG) of single-copy sequences located in GC-rich isochores relative to those located in GC-poor isochores. (vi) CpG levels of Alus and CpG islands are positively correlated with the GC levels of the long sequences in which they are located. (vii) The CpG levels of both Alus and CpG islands increase with their GC levels.


Subject(s)
Alu Elements/genetics , CpG Islands/genetics , DNA/genetics , 5-Methylcytosine , Amino Acid Sequence , Codon , Cytosine/analogs & derivatives , Databases, Factual , Guanine , Humans , Pseudogenes , Statistics as Topic
14.
Gene ; 205(1-2): 109-18, 1997 Dec 31.
Article in English | MEDLINE | ID: mdl-9461384

ABSTRACT

We have analysed the levels of 5-methylcytosine (5mC) in DNAs from 42 vertebrates, and compiled, including data from literature, a table of genomic 5mC and GC levels (as well as the available c-values, i.e., the haploid genome sizes) of 87 species from all vertebrate classes. An analysis of the data indicates that (i) two positive correlations hold between the 5mC and GC levels of the genomes of fishes/amphibians and mammals/birds, respectively; (ii) the genomes of fishes and amphibians are, on average, about twice as methylated as those of mammals, birds and reptiles, this difference being unrelated to the amounts of repetitive DNA sequences; (iii) the 5mC and CpG observed/expected values show no overlap between the two groups of vertebrates and suggest the existence of two equilibria. The transition separating the two equilibria appears to have taken place at the time of appearance of reptiles. Its possible cause(s) and its implications are discussed.


Subject(s)
CpG Islands/genetics , DNA Methylation , Evolution, Molecular , Genome , Vertebrates/genetics , Animals , DNA/genetics , Humans , Repetitive Sequences, Nucleic Acid , Species Specificity
15.
Gene ; 205(1-2): 119-24, 1997 Dec 31.
Article in English | MEDLINE | ID: mdl-9461385

ABSTRACT

5-Methylcytosine (5mC) levels were determined in compositional DNA fractions corresponding to different isochore families from the genomes of Xenopus, chicken, mouse and human, four vertebrates which show different isochore patterns. The results obtained indicate that: (i) positive correlations exist between the 5mC levels and the GC levels of isochores within any given genome; and (ii) DNA from Xenopus isochore families is twice as methylated as DNA from the isochores having the same GC levels from mouse, human and chicken. Moreover, the positive correlations holding between CpG levels and the GC3 levels of coding sequences of warm-blooded vertebrates were shown to comprise two regions with a border at approx. 75% GC3. The correlation corresponding to the higher region (which comprises only very rare high GC3 values in the case of Xenopus) has a higher slope than that corresponding to the lower GC3 values, a phenomenon due in all likelihood, to the increasing contribution of CpG islands. Finally, the observed/expected CpG ratio is higher in Xenopus than in warm-blooded vertebrates.


Subject(s)
DNA Methylation , Genome , Vertebrates/genetics , Animals , CpG Islands/genetics , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...