Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
Add more filters










Database
Language
Publication year range
1.
Trends Genet ; 17(11): 619-21, 2001 Nov.
Article in English | MEDLINE | ID: mdl-11672845

ABSTRACT

To study the genome-wide impact of transposable elements (TEs) on the evolution of protein-coding regions, we examined 13 799 human genes and found 533 (approximately 4%) cases of TEs within protein-coding regions. The majority of these TEs (approximately 89.5%) reside within 'introns' and were recruited into coding regions as novel exons. We found that TE integration often has an effect on gene function. In particular, there were two mouse genes whose coding regions consist largely of TEs, suggesting that TE insertion might create new genes. Thus, there is increasing evidence for an important role of TEs in gene evolution. Because many TEs are taxon-specific, their integration into coding regions could accelerate species divergence.


Subject(s)
DNA Transposable Elements/genetics , Genes , Animals , Evolution, Molecular , Genome, Human , Humans , Mice , Species Specificity
2.
Nature ; 409(6822): 847-9, 2001 Feb 15.
Article in English | MEDLINE | ID: mdl-11237007

ABSTRACT

The completion of the human genome will greatly accelerate the development of a new branch of science--evolutionary genomics. We can now directly address important questions about the evolutionary history of human genes and their regulatory sequences. Computational analyses of the human genome will reveal the number of genes and repetitive elements, the extent of gene duplication and compositional heterogeneity in the human genome, and the extent of domain shuffling and domain sharing among proteins. Here we present some first glimpses of these features.


Subject(s)
Evolution, Molecular , Genome, Human , Conserved Sequence , Gene Duplication , Human Genome Project , Humans , Regulatory Sequences, Nucleic Acid , Repetitive Sequences, Nucleic Acid
3.
Mol Biol Evol ; 18(1): 55-60, 2001 Jan.
Article in English | MEDLINE | ID: mdl-11141192

ABSTRACT

Unlike other mammals, Old World primates have five growth hormone-like genes that are highly divergent at the amino acid level from the single growth hormone genes found in nonprimates. Additionally, there is a change in the interaction of growth hormone with its receptor in humans such that human growth hormone functions in nonprimates, whereas nonprimate growth hormone is ineffective in humans. A Southern blotting analysis of the genome of a prosimian, Galago senegalensis, revealed a single growth hormone locus. This single gene was PCR-amplified from genomic DNA and sequenced. It has a rate of nonsynonymous nucleotide substitution less than one fourth that of the human growth hormone gene, while the rates of synonymous substitution in the two species are less different. Human and rhesus monkey growth hormones exhibit variation at a number of amino acid residues that can affect receptor binding. The galago growth hormone is conservative at each of these sites, indicating that this growth hormone is functionally like nonprimate growth hormones. These observations indicate that the amplification and rapid divergence of primate growth hormones occurred after the separation of the higher primate lineage from the galago lineage.


Subject(s)
Evolution, Molecular , Galago/genetics , Growth Hormone/genetics , Amino Acid Sequence , Animals , Binding Sites , Cloning, Molecular , Conserved Sequence , Galago/classification , Gene Dosage , Growth Hormone/metabolism , Humans , Macaca mulatta/genetics , Molecular Sequence Data , Phylogeny , Polymerase Chain Reaction , Receptors, Somatotropin/metabolism , Sequence Analysis, DNA
4.
Genome Res ; 10(12): 1986-95, 2000 Dec.
Article in English | MEDLINE | ID: mdl-11116093

ABSTRACT

Using large amounts of long genomic sequences, we studied the compositional patterns of eukaryotic genomes. We developed a simple measure, the compositional heterogeneity (or variability) index, to compare the differences in compositional heterogeneity between long genomic sequences. The index measures the average difference in GC content between two adjacent windows normalized by the standard error expected under the assumption of random distribution of nucleotides in a window. We report the following findings: (1) The extent of the compositional heterogeneity in a genomic sequence strongly correlates with its GC content in all multicellular eukaryotes studied regardless of genome size. (2) The human genome appears to be highly compositionally heterogeneous both within and between individual chromosomes; the heterogeneity goes much beyond the predictions of the isochore model. (3) All genomes of multicellular eukaryotes examined in this study are compositionally heterogeneous, although they also contain compositionally uniform segments, or isochores. (4) The true uniqueness of the human (or mammalian) genome is the presence of very high GC regions, which exhibit unusually high compositional heterogeneity and contain few long homogeneous segments (isochores). In general, GC-poor isochores tend to be longer than GC-rich ones. These findings indicate that the genomes of multicellular organisms are much more heterogeneous in nucleotide composition than depicted by the isochore model and so lead to a looser definition of isochores.


Subject(s)
Eukaryotic Cells/physiology , Genetic Heterogeneity , Genome , Algorithms , Animals , Base Composition , Caenorhabditis elegans/genetics , Centrifugation, Density Gradient , Computational Biology , Drosophila melanogaster/genetics , GC Rich Sequence/genetics , Genome, Fungal , Genome, Human , Genome, Plant , Humans , Models, Genetic , Sequence Analysis, DNA , Sequence Homology, Nucleic Acid
5.
J Mol Evol ; 51(2): 166-72, 2000 Aug.
Article in English | MEDLINE | ID: mdl-10948273

ABSTRACT

Microsatellite length variation was investigated at a highly variable microsatellite locus in four species of Apodemus. Information obtained from microsatellite allele sequences was contrasted with allele sizes, which included 18 electromorphs. Additional analysis of a 400-bp unique sequence in the flanking region identified 26 different haplotype sequences or "true" alleles in the sample. Three molecular mechanisms, namely, (1) addition/deletion of repeats, (2) substitutions and indels in the flanking region, and (3) mutations interrupting the repeat, contributed to the generation of allelic variation. Size homoplasy can be inferred for alleles within populations, from different populations of the same species, and from different species. We propose that microsatellite flanking sequences may be informative markers for investigating mutation processes in microsatellite repeats as well as phylogenetic relationships among alleles, populations, and species.


Subject(s)
Evolution, Molecular , Genetic Variation , Microsatellite Repeats , Models, Genetic , Muridae/genetics , Alleles , Animals , Dinucleotide Repeats/genetics , Molecular Sequence Data , Mutation , Phylogeny , Sequence Homology, Nucleic Acid , Tumor Necrosis Factor-alpha/genetics
6.
Gene ; 249(1-2): 47-51, 2000 May 16.
Article in English | MEDLINE | ID: mdl-10831837

ABSTRACT

Representational difference analysis (RDA), a technique for the isolation of differences between highly similar complex genomes, was employed for isolation of species-specific markers. These markers can be easily adapted for a high throughput PCR-based assay in which multiple specimens can be simultaneously identified based on the presence/absence of amplification products. One of the important features of RDA performed on genomes of different species (interspecific RDA) is its ability to preferentially isolate families of repetitive sequences that are unique to one of the compared genomes. Such families of repetitive DNA are homoplasy-free characters that can be used for cost-efficient, mass identification of specimens in a variety of situations ranging from mark-recapture studies to screenings of egg or larval stages.


Subject(s)
Arvicolinae/genetics , Genetic Markers/genetics , Animals , Arvicolinae/classification , DNA/chemistry , DNA/isolation & purification , DNA Primers , Molecular Sequence Data , Polymerase Chain Reaction , Repetitive Sequences, Nucleic Acid , Sequence Analysis, DNA , Species Specificity
7.
Gene ; 259(1-2): 81-8, 2000 Dec 23.
Article in English | MEDLINE | ID: mdl-11163965

ABSTRACT

The densities of repetitive elements in the human genome were calculated in each GC content class using non-overlapping windows of 50kb. The density of Alu is two to three times higher in GC-rich regions than in AT-rich regions, while the opposite is true for LINE1. In contrast, LINE2 and other elements, such as DNA transposons, are more uniformly distributed in the genome. The number of Alus in the human genome was estimated to be 1.4 million, higher than previous estimates. About 40% of the autosomes and approximately 51% of the X and Y chromosomes are occupied by repetitive elements. In total, the human genome is estimated to contain more than 4 million repetitive elements. The GC contents (%) of repetitive elements and their flanking regions were also calculated. The GC contents of almost all kinds of repeats are positively correlated with the window GC contents, suggesting that a repetitive sequence is subject to the same mutation pressure as its surrounding regions, so it tends to have the same GC content as its surrounding regions. This observation supports the regional mutation hypothesis. The only two exceptions are AluYa and AluYb8, the two youngest Alu subfamilies. The GC content of AluYb8 is negatively correlated with that of its surrounding regions, while AluYa shows no correlation, suggesting different insertion patterns for these two young Alu subfamilies. This suggestion was supported by the fact that the average genetic distance between members of AluYb8 in each GC window class is positively correlated with the GC content of the window, but no correlation was found for AluYa. AluYa is more frequent in Y chromosome than in other chromosomes; the same is true for LTR retroviruses. This pattern might be correlated with the evolutionary history of Y chromosome.


Subject(s)
DNA/genetics , Genome, Human , Repetitive Sequences, Nucleic Acid/genetics , Alu Elements/genetics , Base Composition , Female , Humans , Male
9.
Mol Biol Evol ; 15(12): 1674-84, 1998 Dec.
Article in English | MEDLINE | ID: mdl-9866202

ABSTRACT

In this study, we report cDNA sequences of the cytosolic NADP-dependent isocitrate dehydrogenase for humans, mice, and two species of voles (Microtus mexicanus and Microtus ochrogaster). Inferred amino acid sequences from these taxa display a high level of amino acid sequence conservation, comparable to that of myosin beta heavy chain, and share known structural features. A Caenorhabditis elegans enzyme that was previously identified as a protein similar to isocitrate dehydrogenase is most likely the NADP-dependent cytosolic isocitrate dehydrogenase enzyme equivalent, based on amino acid similarity to mammalian enzymes and phylogenetic analysis. We also suggest that NADP-dependent isocitrate dehydrogenases characterized from alfalfa, soybean, and eucalyptus are most likely cytosolic enzymes. The phylogenetic tree of various isocitrate dehydrogenases from eukaryotic sources revealed that independent gene duplications may have given rise to the cytosolic and mitochondrial forms of NADP-dependent isocitrate dehydrogenase in animals and fungi. There appears to be no statistical support for a hypothesis that the mitochondrial and cytosolic forms of the enzyme are orthologous in these groups. A possible scenario of the evolution of NADP-dependent isocitrate dehydrogenases is proposed.


Subject(s)
Arvicolinae/genetics , Evolution, Molecular , Isocitrate Dehydrogenase/genetics , Mice/genetics , Phylogeny , Amino Acid Sequence , Animals , Cytosol/enzymology , DNA, Complementary , Humans , Isocitrate Dehydrogenase/biosynthesis , Isocitrate Dehydrogenase/chemistry , Likelihood Functions , Models, Statistical , Molecular Sequence Data , Open Reading Frames , Recombinant Proteins/biosynthesis , Recombinant Proteins/chemistry , Saccharomyces cerevisiae/enzymology , Saccharomyces cerevisiae/genetics , Sequence Alignment , Sequence Homology, Amino Acid
SELECTION OF CITATIONS
SEARCH DETAIL
...