Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 59
Filter
Add more filters










Publication year range
1.
J Mol Biol ; 308(4): 587-96, 2001 May 11.
Article in English | MEDLINE | ID: mdl-11350162

ABSTRACT

When viewed from the perspective of time, human genetic disorders give new insights into their etiology and evolution. Here, we have correlated a specific set of Alu repetitive DNA elements, known to be the basis of certain genetic defects, with their phylogenetic roots in primate evolution. From a differential distribution of Alu repeats among primate species, we identify the phylogenetic roots of three human genetic diseases involving the LPL, ApoB, and HPRT genes. The different phylogenetic age of these genetic disorders could explain the different susceptibility of various primate species to genetic diseases. Our results show that LPL deficiency is the oldest and should affect humans, apes, and monkeys. ApoB deficiency should affect humans and great apes, while a disorder in the HPRT gene (leading to the Lesch-Nyhan syndrome) is unique to human, chimpanzee, and gorilla. Similar results can be obtained for cancer. We submit that de novo transpositions of Alu elements, and saltatory appearances of Alu-mediated genetic disorders, represent singularities, places where behavior changes suddenly. Alus' propensity to spread, not only increased the regulatory and developmental complexity of the primate genome, it also increased its instability and susceptibility to genetic defects and cancer. The dynamic spread not only provided markers of primate phylogeny, it must have actively shaped the course of that phylogeny.


Subject(s)
Genetic Diseases, Inborn/genetics , Genetic Predisposition to Disease/genetics , Phylogeny , Primates/genetics , Alu Elements/genetics , Animals , Apolipoproteins B/deficiency , Apolipoproteins B/genetics , Base Sequence , Evolution, Molecular , Exons/genetics , Genetics, Population , Genotype , Globins/genetics , Humans , Hypoxanthine Phosphoribosyltransferase/deficiency , Hypoxanthine Phosphoribosyltransferase/genetics , Introns/genetics , Lipoprotein Lipase/deficiency , Lipoprotein Lipase/genetics , Molecular Sequence Data , Mutagenesis, Insertional/genetics , Neoplasms/genetics , Point Mutation/genetics
2.
J Mol Biol ; 299(4): 931-9, 2000 Jun 16.
Article in English | MEDLINE | ID: mdl-10843848

ABSTRACT

Differential gene expression lies at the heart of biology and is responsible for all developmental processes, including the growth and differentiation of cells. Perhaps even speciation could be defined as a change in differential gene expression over evolutionary time. The present work is a phylogenetic study of four Alu elements known to have gene regulatory functions in the human. The four elements have been shown to regulate the parathyroid hormone (PTH) gene via a negative calcium-response element, the hematopoietic cell-specific FcepsilonRI-gamma receptor gene via a cis-acting positive/negative regulatory element, the CNS-specific nicotinic acetylcholine receptor alpha3 gene via a cis-acting positive/negative control element, and the T-cell-specific CD8alpha gene via a complex transcriptional regulator. The four Alu elements that impact differential gene expression were found to be differentially distributed among seven primate species (human, chimpanzee, gorilla, orangutan, baboon, rhesus, and macaque) in a way that is congruent with an accepted phylogeny of these species. The results establish a link between gene regulation and the divergence of primates. This evolutionary variation in gene regulation also suggests a novel experimental system to study the very complex transcriptional regulation of gene expression, by studying side-by-side the regulation of the same gene from two primate species that differ in the cis-acting regulatory elements of the gene.


Subject(s)
Alu Elements/genetics , Gene Expression Regulation/genetics , Phylogeny , Primates/genetics , Promoter Regions, Genetic/genetics , Animals , Base Sequence , CD8 Antigens/genetics , Calcium/pharmacology , Cercopithecidae/genetics , Enhancer Elements, Genetic/genetics , Evolution, Molecular , Gene Expression Regulation/drug effects , Hominidae/genetics , Humans , Molecular Sequence Data , Mutagenesis, Insertional/genetics , Organ Specificity , Parathyroid Hormone/genetics , Polymerase Chain Reaction , Receptors, IgE/genetics , Receptors, Nicotinic/genetics , Response Elements/genetics , Species Specificity , T-Lymphocytes/metabolism
3.
J Mol Biol ; 289(4): 861-71, 1999 Jun 18.
Article in English | MEDLINE | ID: mdl-10369767

ABSTRACT

Over the past 60 million years, or so, approximately one million copies of Alu DNA repeats have accumulated in the genome of primates, in what appears to be an ongoing process. We determined the phylogenetic distribution of specific Alu (and other) DNA repeats in the genome of several primates: human, chimpanzee, gorilla, orangutan, baboon, rhesus, and macaque. At the population level studied, the majority of the repeats was found to be fixed in the primate species. Our data suggest that new Alu elements arise in unique, irreversible events, in a mechanism that seems to preclude precise excision and loss. The same insertions did not arise independently in two species. Once inserted and genetically fixed, the DNA elements are retained in all descendant lineages. The irreversible expansion of Alu s introduces a vector of time into the evolutionary process, and provides realistic (rather than statistical) answers to questions on phylogenies. In contrast to point mutations, the present distribution of individual Alu s is congruent with just one phylogeny. We submit that only irreversible and taxonomically relevant events are at the molecular basis of evolution. Most point mutations do not belong to this category.


Subject(s)
Alu Elements , Biological Evolution , Evolution, Molecular , Primates/genetics , Animals , Base Sequence , Cercopithecidae/classification , Cercopithecidae/genetics , Hominidae/classification , Hominidae/genetics , Humans , Macaca/genetics , Macaca mulatta/genetics , Molecular Sequence Data , Mutagenesis, Insertional , Pan troglodytes/genetics , Phylogeny , Point Mutation , Polymorphism, Genetic , Primates/classification
4.
Biol Chem ; 380(12): 1431-4, 1999 Dec.
Article in English | MEDLINE | ID: mdl-10661870

ABSTRACT

The albumin gene family arose in a series of duplication events which gave rise to symmetry in its structure. The four genes are tandemly linked on human chromosome 4q in the order: 5'ALB-5'AFP-5'ALF-5'DBP-centromere, and their introns display a symmetrical and repetitive pattern that is shared by members of the gene family. These repetitive motifs provide an internal reference, allowing observations of evolutionary changes within a single line (human) of evolutionary descent. The four genes and three intergenic regions between them increase in size as they get closer to the centromere. An invasion by multiple repetitive DNA elements may account, in part, for this expansion.


Subject(s)
Albumins/genetics , Chromosome Mapping , Chromosomes, Human, Pair 4 , Centromere , Humans
5.
J Mol Evol ; 46(5): 552-61, 1998 May.
Article in English | MEDLINE | ID: mdl-9545466

ABSTRACT

The serum albumin gene family is composed of four members that have arisen by a series of duplications from a common ancestor. From sequence differences between members of the gene family, we infer that a gene duplication some 580 Myr ago gave rise to the vitamin D-binding protein (DBP) gene and a second lineage, which reduplicated about 295 Myr ago to give the albumin (ALB) gene and a common precursor to alpha-fetoprotein (AFP) and alpha-albumin (ALF). This precursor itself duplicated about 250 Myr ago, giving rise to the youngest family members, AFP and ALF. It should be possible to correlate these dates with the phylogenetic distribution of members of the gene family among different species. All four genes are found in mammals, but AFP and ALF are not found in amphibia, which diverged from reptiles about 360 Myr ago, before the divergence of the AFP-ALF progenitor from albumin. Although individual family members display an approximate clock-like evolution, there are significant deviations-the rates of divergence for AFP differ by a factor of 7, the rates for ALB differ by a factor of 2.1. Since the progenitor of this gene family itself arose by triplication of a smaller gene, the rates of evolution of individual domains were also calculated and were shown to vary within and between family members. The great variation in the rates of the molecular clock raises questions concerning whether it can be used to infer evolutionary time from contemporary sequence differences.


Subject(s)
Albumins/genetics , Evolution, Molecular , Vitamin D-Binding Protein/genetics , alpha-Fetoproteins/genetics , Animals , Genetics, Population , Humans , Models, Genetic , Multigene Family , Serum Albumin , Time Factors
6.
Proc Natl Acad Sci U S A ; 93(15): 7557-61, 1996 Jul 23.
Article in English | MEDLINE | ID: mdl-8755513

ABSTRACT

The nucleotide sequence of the human alpha-albumin gene, including 887 bp of the 5'-flanking region and 1311 bp of the 3-flanking region (24,454 in total), was determined from three overlapping lambda phage clones. The sequence spans 22,256 bp from the cap site to the polyadenylylation site, revealing a gene structure of 15 exons separated by 14 introns. The methionine initiation codon ATG is within exon 1; the termination codon TGA is within exon 14. Exon 15 is entirely untranslated and contains the polyadenylylation signal AATAAA. The deduced polypeptide chain is composed of a 21-amino-acid leader peptide, followed by 578 amino acids of the mature protein. There are seven repetitive DNA elements (Alu and Kpn) in the introns and 3-flanking region. The sizes of the 15 alpha-albumin exons match closely those of the albumin, alpha-fetoprotein, and vitamin D-binding protein genes. The exons are symmetrically placed within the three domains of the individual proteins, and they share a characteristic codon splitting pattern that is conserved among members of the gene family. The results provide strong evidence that alpha-albumin belongs to, and most likely completes with, the serum albumin gene family. Based on structural similarity, alpha-albumin appears to be most closely related to alpha-fetoprotein. The complete structure of this family of four tandemly linked genes provides a well-characterized approximately 200 kb locus in the 4q subcentromeric region of the human genome.


Subject(s)
Albumins/genetics , Chromosomes, Human, Pair 4 , Multigene Family , Serum Albumin/genetics , Albumins/biosynthesis , Albumins/chemistry , Amino Acid Sequence , Bacteriophage lambda , Base Sequence , Centromere , Chromosome Mapping , Cloning, Molecular , Exons , Humans , Introns , Molecular Sequence Data , Protein Structure, Secondary , RNA Splicing , Regulatory Sequences, Nucleic Acid , Repetitive Sequences, Nucleic Acid , Restriction Mapping , Sequence Homology, Amino Acid , Serum Albumin/chemistry , TATA Box , Vitamin D-Binding Protein/chemistry , alpha-Fetoproteins/chemistry
7.
J Mol Biol ; 259(1): 113-9, 1996 May 31.
Article in English | MEDLINE | ID: mdl-8648639

ABSTRACT

The albumin gene family is comprised of four genes encoding: serum albumin (ALB), alpha-fetoprotein (AFP), alpha-albumin (ALF), and vitamin D-binding protein (DBP; also known as GC). The genes are regulated developmentally, expressed in the liver, and the proteins are secreted into the bloodstream. The GC gene, and the tandemly linked ALB and AFP genes, have been previously localized to human chromosome 4q11-13. Using techniques of fluorescence in situ hybridization to chromatin fibres, chromosome walking and DNA sequencing of genomic clones, we now report on the chromosomal location of the ALF gene and the organization of the entire gene family. The four genes are tandemly linked in the 4q sub-centromeric region: 5'ALB-5'AFP-5'ALF-5'GC3'-centromere, and hence are transcribed in the same, centromere-bound, direction. The linear arrangement of the four genes along the chromosome is not correlated with their temporal expression in the human ontogeny. It appears that GC is very close (and may be the gene proximal) to the centromere. The linear chromosomal arrangement of the four genes and the structural differences between them are congruent with the following evolutionary divergence of the gene family. Starting with the first duplication of an ancestral progenitor gene, a single evolutionary line led to the contemporary GC, leaving ALB/AFP/ALF on the other line of descent. The second duplication occurred in this ALB lineage, giving rise to ALB and the AFP/ALF progenitor, and the third, most recent one, gave rise to the AFP-ALF pair.


Subject(s)
Centromere/genetics , Chromosomes, Human, Pair 4 , Evolution, Molecular , Multigene Family , Serum Albumin/genetics , Transcription, Genetic , Albumins/chemistry , Albumins/genetics , Amino Acid Sequence , Base Sequence , Chromosome Mapping , Exons , Humans , In Situ Hybridization, Fluorescence/methods , Models, Genetic , Molecular Sequence Data , Serum Albumin/physiology , Vitamin D-Binding Protein/genetics , Vitamin D-Binding Protein/physiology , alpha-Fetoproteins/genetics , alpha-Fetoproteins/physiology
8.
Gene ; 162(2): 213-20, 1995 Sep 11.
Article in English | MEDLINE | ID: mdl-7557431

ABSTRACT

The chimpanzee (Pan troglodytes) alpha-fetoprotein (AFP)-encoding gene (AFP) spans 18,867 bp from the transcription start point to the polyadenylation site, and the nucleotide (nt) sequence reveals that the gene is composed of 15 exons, which are symmetrically placed within three domains of AFP. In addition, we report 3121 bp of 5'-flanking sequence and 4886 bp of 3'-flanking sequence. The entire 26,874 bp of contiguous DNA reported here was determined from three overlapping lambda phage clones. The deduced polypeptide chain is composed of a 19-amino-acid (aa) putative leader peptide, followed by 590 aa of the mature protein. The sequence of chimpanzee AFP was compared with those of the previously published human AFP [Gibbs et al., Biochemistry 26 (1987) 1332-1343] and gorilla AFP [Ryan et al., Genomics 9 (1991) 60-72]. At the aa level, the human AFP differs from the chimpanzee at 6 aa positions and from the gorilla at 4 aa positions; the chimpanzee and gorilla differ at 8 aa positions. There are four types of repetitive sequence elements (X, Alu, Xba and Kpn) in the introns and flanking regions of chimpanzee AFP, and they are located in orthologous positions in the human and gorilla AFP. However, one specific Alu and one Xba repeat in introns 4 and 7, respectively, found in human AFP, are absent from orthologous positions in chimpanzee and gorilla AFP. These two repeats represent human-specific novelties that arose from recent DNA transpositions in primate phylogeny.


Subject(s)
Genes , Gorilla gorilla/genetics , Pan troglodytes/genetics , alpha-Fetoproteins/genetics , Amino Acid Sequence , Animals , Base Sequence , Humans , Introns , Molecular Sequence Data , Phylogeny , Regulatory Sequences, Nucleic Acid , Restriction Mapping , Sequence Homology, Amino Acid
9.
Proc Natl Acad Sci U S A ; 91(8): 3413-7, 1994 Apr 12.
Article in English | MEDLINE | ID: mdl-7512733

ABSTRACT

The closely related serum albumin, alpha-fetoprotein, and vitamin D-binding proteins are derived from a common ancestor, which itself was the result of a triplication of an ancestral gene. We have aligned the sequences of these proteins against themselves to assess the degree to which the ancestral 3-fold symmetry has been retained; in a dot plot, relics of the molecular symmetry appear as a series of alignments parallel to the main diagonal. The decay of internal symmetry reflects the rate of change of a gene in a single line of evolutionary descent. We examined 11 serum albumins, 2 ceruloplasmins, 3 alpha-fetoproteins, and 3 vitamin D-binding proteins. We have found that ceruloplasmin evolves at the same rate in human and rat, whereas albumin, alpha-fetoprotein, and vitamin D-binding protein evolve at different rates. The human genes had the highest alignment scores, indicating the most preserved ancestral features. We conclude that the molecular clock may run at different rates for the same gene in different species.


Subject(s)
Biological Evolution , Ceruloplasmin/genetics , Genes , Multigene Family , Sequence Analysis/methods , Serum Albumin/genetics , Vitamin D-Binding Protein/genetics , alpha-Fetoproteins/genetics , Animals , Base Sequence , Biological Clocks , Humans , Mice , Molecular Sequence Data , Rats , Sheep
10.
Hum Mol Genet ; 2(8): 1271-88, 1993 Aug.
Article in English | MEDLINE | ID: mdl-8401509

ABSTRACT

As part of our efforts to construct a high-resolution physical map of human chromosome 4, we developed a systematic approach for efficiently generating large numbers of chromosome-specific sequence-tagged sites (STSs). In this paper, we describe how rate-limiting steps in our STS development were identified and overcome, and detail our current development strategy. We present information for 822 new human chromosome 4-specific STSs, including PCR amplification conditions and subchromosomal localization data, obtained by analysis of the STS with somatic cell hybrids containing different portions of human chromosome 4. Although most STSs presented here were developed from anonymous clones whose sequences were determined in this laboratory, several STSs were developed for genes and other DNA sequences that were previously mapped to chromosome 4. Our data indicate that the availability of DNA sequence for an STS locus, in addition to the sequences of the two PCR oligonucleotides, significantly increases the transfer of that STS by allowing investigators to select new oligonucleotides best suited to the standard conditions used in their laboratories.


Subject(s)
Chromosomes, Human, Pair 4 , Sequence Tagged Sites , Animals , Base Sequence , Chromosome Mapping , Cosmids , Cricetinae , DNA Primers , Humans , Hybrid Cells , Molecular Sequence Data , Plasmids , Polymerase Chain Reaction/methods , Repetitive Sequences, Nucleic Acid
11.
Genomics ; 16(3): 751-4, 1993 Jun.
Article in English | MEDLINE | ID: mdl-8325650

ABSTRACT

The sequence of the human Gc gene, including 4228 base pairs of the 5'-flanking region and 8514 base pairs of the 3' flanking region (55,136 in total), was determined from five overlapping lambda phage clones. The sequence spans 42,394 base pairs from the cap site to the polyadenylation site, and it reveals that the gene is composed of 13 exons, which are symmetrically placed within the three domains of the Gc protein. The first exon is partially untranslated, as is exon 12, which contains the termination codon TAG. Exon 13 is entirely untranslated, but contains the polyadenylation signal AATAAA. Ten central introns split the coding sequence between codon positions 2 and 3 and between codon positions 3 and 1 in an alternating pattern, exactly as has been observed in the structure of the albumin and alpha-fetoprotein genes. The Gc gene has several distinctive features which set it apart from the other members of the family. First, the gene is smaller by two exons, which results in a protein some 130 amino acids shorter than albumin or AFP. This decrease in size may result from the loss of two internal exons during the evolutionary history of the Gc gene. Second, exons 6, 8, 9, and 11 are smaller than their counterparts in albumin or AFP by a total of 8 codons (1, 4, 1, and 2, respectively). Although the mRNA and protein expressed from the Gc gene are significantly smaller, the gene itself is about 2.5 times larger than the other genes of the family.(ABSTRACT TRUNCATED AT 250 WORDS)


Subject(s)
Albumins/genetics , Genetic Variation , Multigene Family , Vitamin D-Binding Protein/genetics , Base Sequence , Cloning, Molecular , DNA , Exons , Humans , Introns , Molecular Sequence Data , Restriction Mapping
12.
Proc Natl Acad Sci U S A ; 90(5): 1872-6, 1993 Mar 01.
Article in English | MEDLINE | ID: mdl-8446601

ABSTRACT

We have identified four genetic novelties that are fixed in specific primate lineages and hence can serve as phylogenetic time markers. One Alu DNA repeat is present in the human lineage but is absent from the great apes. Another Alu DNA repeat is present in the gorilla lineage but is absent from the human, chimpanzee, and orangutan. A progenitor Xba1 element is present in the human, chimpanzee, gorilla, and orangutan, but only in the human lineage did it give rise to a transposed progeny, Xba2. The saltatory appearance of Xba2 is an example of a one-time event in the evolutionary history of a species. The enolase pseudogene, known to be present as a single copy in the human, was found to be present in four other primates, including the baboon, an Old World monkey. Using the accepted value of 5 x 10(-9) nucleotide substitutions per site per year as the evolutionary rate for pseudogenes, we calculated that the enolase pseudogene arose approximately 14 million years ago. The calculated age for this pseudogene and its presence in the baboon are incongruent with each other, since Old World monkeys are considered to have diverged from the hominid lineage some 30 million years ago. Thus the rate of evolution in the enolase pseudogene is only about 2.5 x 10(-9) substitutions per site per year, or half the rate in other pseudogenes. It is concluded that rates of substitution vary between species, even for similar DNA elements such as pseudogenes. We submit that new DNA repeats arise in the genomes of species in irreversible and punctuated events and hence can be used as molecular time markers to decipher phylogenies.


Subject(s)
Primates/genetics , Repetitive Sequences, Nucleic Acid , Animals , Base Sequence , Biological Evolution , Genes , Globins/genetics , Gorilla gorilla/genetics , Humans , Molecular Sequence Data , Pan troglodytes/genetics , Polymerase Chain Reaction , Pongo pygmaeus/genetics , Pseudogenes
14.
Genomics ; 9(1): 60-72, 1991 Jan.
Article in English | MEDLINE | ID: mdl-1706310

ABSTRACT

The sequence of the gorilla alpha-fetoprotein gene, including 869 base pairs of the 5' flanking region and 4892 base pairs of the 3' flanking region (24,607 in total), was determined from two overlapping lambda phage clones. The sequence extends 18,846 base pairs from the Cap site to the polyadenylation site, and it reveals that the gene is composed of 15 exons, which are symmetrically placed within three domains of alpha-fetoprotein. The deduced polypeptide chain is composed of a 19-amino-acid leader peptide, followed by 590 amino acids of the mature protein. The RNA polymerase II binding site, TATAAAA, and the promoter element, CCAAC, are positioned at -21 and -65 from the Cap site, respectively. The polyadenylation signal, AATAAA, is located in the last exon, which is untranslated. The sequence for the gorilla alpha-fetoprotein gene was compared with that of the previously published human alpha-fetoprotein gene (P. E. M. Gibbs, R. Zielinski, C. Boyd, and A. Dugaiczyk, 1987, Biochemistry 26: 1332-1343). Four types of repetitive sequence elements were found in identical positions in both species. However, one Alu and one Xba DNA repeat within introns 4 and 7, respectively, of the human gene are absent from orthologous positions in the gorilla. The Alu and the Xba DNA repeats probably emerged in the human genome after the human/gorilla divergence and became established novelties in the human lineage. There are 363/21,523 mutational changes between human and gorilla, amounting to 1.69% DNA divergence between the two primate species. The value of 1.69% is lower than the 2.27% obtained from melting temperatures of hybrids between human and gorilla genomic DNA (C. G. Sibley and J. E. Ahlquist, 1984, J. Mol. Evol. 26: 99-121). At the protein level, Homo sapiens differs from Gorilla gorilla only at 4 of 609 amino acid positions (0.66%) in the alpha-fetoprotein sequence. This difference signifies a lower rate of molecular divergence for the alpha-fetoprotein gene in primates, as compared to rodents.


Subject(s)
Genetic Variation , Gorilla gorilla/genetics , alpha-Fetoproteins/genetics , Amino Acid Sequence , Animals , Base Sequence , Binding Sites , Exons , Genes , Humans , Molecular Sequence Data , Poly A , Promoter Regions, Genetic , Repetitive Sequences, Nucleic Acid , Restriction Mapping , Sequence Homology, Nucleic Acid
15.
Proc Natl Acad Sci U S A ; 86(23): 9360-4, 1989 Dec.
Article in English | MEDLINE | ID: mdl-2480599

ABSTRACT

We discovered the presence of an Alu and an Xba repetitive DNA element within introns 4 and 7, respectively, of the human alpha-fetoprotein (AFP) gene; these elements are absent from the same gene in the gorilla. The Alu element is flanked by 12-base-pair direct repeats, AGGATGTTGTGG ... (Alu) ... AGGATGTTGTGG, which presumably arose by way of duplication of the intronic target site AGGATGTTGTGG at the time of the Alu insertion. In the gorilla, only a single copy of the unoccupied target site is present, which is identical to the terminal repeat flanking the human Alu element. There are two copies of an Xba repeat in the human AFP gene, apparently the only two in the genome. Xba1 and Xba2, located within introns 8 and 7, respectively, differ from each other at 3 of 303 positions. Xba1 is referred to as the old (ancestral) repeat because it lacks direct repeats. The new (derived) Xba2 is flanked by direct repeats, TTTCTTTTT ... (Xba) ... TTTCTTCTT, and is thought to have arisen as a result of transposition of Xba1. The ancestral Xba1 and a single copy of the Xba2 target site are present at orthologous positions in the gorilla, but the new Xba2 is absent. We conclude that the Alu and Xba DNA repeats emerged in the human genome at a time postdating the human-gorilla divergence and became established as genetic novelties in the human lineage. We submit that the chronology of divergence of primate lines of evolution can be correlated with the timing of insertion of new DNA repeats into the genomes of those primates.


Subject(s)
DNA/genetics , Genes , Gorilla gorilla/genetics , Hominidae/genetics , Phylogeny , Repetitive Sequences, Nucleic Acid , alpha-Fetoproteins/genetics , Animals , Base Sequence , Cloning, Molecular , DNA Transposable Elements , Genomic Library , Humans , Molecular Sequence Data , Restriction Mapping , Sequence Homology, Nucleic Acid
16.
Proc Natl Acad Sci U S A ; 85(7): 2125-9, 1988 Apr.
Article in English | MEDLINE | ID: mdl-3353369

ABSTRACT

We have identified a structural defect in the serum albumin gene in human analbuminemia. Sequence determination of 1.1 kilobases (kb) of the 5' regulatory region and of 6 kb across exonic regions revealed a single AG-to-GG mutation within the 3' splice site of intron 6 in the defective gene of an analbuminemic individual. In an in vitro assay on the RNA transcript this mutation causes a defect in splicing of the intron 6 sequence and in subsequent ligation of the exon 6-exon 7 sequences. Using polymerase-amplified genomic DNA and allele-specific oligodeoxynucleotide probes, we have also shown that the sequence of this intron 6/exon 7 splice junction is normal in a different, unrelated analbuminemic individual. Analbuminemia in humans is therefore the result of one of multiple defects in our genome.


Subject(s)
RNA Splicing , RNA, Messenger/metabolism , Serum Albumin/deficiency , Base Sequence , Genes , Humans , Molecular Sequence Data , Mutation , Serum Albumin/genetics
17.
Mol Biol Evol ; 4(4): 364-79, 1987 Jul.
Article in English | MEDLINE | ID: mdl-2451776

ABSTRACT

We have recently determined complete DNA sequences for the human albumin and alpha-fetoprotein [AFP] genes and thus have identified their detailed structures. Each is composed of three domains of four exons, three of which are internal and one of which is a domain-linking exon. Equivalent exons in each domain show sufficient sequence and structural similarity to be considered homologous; additional unique exons at each end of the gene show no similarity to the internal triplicated structures. Since earlier, conflicting evolutionary models were based on analysis of single gene structures, we derived from five genes a series of consensus sequences representing the three internal exons as well as the domain-linking exon. The five genes were human and rat albumin and human, mouse, and rat AFP genes. Structurally equivalent exons of the different domains are shown to have arisen from a single exon in a one-domain precursor. Exons that bridge the domains arose from an unequal crossover that fused two exons of the precursor. Our model suggests that part of the coding sequence of the one-domain precursor may have been derived from an intron, by way of loss of a splice site. The consensus sequences were used to propose an intron-exon structure for the related gene encoding the serum vitamin D-binding protein (DBP). DBP is truncated relative to albumin and AFP, and we submit that this results from deletion of two internal exons in the third domain of the gene rather than from premature termination of the coding sequence.


Subject(s)
Genes , Serum Albumin/genetics , Vitamin D-Binding Protein/genetics , Amino Acid Sequence , Animals , Base Sequence , Exons , Humans , Mice , Molecular Sequence Data , Rats , Species Specificity , alpha-Fetoproteins/genetics
18.
Biochemistry ; 26(5): 1332-43, 1987 Mar 10.
Article in English | MEDLINE | ID: mdl-2436661

ABSTRACT

The human alpha-fetoprotein gene spans 19,489 base pairs from the putative "Cap" site to the polyadenylation site. It is composed of 15 exons separated by 14 introns, which are symmetrically placed within the three domains of alpha-fetoprotein. In the 5' region, a putative TATAAA box is at position -21, and a variant sequence, CCAAC, of the common CAT box is at -65. Enhancer core sequences GTGGTTTAAAG are found in introns 3 and 4, and several copies of glucocorticoid response sequences AGATACAGTA are found on the template strand of the gene. There are six polymorphic sites within 4690 base pairs of contiguous DNA derived from two allelic alpha-fetoprotein genes. This amounts to a measured polymorphic frequency of 0.13%, or 6.4 X 10(-4)/site, which is about 5-10 times lower than values estimated from studies on polymorphic restriction sites in other regions of the human genome. There are four types of repetitive sequence elements in the introns and flanking regions of the human alpha-fetoprotein gene. At least one of these is apparently a novel structure (designated Xba) and is found as a pair of direct repeats, with one copy in intron 7 and the other in intron 8. It is conceivable that within the last 2 million years the copy in intron 8 gave rise to the repeat in intron 7. Their present location on both sides of exon 8 gives these sequences a potential for disrupting the functional integrity of the gene in the event of an unequal crossover between them. There are three Alu elements, one of which is in intron 4; the others are located in the 3' flanking region. A solitary Kpn repeat is found in intron 3. The Xba and Kpn repeats were only detected by complete sequencing of the introns. Neither X, Xba, nor Kpn elements are present in the related human albumin gene, whereas Alu's are present in different positions. From phylogenetic evidence, it appears that Alu elements were inserted into the alpha-fetoprotein gene at some time postdating the mammalian radiation 85 million years ago.


Subject(s)
Repetitive Sequences, Nucleic Acid , alpha-Fetoproteins/genetics , Base Sequence , Biological Evolution , Genes , Genes, Regulator , Humans , Multigene Family , Poly A/genetics , Polymorphism, Genetic
19.
Mol Biol Evol ; 4(1): 1-9, 1987 Jan.
Article in English | MEDLINE | ID: mdl-2451775

ABSTRACT

The human albumin-alpha-fetoprotein genomic domain contains 13 repetitive DNA elements randomly distributed throughout the symmetrical structures of these genes. These repeated sequences are located at different sites within the two genes. The human albumin gene contains five Alu elements within four of its 14 intervening sequences. Two of these repeats are located in intron 2, and the remaining three are located in introns 7, 8, and 11. The human alpha-fetoprotein gene contains three of these Alu elements, one in intron 4 and the remaining two in the 3'-untranslated region. In addition, the human alpha-fetoprotein gene contains a Kpn repeat and two classes of novel repeats that are absent from the human albumin gene. Six of the Alu elements within the two genes are bound by short direct repeats that harbor five base substitutions in 120 possible positions (60 bp times 2 termini). The absence of Alu repeats from analogous positions in rodents indicates that these repeats invaded the albumin-alpha-fetoprotein domain less than 85 Myr ago (the time of mammalian radiation). Furthermore, considering the conservation of terminal repeats flanking the Alu sequences of the albumin-alpha-fetoprotein domain (0.042 changes per site), we submit that the average time of Alu insertion into this gene family could have been as recently as 15-30 Myr ago.


Subject(s)
Genes , Repetitive Sequences, Nucleic Acid , Serum Albumin/genetics , alpha-Fetoproteins/genetics , Base Sequence , Humans , Molecular Sequence Data
20.
J Biol Chem ; 261(15): 6747-57, 1986 May 25.
Article in English | MEDLINE | ID: mdl-3009475

ABSTRACT

The human albumin gene spans 16,961 nucleotides from the putative "Cap" site to the first poly(A) addition site. It is split into 15 exons by 14 intervening sequences which are symmetrically placed within the three domains of albumin. The 5' region is highly conserved up to position -250 and contains the putative TATA (-32) and CAT (-88) boxes. A consensus 5' splice sequence reads /GTAGAGT while the 3' splice sequence is pyrimidine rich and contains CTAG/ at the splice junction. The gene contains three polyadenylation signals, and this 3' region presumably arose by triplication of a shorter fragment prior to mammalian radiation. The albumin gene exhibits a high degree of DNA polymorphism and appears to have been recently invaded by Alu repetitive sequences.


Subject(s)
Chromosomes, Human, 4-5 , Genes , Serum Albumin/genetics , Amino Acid Sequence , Animals , Base Sequence , Chickens , Cloning, Molecular , DNA/analysis , DNA Restriction Enzymes , Deoxyribonuclease EcoRI , Humans , Nucleic Acid Conformation , Promoter Regions, Genetic , Protein Conformation , RNA Splicing , Rats , Species Specificity
SELECTION OF CITATIONS
SEARCH DETAIL
...