Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 11 de 11
Filter
Add more filters










Publication year range
1.
Proc Natl Acad Sci U S A ; 97(3): 1172-7, 2000 Feb 01.
Article in English | MEDLINE | ID: mdl-10655503

ABSTRACT

The identification of the cystic fibrosis transmembrane conductance regulator gene (CFTR) in 1989 represents a landmark accomplishment in human genetics. Since that time, there have been numerous advances in elucidating the function of the encoded protein and the physiological basis of cystic fibrosis. However, numerous areas of cystic fibrosis biology require additional investigation, some of which would be facilitated by information about the long-range sequence context of the CFTR gene. For example, the latter might provide clues about the sequence elements responsible for the temporal and spatial regulation of CFTR expression. We thus sought to establish the sequence of the chromosomal segments encompassing the human CFTR and mouse Cftr genes, with the hope of identifying conserved regions of biologic interest by sequence comparison. Bacterial clone-based physical maps of the relevant human and mouse genomic regions were constructed, and minimally overlapping sets of clones were selected and sequenced, eventually yielding approximately 1.6 Mb and approximately 358 kb of contiguous human and mouse sequence, respectively. These efforts have produced the complete sequence of the approximately 189-kb and approximately 152-kb segments containing the human CFTR and mouse Cftr genes, respectively, as well as significant amounts of flanking DNA. Analyses of the resulting data provide insights about the organization of the CFTR/Cftr genes and potential sequence elements regulating their expression. Furthermore, the generated sequence reveals the precise architecture of genes residing near CFTR/Cftr, including one known gene (WNT2/Wnt2) and two previously unknown genes that immediately flank CFTR/Cftr.


Subject(s)
Cystic Fibrosis Transmembrane Conductance Regulator/genetics , Genes , Mice/genetics , Animals , Humans , Mice, Inbred C57BL , Molecular Sequence Data , Regulatory Sequences, Nucleic Acid , Sequence Alignment , Sequence Homology, Nucleic Acid , Species Specificity
2.
Nature ; 402(6761): 489-95, 1999 Dec 02.
Article in English | MEDLINE | ID: mdl-10591208

ABSTRACT

Knowledge of the complete genomic DNA sequence of an organism allows a systematic approach to defining its genetic components. The genomic sequence provides access to the complete structures of all genes, including those without known function, their control elements, and, by inference, the proteins they encode, as well as all other biologically important sequences. Furthermore, the sequence is a rich and permanent source of information for the design of further biological studies of the organism and for the study of evolution through cross-species sequence comparison. The power of this approach has been amply demonstrated by the determination of the sequences of a number of microbial and model organisms. The next step is to obtain the complete sequence of the entire human genome. Here we report the sequence of the euchromatic part of human chromosome 22. The sequence obtained consists of 12 contiguous segments spanning 33.4 megabases, contains at least 545 genes and 134 pseudogenes, and provides the first view of the complex chromosomal landscapes that will be found in the rest of the genome.


Subject(s)
Chromosomes, Human, Pair 22 , Human Genome Project , Sequence Analysis, DNA , Animals , Chromosome Mapping/methods , DNA , Gene Dosage , Humans , Mice , Molecular Sequence Data , Repetitive Sequences, Nucleic Acid , Species Specificity
3.
Am J Hum Genet ; 63(2): 625-37, 1998 Aug.
Article in English | MEDLINE | ID: mdl-9683604

ABSTRACT

Chromosome rearrangements associated with neoplasms provide a rich resource for definition of the pathways of tumorigenesis. The power of comparative genome hybridization (CGH) to identify novel genes depends on the existence of suitable markers, which are lacking throughout most of the genome. We now report a general approach that translates CGH data into higher-resolution genomic-clone data that are then used to define the genes located in aneuploid regions. We used CGH to study 33 thyroid-tumor DNAs and two tumor-cell-line DNAs. The results revealed amplifications of chromosome band 2p21, with less-intense amplification on 2p13, 19q13.1, and 1p36 and with least-intense amplification on 1p34, 1q42, 5q31, 5q33-34, 9q32-34, and 14q32. To define the 2p21 region amplified, a dense array of 373 FISH-mapped chromosome 2 bacterial artificial chromosomes (BACs) was constructed, and 87 of these were hybridized to a tumor-cell line. Four BACs carried genomic DNA that was amplified in these cells. The maximum amplified region was narrowed to 3-6 Mb by multicolor FISH with the flanking BACs, and the minimum amplicon size was defined by a contig of 420 kb. Sequence analysis of the amplified BAC 1D9 revealed a fragment of the gene, encoding protein kinase C epsilon (PKCepsilon), that was then shown to be amplified and rearranged in tumor cells. In summary, CGH combined with a dense mapped resource of BACs and large-scale sequencing has led directly to the definition of PKCepsilon as a previously unmapped candidate gene involved in thyroid tumorigenesis.


Subject(s)
Chromosome Aberrations , Chromosome Mapping , Chromosomes, Human , Thyroid Neoplasms/genetics , Adenocarcinoma, Follicular/genetics , Adenoma/genetics , Carcinoma/genetics , Carcinoma, Papillary/genetics , Chromosomes, Bacterial , Chromosomes, Human, Pair 1 , Chromosomes, Human, Pair 2 , DNA, Neoplasm/genetics , Gene Amplification , Genetic Markers , Humans , In Situ Hybridization, Fluorescence , Karyotyping , Tumor Cells, Cultured
4.
Hum Mol Genet ; 6(9): 1483-9, 1997 Sep.
Article in English | MEDLINE | ID: mdl-9285785

ABSTRACT

PKD1 is the major locus of the common genetic disorder autosomal dominant polycystic kidney disease (ADPKD). Analysis of the predicted protein sequence of the human PKD1 gene, polycystin, shows a large molecule with a unique arrangement of extracellular domains and multiple putative transmembrane regions. The precise function of polycystin remains unclear with a paucity of mutations to define key structural and functional domains. To refine the structure of this protein we have cloned the genomic region encoding the Fugu PKD1 gene. Fugu PKD1 spans 36 kb of genomic DNA and has greater complexity with 54 exons compared with 46 in man. Comparative analysis of the predicted protein sequences shows a lower level of homology than in similar studies with identity of 40 and 59% similarity. However key structural motifs including leucine rich repeats (LRR), a C-type lectin and LDL-A like domains and 16 PKD repeats are maintained. A region of homology with the sea urchin REJ protein was also confirmed in Fugu but found to extend over 1000 amino acids. Several highly conserved intra- and extra-cellular regions, with no known sequence homologies, that are likely to be of functional importance were detected. The likely structure of the membrane associated region has been refined with similarity to the PKD2 protein and voltage gated Ca2+ and Na+ channels highlighted over part of this area. The overall protein structure has therefore been clarified and this comparative analysis derived structure will form the basis for the functional study of polycystin and its individual domains.


Subject(s)
Membrane Glycoproteins/genetics , Polycystic Kidney, Autosomal Dominant/genetics , Proteins/genetics , Amino Acid Sequence , Animals , Conserved Sequence , Fishes, Poisonous/genetics , Humans , Membrane Glycoproteins/isolation & purification , Molecular Sequence Data , Protein Structure, Secondary , Proteins/isolation & purification , Sequence Analysis, DNA , Sequence Homology, Amino Acid , Species Specificity , TRPP Cation Channels
5.
Nucleic Acids Res ; 25(15): 2960-6, 1997 Aug 01.
Article in English | MEDLINE | ID: mdl-9224593

ABSTRACT

Representation of subcloned Caenorhabditis elegans and human DNA sequences in both M13 and pUC sequencing vectors was determined in the context of large scale genomic sequencing. In many cases, regions of subclone under-representation correlated with the occurrence of repeat sequences, and in some cases the under-representation was orientation specific. Factors which affected subclone representation included the nature and complexity of the repeat sequence, as well as the length of the repeat region. In some but not all cases, notable differences between the M13 and pUC subclone distributions existed. However, in all regions lacking one type of subclone (either M13 or pUC), an alternate subclone was identified in at least one orientation. This suggests that complementary use of M13 and pUC subclones would provide the most comprehensive subclone coverage of a given genomic sequence.


Subject(s)
Cloning, Molecular , Genetic Vectors , Animals , Base Sequence , Binding Sites , Caenorhabditis elegans/genetics , Cosmids , Genome , Humans
6.
Genome Res ; 7(5): 551-63, 1997 May.
Article in English | MEDLINE | ID: mdl-9149950

ABSTRACT

Repetitive DNA is a significant component of eukaryotic genomes. We have developed a strategy to efficiently and accurately sequence repetitive DNA in the nematode Caenorhabditis elegans using integrated artificial transposons and automated fluorescent sequencing. Mapping and assembly tools represent important components of this strategy and facilitate sequence assembly in complex regions. We have applied the strategy to several cosmid assembly gaps resulting from repetitive DNA and have accurately recovered the sequences of these regions. Analysis of these regions revealed six novel transposon-like repetitive elements, IR-1, IR-2, IR-3, IR-4, IR-5, and TR-1. Each of these elements represents a middle-repetitive DNA family in C. elegans containing at least 3-140 copies per genome. Copies of IR-1, IR-2, IR-4, and IR-5 are located on all (or most) of the six nematode chromosomes, whereas IR-3 is predominantly located on chromosome X. These elements are almost exclusively interspersed between predicted genes or within the predicted introns of these genes, with the exception of a single IR-5 element, which is located within a predicted exon. IR-1, IR-2, and IR-3 are flanked by short sequence duplications resembling the target site duplications of transposons. We have established a website database (http:(/)/www.welch.jhu.edu/approximately devine/RepDNAdb.html) to track and cross-reference these transposon-like repetitive elements that contains detailed information on individual element copies and provides links to appropriate GenBank records. This set of tools may be used to sequence, track, and study repetitive DNA in model organisms and humans.


Subject(s)
Caenorhabditis elegans/genetics , DNA Transposable Elements/genetics , Eukaryotic Cells/physiology , Repetitive Sequences, Nucleic Acid , Sequence Analysis, DNA/methods , Animals , Automation , Chromosome Mapping/methods , Computer Communication Networks , Cosmids/genetics , Databases, Factual , Fluorescence , Humans , Molecular Sequence Data , Multigene Family , X Chromosome
7.
Genome Res ; 6(9): 807-28, 1996 Sep.
Article in English | MEDLINE | ID: mdl-8889549

ABSTRACT

We report the generation of 319,311 single-pass sequencing reactions (known as expressed sequence tags, or ESTs) obtained from the 5' and 3' ends of 194,031 human cDNA clones. Our goal has been to obtain tag sequences from many different genes and to deposit these in the publicly accessible Data Base for Expressed Sequence Tags. Highly efficient automatic screening of the data allows deposition of the annotated sequences without delay. Sequences have been generated from 26 oligo(dT) primed directionally cloned libraries, of which 18 were normalized. The libraries were constructed using mRNA isolated from 17 different tissues representing three developmental states. Comparisons of a subset of our data with nonredundant human mRNA and protein data bases show that the ESTs represent many known sequences and contain many that are novel. Analysis of protein families using Hidden Markov Models confirms this observation and supports the contention that although normalization reduces significantly the relative abundance of redundant cDNA clones, it does not result in the complete removal of members of gene families.


Subject(s)
Gene Library , Genome, Human , Sequence Tagged Sites , Adult , Cloning, Molecular , DNA, Complementary , Databases, Factual , Female , Humans , Infant , Introns , Markov Chains , Molecular Sequence Data , Pregnancy , Proteins/genetics , RNA, Messenger/genetics
8.
Genomics ; 27(1): 67-82, 1995 May 01.
Article in English | MEDLINE | ID: mdl-7665185

ABSTRACT

The complete human BCR gene (152-141 nt) on chromosome 22 and greater than 80% of the human ABL gene (179-512 nt) on chromosome 9 have been sequenced from mapped cosmid and plasmid clones via a shotgun strategy. Because these two chromosomes are translocated with breakpoints within the BCR and ABL genes in Philadelphia chromosome-positive leukemias, knowledge of these sequences also might provide insight into the validity of various theories of chromosomal rearrangements. Comparison of these genes with their cDNA sequences reveal the positions of 23 BCR exons and putative alternative BCR first and second exons, as well as the common ABL exons 2-11, respectively. Additionally, these regions include the alternative ABL first exons 1b and 1a, a new gene 5' to the first ABL exon, and an open reading frame with homology to an EST within the BCR fourth intron. Further analysis reveals an Alu homology of 38.83 and 39.35% for the BCR and ABL genes, respectively, with other repeat elements present to a lesser extent. Four new Philadelphia chromosome translocation breakpoints from chronic myelogenous leukemia patients also were sequenced, and the positions of these and several other previously sequenced breakpoints now have been mapped precisely, although no consistent breakpoint features immediately were apparent. Comparative analysis of genomic sequences encompassing the murine homologues to the human ABL exons 1b and 1a, as well as regions encompassing the ABL exons 2 and 3, reveals that although there is a high degree of homology in their corresponding exons and promoter regions, these two vertebrate species show a striking lack of homology outside these regions.


Subject(s)
Genes, abl , Genes , Oncogene Proteins/genetics , Philadelphia Chromosome , Protein-Tyrosine Kinases , Proto-Oncogene Proteins , Animals , Base Sequence , Chromosomes, Human, Pair 22 , Chromosomes, Human, Pair 9 , DNA, Complementary/genetics , Humans , Leukemia, Myelogenous, Chronic, BCR-ABL Positive/genetics , Mice/genetics , Minisatellite Repeats , Molecular Sequence Data , Proto-Oncogene Proteins c-bcr , Sequence Alignment , Sequence Homology, Nucleic Acid , Species Specificity
9.
Genomics ; 26(2): 334-44, 1995 Mar 20.
Article in English | MEDLINE | ID: mdl-7601460

ABSTRACT

Proteins with seven transmembrane segments (7TM) define a superfamily of receptors (7TM receptors) sharing the same topology: an extracellular N-terminus, three extramembranous loops on either side of the plasma membrane, and a cytoplasmic C-terminal tail. Upon ligand binding, cytoplasmic portions of the activated receptor interact with heterotrimeric G-coupled proteins to induce various second messengers. A small group, recently recognized on the basis of homologous primary amino acid sequences, comprises receptors to hormones of the secretin/vasoactive intestinal peptide/glucagon family, parathyroid hormone and parathyroid hormone-related peptides, growth hormone-releasing factor, corticotropin-releasing factor, and calcitonin. A cDNA, extracted from a neuroectodermal cDNA library, was predicted to encode a new 886-amino-acid protein with three distinct domains. The C-terminal third contains the seven hydrophobic segments and characteristic residues that allow the protein to be readily aligned with the various hormone receptors in the family. Six egf-like modules, at the N-terminus of the predicted mature protein, are separated from the transmembrane segments by a serine/threonine-rich domain, a feature reminiscent of mucin-like, single-span, integral membrane glycoproteins with adhesive properties. Because of its unique characteristics, this putative egf module-containing, mucin-like hormone receptor has been named EMR1. Southern analysis of a panel of somatic cell hybrids and fluorescence in situ hybridization have assigned the EMR1 gene to human chromosome 19p13.3.


Subject(s)
Genes , Membrane Glycoproteins/genetics , Mucins/genetics , Multigene Family , Amino Acid Sequence , Animals , Base Sequence , Calcium-Binding Proteins , Chromosomes, Human, Pair 19 , Consensus Sequence , Cricetinae , DNA, Complementary/genetics , Epidermal Growth Factor/chemistry , Humans , Hybrid Cells , Membrane Glycoproteins/chemistry , Molecular Sequence Data , Mucins/chemistry , Protein Conformation , RNA, Heterogeneous Nuclear/genetics , Receptors, Cell Surface/chemistry , Receptors, G-Protein-Coupled , Sequence Alignment , Sequence Homology, Amino Acid
10.
Plasmid ; 32(2): 222-7, 1994 Sep.
Article in English | MEDLINE | ID: mdl-7846147

ABSTRACT

The restriction enzyme and genetic map of the antibiotic-resistance region of plasmid pSa is related to Tn21 integrons by the insertion of 5.4 kb containing a chloramphenicol resistance gene (catII) and a 1.1-kb direct repeat. We report here the nucleotide sequences of both copies of the repeat with adjoining sequences. They were identical for 1065 bp and contained the entire coding sequence of the sulfanilamide resistance gene, sulI. Since only the first copy of the repeat confers sulfonamide resistance, this leads to the conclusion that no promoter was available for the second copy. The sequence of the pSa sulI gene was identical to several published sulI sequences from other plasmids. The first junction point of the catII-containing insert was identical to the sequence for pDG0100; the second junction occurred farther into the 3'-conserved segment of integrons than does that of pDG0100. A recent report of these junction sequences for pSa and pDG0100 differs from our sequences by one nucleotide. Two additional differences were an insert of 41 bases and a single base insertion between sulI and ORF341 in our sequence. Our sequenced regions have been assigned GenBank Accession Nos. UO4277 and UO4278 for the first and second sulI genes of pSa, respectively.


Subject(s)
Bacterial Proteins/genetics , DNA, Bacterial/genetics , Escherichia coli Proteins , Genes, Bacterial , Plasmids/genetics , Repetitive Sequences, Nucleic Acid , Amino Acid Sequence , Base Sequence , Chloramphenicol/pharmacology , Drug Resistance, Microbial/genetics , Molecular Sequence Data , Promoter Regions, Genetic , Shigella/genetics , Sulfanilamide , Sulfanilamides/pharmacology , Sulfonamides/pharmacology
11.
Genet Anal Tech Appl ; 11(5-6): 181-6, 1994.
Article in English | MEDLINE | ID: mdl-7710784

ABSTRACT

The complete nucleotide sequence of the 16,009-bp SacBII Kan domain of the P1 pAD10-SacBII cloning vector and the sequences of three cosmid cloning vectors, pTCF (7941 bp), svPHEP (9201 bp), and LAWRIST16 (5194 bp) have been determined. A modified diatomaceous earth (Prep-A-Gene)-based procedure, which rapidly yields highly supercoiled double-stranded DNA from recombinant P1 and cosmid clones suitable for generating shotgun libraries, also has been developed. The isolated recombinant DNAs were physically sheared to generate 1- to 2-kb fragments that then were blunt-ended and subcloned into double-stranded pUC-based sequencing vectors. The double-stranded sequencing templates were isolated by an alkaline lysis method and subjected to Taq polymerase catalyzed fluorescent end-labeled primer cycle sequencing. After shotgun sequence assembly, contig gaps were closed and ambiguities were resolved via Sequenase catalyzed fluorescent dye-terminator sequencing.


Subject(s)
Bacteriophage P1/genetics , Cosmids/genetics , Genetic Vectors/genetics , Hexosyltransferases , Kanamycin Resistance/genetics , Sequence Analysis, DNA/methods , Chromosomes, Human, Pair 22 , Chromosomes, Human, Pair 9 , Cloning, Molecular , DNA/genetics , DNA/isolation & purification , DNA, Recombinant/genetics , DNA, Recombinant/isolation & purification , DNA, Superhelical/genetics , DNA, Superhelical/isolation & purification , DNA, Viral/genetics , DNA, Viral/isolation & purification , DNA-Directed DNA Polymerase , Diatomaceous Earth , Genes, abl , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...