Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 25
Filter
Add more filters











Publication year range
2.
Nature ; 409(6822): 824-6, 2001 Feb 15.
Article in English | MEDLINE | ID: mdl-11236998

ABSTRACT

There are a number of ways to investigate the structure, function and evolution of the human genome. These include examining the morphology of normal and abnormal chromosomes, constructing maps of genomic landmarks, following the genetic transmission of phenotypes and DNA sequence variations, and characterizing thousands of individual genes. To this list we can now add the elucidation of the genomic DNA sequence, albeit at 'working draft' accuracy. The current challenge is to weave together these disparate types of data to produce the information infrastructure needed to support the next generation of biomedical research. Here we provide an overview of the different sources of information about the human genome and how modern information technology, in particular the internet, allows us to link them together.


Subject(s)
Genome, Human , Human Genome Project , Amino Acid Sequence , Chromosome Mapping , Computational Biology , Genes , Genetic Variation , Humans , Internet , Molecular Sequence Data , Sequence Analysis, DNA
3.
Nucleic Acids Res ; 29(1): 11-6, 2001 Jan 01.
Article in English | MEDLINE | ID: mdl-11125038

ABSTRACT

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources that operate on the data in GenBank and a variety of other biological data made available through NCBI's Web site. NCBI data retrieval resources include Entrez, PubMed, LocusLink and the Taxonomy Browser. Data analysis resources include BLAST, Electronic PCR, OrfFinder, RefSeq, UniGene, HomoloGene, Database of Single Nucleotide Polymorphisms (dbSNP), Human Genome Sequencing, Human MapViewer, GeneMap'99, Human-Mouse Homology Map, Cancer Chromosome Aberration Project (CCAP), Entrez Genomes, Clusters of Orthologous Groups (COGs) database, Retroviral Genotyping Tools, Cancer Genome Anatomy Project (CGAP), SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheri-tance in Man (OMIM), the Molecular Modeling Database (MMDB) and the Conserved Domain Database (CDD). Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at: http://www.ncbi.nlm.nih. gov.


Subject(s)
Biotechnology , Databases, Factual , Information Services , Animals , Gene Expression Profiling , Genome , Genome, Human , Humans , Information Storage and Retrieval , Internet , Molecular Biology , National Institutes of Health (U.S.) , National Library of Medicine (U.S.) , Phenotype , Sequence Alignment , United States
4.
Genome Res ; 10(7): 1051-60, 2000 Jul.
Article in English | MEDLINE | ID: mdl-10899154

ABSTRACT

We have constructed a public gene expression data repository and online data access and analysis, WWW and FTP sites for serial analysis of gene expression (SAGE) data. The WWW and FTP components of this resource, SAGEmap, are located at http://www.ncbi.nlm.nih. gov/sage and ftp://ncbi.nlm.nih.gov/pub/sage, respectively. We herein describe SAGE data submission procedures, the construction and characteristics of SAGE tags to gene assignments, the derivation and use of a novel statistical test designed specifically for differential-type analyses of SAGE data, and the organization and use of this resource.


Subject(s)
Databases, Factual , Gene Expression/genetics , Internet , Female , Gene Library , Humans , Male , Sequence Analysis, DNA/methods , Sequence Tagged Sites , Signal Processing, Computer-Assisted
6.
Genome Res ; 10(3): 350-64, 2000 Mar.
Article in English | MEDLINE | ID: mdl-10720576

ABSTRACT

This paper describes a fast and scalable strategy for constructing a radiation hybrid (RH) map from data on different RH panels. The maps on each panel are then integrated to produce a single RH map for the genome. Recurring problems in using maps from several sources are that the maps use different markers, the maps do not place the overlapping markers in same order, and the objective functions for map quality are incomparable. We use methods from combinatorial optimization to develop a strategy that addresses these issues. We show that by the standard objective functions of obligate chromosome breaks and maximum likelihood, software for the traveling salesman problem produces RH maps with better quality much more quickly than using software specifically tailored for RH mapping. We use known algorithms for the longest common subsequence problem as part of our map integration strategy. We demonstrate our methods by reconstructing and integrating maps for markers typed on the Genebridge 4 (GB4) and the Stanford G3 panels publicly available from the RH database. We compare map quality of our integrated map with published maps for GB4 panel and G3 panel by considering whether markers occur in the same order on a map and in DNA sequence contigs submitted to GenBank. We find that all of the maps are inconsistent with the sequence data for at least 50% of the contigs, but our integrated maps are more consistent. The map integration strategy not only scales to multiple RH maps but also to any maps that have comparable criteria for measuring map quality. Our software improves on current technology for doing RH mapping in areas of computation time and algorithms for considering a large number of markers for mapping. The essential impediments to producing dense high-quality RH maps are data quality and panel size, not computation.


Subject(s)
Algorithms , Chromosome Mapping/methods , Computational Biology/methods , Software , Chromosome Mapping/statistics & numerical data , Computational Biology/statistics & numerical data , Genetic Markers/genetics , Genetic Vectors/genetics , Haploidy , Humans , Hybrid Cells , Likelihood Functions , Probability
7.
Nucleic Acids Res ; 28(1): 10-4, 2000 Jan 01.
Article in English | MEDLINE | ID: mdl-10592169

ABSTRACT

In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval and resources that operate on the data in GenBank and a variety of other biological data made available through NCBI's Web site. NCBI data retrieval resources include Entrez, PubMed, LocusLink and the Taxonomy Browser. Data analysis resources include BLAST, Electronic PCR, OrfFinder, RefSeq, UniGene, Database of Single Nucleotide Polymorphisms (dbSNP), Human Genome Sequencing pages, GeneMap'99, Davis Human-Mouse Homology Map, Cancer Chromosome Aberration Project (CCAP) pages, Entrez Genomes, Clusters of Orthologous Groups (COGs) database, Retroviral Genotyping Tools, Cancer Genome Anatomy Project (CGAP) pages, SAGEmap, Online Mendelian Inheritance in Man (OMIM) and the Molecular Modeling Database (MMDB). Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at: http://www.ncbi.nlm.nih. gov


Subject(s)
Biology , Databases, Factual , Animals , Gene Expression , Genome, Human , Humans , Information Storage and Retrieval , Mice , Models, Molecular , National Library of Medicine (U.S.) , Neoplasms/genetics , Phenotype , United States
9.
Nat Genet ; 22(4): 384-7, 1999 Aug.
Article in English | MEDLINE | ID: mdl-10431245

ABSTRACT

Radiation hybrid (RH) maps are a useful tool for genome analysis, providing a direct method for localizing genes and anchoring physical maps and genomic sequence along chromosomes. The construction of a comprehensive RH map for the human genome has resulted in gene maps reflecting the location of more than 30,000 human genes. Here we report the first comprehensive RH map of the mouse genome. The map contains 2,486 loci screened against an RH panel of 93 cell lines. Most loci (93%) are simple sequence length polymorphisms (SSLPs) taken from the mouse genetic map, thereby providing direct integration between these two key maps. We performed RH mapping by a new and efficient approach in which we replaced traditional gel- or hybridization-based assays by a homogeneous 5'-nuclease assays involving a single common probe for all genetic markers. The map provides essentially complete connectivity and coverage across the genome, and good resolution for ordering loci, with 1 centiRay (cR) corresponding to an average of approximately 100 kb. The RH map, together with an accompanying World-Wide Web server, makes it possible for any investigator to rapidly localize sequences in the mouse genome. Together with the previously constructed genetic map and a YAC-based physical map reported in a companion paper, the fundamental maps required for mouse genomics are now available.


Subject(s)
Genetic Techniques , Genome , Mice/genetics , Physical Chromosome Mapping , Animals , Lod Score , Models, Genetic , Models, Statistical , Polymorphism, Genetic
12.
Trends Biotechnol ; 16(11): 456-9, 1998 Nov.
Article in English | MEDLINE | ID: mdl-9830153

ABSTRACT

A crucial event in the history of the Human Genome Project was the decision to use sequence-tagged sites (STSs) as common landmarks for genomic mapping. Following several years of constructing STS-based maps of ever-increasing detail, the emphasis has recently shifted towards large-scale genomic sequencing. A computational procedure called 'electronic PCR' allows STS landmarks to be revealed as data emerge from the sequencing pipeline, thereby bridging the gap between mapping and sequencing activities.


Subject(s)
Chromosome Mapping , Models, Genetic , Polymerase Chain Reaction/methods , Sequence Analysis, DNA/methods , Sequence Tagged Sites , Base Sequence , Genome, Human , Humans , Molecular Sequence Data
13.
Science ; 282(5389): 744-6, 1998 Oct 23.
Article in English | MEDLINE | ID: mdl-9784132

ABSTRACT

A map of 30,181 human gene-based markers was assembled and integrated with the current genetic map by radiation hybrid mapping. The new gene map contains nearly twice as many genes as the previous release, includes most genes that encode proteins of known function, and is twofold to threefold more accurate than the previous version. A redesigned, more informative and functional World Wide Web site (www.ncbi.nlm.nih.gov/genemap) provides the mapping information and associated data and annotations. This resource constitutes an important infrastructure and tool for the study of complex genetic traits, the positional cloning of disease genes, the cross-referencing of mammalian genomes, and validated human transcribed sequences for large-scale studies of gene expression.


Subject(s)
Chromosomes, Human/genetics , Genome, Human , Physical Chromosome Mapping , Animals , Expressed Sequence Tags , Gene Expression , Genetic Markers , Human Genome Project , Humans , Internet , Rats , Sequence Tagged Sites
14.
Nat Genet ; 20(1): 19-23, 1998 Sep.
Article in English | MEDLINE | ID: mdl-9731524

ABSTRACT

Microarray technology makes it possible to simultaneously study the expression of thousands of genes during a single experiment. We have developed an information system, ArrayDB, to manage and analyse large-scale expression data. The underlying relational database was designed to allow flexibility in the nature and structure of data input and also in the generation of standard or customized reports through a web-browser interface. ArrayDB provides varied options for data retrieval and analysis tools that should facilitate the interpretation of complex hybridization results. A sampling of ArrayDB storage, retrieval and analysis capabilities is available (www.nhgri.nih.gov/DIR/LCG/15K/HTML/ ), along with information on a set of approximately 15,000 genes used to fabricate several widely used microarrays. Information stored in ArrayDB is used to provide integrated gene expression reports by linking array target sequences with NCBI's Entrez retrieval system, UniGene and KEGG pathway views. The integration of external information resources is essential in interpreting intrinsic patterns and relationships in large-scale gene expression data.


Subject(s)
Database Management Systems , Gene Expression , Molecular Biology/methods , Computer Communication Networks , Databases, Factual , Information Storage and Retrieval , Online Systems , User-Computer Interface
17.
Science ; 274(5287): 540-6, 1996 Oct 25.
Article in English | MEDLINE | ID: mdl-8849440

ABSTRACT

The human genome is thought to harbor 50,000 to 100,000 genes, of which about half have been sampled to date in the form of expressed sequence tags. An international consortium was organized to develop and map gene-based sequence tagged site markers on a set of two radiation hybrid panels and a yeast artificial chromosome library. More than 16,000 human genes have been mapped relative to a framework map that contains about 1000 polymorphic genetic markers. The gene map unifies the existing genetic and physical maps with the nucleotide and protein sequence databases in a fashion that should speed the discovery of genes underlying inherited human disease. The integrated resource is available through a site on the World Wide Web at http://www.ncbi.nlm.nih.gov/SCIENCE96/.


Subject(s)
Chromosome Mapping , Genome, Human , Human Genome Project , Amino Acid Sequence , Animals , Base Sequence , Cell Line , Chromosomes, Artificial, Yeast , Computer Communication Networks , DNA, Complementary/genetics , Databases, Factual , Gene Expression , Genetic Markers , Humans , Multigene Family , RNA, Messenger/genetics , Sequence Homology, Nucleic Acid , Sequence Tagged Sites
SELECTION OF CITATIONS
SEARCH DETAIL