Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters










Database
Language
Publication year range
1.
Nature ; 428(6982): 522-8, 2004 Apr 01.
Article in English | MEDLINE | ID: mdl-15057823

ABSTRACT

Chromosome 13 is the largest acrocentric human chromosome. It carries genes involved in cancer including the breast cancer type 2 (BRCA2) and retinoblastoma (RB1) genes, is frequently rearranged in B-cell chronic lymphocytic leukaemia, and contains the DAOA locus associated with bipolar disorder and schizophrenia. We describe completion and analysis of 95.5 megabases (Mb) of sequence from chromosome 13, which contains 633 genes and 296 pseudogenes. We estimate that more than 95.4% of the protein-coding genes of this chromosome have been identified, on the basis of comparison with other vertebrate genome sequences. Additionally, 105 putative non-coding RNA genes were found. Chromosome 13 has one of the lowest gene densities (6.5 genes per Mb) among human chromosomes, and contains a central region of 38 Mb where the gene density drops to only 3.1 genes per Mb.


Subject(s)
Chromosomes, Human, Pair 13/genetics , Genes/genetics , Physical Chromosome Mapping , Chromosome Mapping , Genetics, Medical , Humans , Pseudogenes/genetics , RNA, Untranslated/genetics , Sequence Analysis, DNA
2.
Nature ; 425(6960): 805-11, 2003 Oct 23.
Article in English | MEDLINE | ID: mdl-14574404

ABSTRACT

Chromosome 6 is a metacentric chromosome that constitutes about 6% of the human genome. The finished sequence comprises 166,880,988 base pairs, representing the largest chromosome sequenced so far. The entire sequence has been subjected to high-quality manual annotation, resulting in the evidence-supported identification of 1,557 genes and 633 pseudogenes. Here we report that at least 96% of the protein-coding genes have been identified, as assessed by multi-species comparative sequence analysis, and provide evidence for the presence of further, otherwise unsupported exons/genes. Among these are genes directly implicated in cancer, schizophrenia, autoimmunity and many other diseases. Chromosome 6 harbours the largest transfer RNA gene cluster in the genome; we show that this cluster co-localizes with a region of high transcriptional activity. Within the essential immune loci of the major histocompatibility complex, we find HLA-B to be the most polymorphic gene on chromosome 6 and in the human genome.


Subject(s)
Chromosomes, Human, Pair 6/genetics , Genes/genetics , Physical Chromosome Mapping , Animals , Exons/genetics , Genetic Diseases, Inborn/genetics , HLA-B Antigens/genetics , Humans , Pseudogenes/genetics , RNA, Transfer/genetics , Sequence Analysis, DNA
3.
Genome Biol ; 3(12): RESEARCH0082, 2002.
Article in English | MEDLINE | ID: mdl-12537571

ABSTRACT

The well-established inaccuracy of purely computational methods for annotating genome sequences necessitates an interactive tool to allow biological experts to refine these approximations by viewing and independently evaluating the data supporting each annotation. Apollo was developed to meet this need, enabling curators to inspect genome annotations closely and edit them. FlyBase biologists successfully used Apollo to annotate the Drosophila melanogaster genome and it is increasingly being used as a starting point for the development of customized annotation editing tools for other genome projects.


Subject(s)
Databases, Nucleic Acid , Software Design , Animals , Database Management Systems , Humans , User-Computer Interface
4.
Nature ; 409(6822): 827-8, 2001 Feb 15.
Article in English | MEDLINE | ID: mdl-11236999

ABSTRACT

Now that the draft human genome sequence is available, everyone wants to be able to use it. However, we have perhaps become complacent about our ability to turn new genomes into lists of genes. The higher volume of data associated with a larger genome is accompanied by a much greater increase in complexity. We need to appreciate both the scale of the challenge of vertebrate genome analysis and the limitations of current gene prediction methods and understanding.


Subject(s)
Genome, Human , Information Storage and Retrieval , Animals , Databases, Factual , Genes , Human Genome Project , Humans , Internet
5.
Bioinformatics ; 16(2): 111-6, 2000 Feb.
Article in English | MEDLINE | ID: mdl-10842731

ABSTRACT

MOTIVATION: An automatic sequence searching method (ProtEST) is described which constructs multiple protein sequence alignments from protein sequences and translated expressed sequence tags (ESTs). ProtEST is more effective than a simple TBLASTN search of the query against the EST database, as the sequences are automatically clustered, assembled, made non-redundant, checked for sequence errors, translated into protein and then aligned and displayed. RESULTS: A ProtEST search found a non-redundant, translated, error- and length-corrected EST sequence for > 58% of sequences when single sequences from 1407 Pfam-A seed alignments were used as the probe. The average family size of the resulting alignments of translated EST sequences contained > 10 sequences. In a cross-validated test of protein secondary structure prediction, alignments from the new procedure led to an improvement of 3.4% average Q3 prediction accuracy over single sequences. AVAILABILITY: The ProtEST method is available as an Internet World Wide Web service http://barton.ebi.ac.uk/servers/protest.html+ ++ The Wise2 package for protein and genomic comparisons and the ProtESTWise script can be found at http://www.sanger.ac.uk/Software/Wise2 CONTACT: geoff@ebi.ac.uk


Subject(s)
Expressed Sequence Tags , Proteins/analysis , Sequence Alignment/methods , Amino Acid Sequence , Molecular Sequence Data , Protein Biosynthesis
6.
Bioinformatics ; 14(10): 892-3, 1998.
Article in English | MEDLINE | ID: mdl-9927721

ABSTRACT

UNLABELLED: An interactive protein secondary structure prediction Internet server is presented. The server allows a single sequence or multiple alignment to be submitted, and returns predictions from six secondary structure prediction algorithms that exploit evolutionary information from multiple sequences. A consensus prediction is also returned which improves the average Q3 accuracy of prediction by 1% to 72.9%. The server simplifies the use of current prediction algorithms and allows conservation patterns important to structure and function to be identified. AVAILABILITY: http://barton.ebi.ac.uk/servers/jpred.h tml CONTACT: geoff@ebi.ac.uk


Subject(s)
Internet , Protein Structure, Secondary , Software , Algorithms , Computational Biology , Consensus Sequence , Sequence Alignment
SELECTION OF CITATIONS
SEARCH DETAIL
...