Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Publication year range
1.
Nucleic Acids Res ; 31(1): 388-9, 2003 Jan 01.
Article in English | MEDLINE | ID: mdl-12520029

ABSTRACT

The CluSTr database (http://www.ebi.ac.uk/clustr/) offers an automatic classification of SWISS-PROT+TrEMBL proteins into groups of related proteins. The clustering is based on analysis of all pair-wise sequence comparisons between proteins using the Smith-Waterman algorithm. The analysis, carried out on different levels of protein similarity, yields a hierarchical organization of clusters. Information about domain content of the clustered proteins is provided via the InterPro resource. The introduced InterPro 'condensed graphical view' simplifies the visual analysis of represented domain architectures. Integrated applications allow users to visualize and edit multiple alignments and build sequence divergence trees. Links to the relevant structural data in Protein Data Bank (PDB) and Homology derived Secondary Structure of Proteins (HSSP) are also provided.


Subject(s)
Databases, Protein , Proteins/chemistry , Proteins/classification , Animals , Cluster Analysis , Computer Graphics , Internet , Protein Structure, Tertiary , Sequence Alignment , User-Computer Interface
2.
Bioinformatics ; 18(2): 374-5, 2002 Feb.
Article in English | MEDLINE | ID: mdl-11847096

ABSTRACT

MOTIVATION: The SWISS-PROT group at the EBI has developed the Proteome Analysis Database utilizing existing resources and providing comprehensive and integrated comparative analysis of the predicted protein coding sequences of the complete genomes of bacteria, archaea and eukaryotes. The Proteome Analysis Database is accompanied by a program that has been designed to carry out interactive InterPro proteome comparisons for any one proteome against any other one or more of the proteomes in the database.


Subject(s)
Databases, Protein , Proteome , Software , Computational Biology , Genome
3.
Curr Opin Struct Biol ; 11(3): 334-9, 2001 Jun.
Article in English | MEDLINE | ID: mdl-11406384

ABSTRACT

Various sequence-motif and sequence-cluster databases have been integrated into a new resource known as InterPro. Because the contributing databases have different clustering principles and scoring sensitivities, the combined assignments complement each other for grouping protein families and delineating domains. InterPro and new developments in the analysis of both the phylogenetic profiles of protein families and domain fusion events improve the prediction of specific functions for numerous proteins.


Subject(s)
Amino Acid Motifs , Databases, Factual , Phylogeny , Proteins , Sequence Alignment/methods
4.
Nucleic Acids Res ; 29(1): 33-6, 2001 Jan 01.
Article in English | MEDLINE | ID: mdl-11125042

ABSTRACT

The CluSTr (Clusters of SWISS-PROT and TrEMBL proteins) database offers an automatic classification of SWISS-PROT and TrEMBL proteins into groups of related proteins. The clustering is based on analysis of all pairwise comparisons between protein sequences. Analysis has been carried out for different levels of protein similarity, yielding a hierarchical organisation of clusters. The database provides links to InterPro, which integrates information on protein families, domains and functional sites from PROSITE, PRINTS, Pfam and ProDom. Links to the InterPro graphical interface allow users to see at a glance whether proteins from the cluster share particular functional sites. CluSTr also provides cross-references to HSSP and PDB. The database is available for querying and browsing at http://www.ebi.ac.uk/clustr.


Subject(s)
Databases, Factual , Proteins , Animals , Carrier Proteins/genetics , Humans , Information Services , Internet , Proteins/genetics , Sequence Alignment , Sodium/metabolism
5.
Nucleic Acids Res ; 29(1): 44-8, 2001 Jan 01.
Article in English | MEDLINE | ID: mdl-11125045

ABSTRACT

The SWISS-PROT group at EBI has developed the Proteome Analysis Database utilising existing resources and providing comparative analysis of the predicted protein coding sequences of the complete genomes of bacteria, archaea and eukaryotes (http://www.ebi.ac. uk/proteome/). The two main projects used, InterPro and CluSTr, give a new perspective on families, domains and sites and cover 31-67% (InterPro statistics) of the proteins from each of the complete genomes. CluSTr covers the three complete eukaryotic genomes and the incomplete human genome data. The Proteome Analysis Database is accompanied by a program that has been designed to carry out InterPro proteome comparisons for any one proteome against any other one or more of the proteomes in the database.


Subject(s)
Databases, Factual , Proteome , Animals , Genome , Genome, Human , Humans , Information Services , Internet , Proteins/classification , Proteins/genetics
6.
Bioinformatics ; 16(12): 1159-60, 2000 Dec.
Article in English | MEDLINE | ID: mdl-11159338

ABSTRACT

SUMMARY: A collection of transmembrane proteins with annotated transmembrane regions, for which good experimental evidence exist, was created as a test or training set for algorithms to predict transmembrane regions in proteins.


Subject(s)
Algorithms , Membrane Proteins/chemistry , Software , Computational Biology , Internet
7.
J Biomol Struct Dyn ; 17(2): 281-8, 1999 Oct.
Article in English | MEDLINE | ID: mdl-10563578

ABSTRACT

Statistics of the exon-intron structure and splicing sites of several diverse eukaryotes was studied. The yeast exon-intron structures have a number of unique features. A yeast gene usually have at most one intron. The branch site is strongly conserved, whereas the polypirimidine tract is short. Long yeast introns tend to have stronger acceptor sites. In other species the branch site is less conserved and often cannot be determined. In non-yeast samples there is an almost universal correlation between lengths of neighboring exons (all samples excluding protists) and correlation between lengths of neighboring introns (human, drosophila, protists). On the average first introns are longer, and anomalously long introns are usually first introns in a gene. There is a universal preference for exons and exon pairs with the (total) length divisible by 3. Introns positioned between codons are preferred, whereas those positioned between the first and second positions in codon are avoided. The choice of A or G at the third position of intron (the donor splice sites generally prefer purines at this position) is correlated with the overall GC-composition of the gene. In all samples dinucleotide AG is avoided in the region preceding the acceptor site.


Subject(s)
Eukaryotic Cells/chemistry , Exons , Introns , Models, Statistical , Algorithms , Animals , Apicomplexa/genetics , Arabidopsis/genetics , Aspergillus/genetics , Databases, Factual , Humans , Models, Genetic , Pyrimidines , RNA Splicing/genetics , Saccharomyces cerevisiae/genetics , Sequence Analysis, DNA
8.
Biofizika ; 44(4): 595-600, 1999.
Article in Russian | MEDLINE | ID: mdl-10544807

ABSTRACT

The exon-intron structure of human, insect (Drosophila sp.), and dicot plant (Arabidopsis thaliana) genes was considered. In each genome there exists a characteristic intron length. Anomalously long introns was usually the first introns in genes. In each sample there are correlations between the lengths of neighboring exons and between exon lengths and closeness to the consensus of the sites at exon boundaries. Exons and exon pairs containing an integer number of triplets are preferred. These results are relevant to the study of splicing mechanism and evolution of introns, as well as construction of gene recognition algorithms.


Subject(s)
Arabidopsis/genetics , Drosophila/genetics , Exons , Genome , Introns , Algorithms , Animals , Humans , RNA Splicing
SELECTION OF CITATIONS
SEARCH DETAIL
...