Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
Science ; 294(5543): 849-52, 2001 Oct 26.
Article in English | MEDLINE | ID: mdl-11679669

ABSTRACT

Listeria monocytogenes is a food-borne pathogen with a high mortality rate that has also emerged as a paradigm for intracellular parasitism. We present and compare the genome sequences of L. monocytogenes (2,944,528 base pairs) and a nonpathogenic species, L. innocua (3,011,209 base pairs). We found a large number of predicted genes encoding surface and secreted proteins, transporters, and transcriptional regulators, consistent with the ability of both species to adapt to diverse environments. The presence of 270 L. monocytogenes and 149 L. innocua strain-specific genes (clustered in 100 and 63 islets, respectively) suggests that virulence in Listeria results from multiple gene acquisition and deletion events.


Subject(s)
Bacterial Proteins/genetics , Genome, Bacterial , Listeria monocytogenes/genetics , Listeria/genetics , Adaptation, Physiological , Amino Acid Motifs , Bacillus subtilis/genetics , Bacterial Proteins/chemistry , Bacterial Proteins/physiology , Base Composition , Carrier Proteins/chemistry , Carrier Proteins/genetics , Chromosomes, Bacterial/genetics , DNA, Bacterial/chemistry , DNA, Bacterial/genetics , Gene Transfer, Horizontal , Genes, Bacterial , Genomics , Listeria/chemistry , Listeria/physiology , Listeria monocytogenes/chemistry , Listeria monocytogenes/pathogenicity , Listeria monocytogenes/physiology , Membrane Proteins/chemistry , Membrane Proteins/genetics , Sequence Analysis, DNA , Staphylococcus aureus/genetics , Transcription Factors/chemistry , Transcription Factors/genetics , Virulence/genetics
2.
Bioinformatics ; 15(5): 391-412, 1999 May.
Article in English | MEDLINE | ID: mdl-10366660

ABSTRACT

MOTIVATION: Large-scale genome projects generate a rapidly increasing number of sequences, most of them biochemically uncharacterized. Research in bioinformatics contributes to the development of methods for the computational characterization of these sequences. However, the installation and application of these methods require experience and are time consuming. RESULTS: We present here an automatic system for preliminary functional annotation of protein sequences that has been applied to the analysis of sets of sequences from complete genomes, both to refine overall performance and to make new discoveries comparable to those made by human experts. The GeneQuiz system includes a Web-based browser that allows examination of the evidence leading to an automatic annotation and offers additional information, views of the results, and links to biological databases that complement the automatic analysis. System structure and operating principles concerning the use of multiple sequence databases, underlying sequence analysis tools, lexical analyses of database annotations and decision criteria for functional assignments are detailed. The system makes automatic quality assessments of results based on prior experience with the underlying sequence analysis tools; overall error rates in functional assignment are estimated at 2.5-5% for cases annotated with highest reliability ('clear' cases). Sources of over-interpretation of results are discussed with proposals for improvement. A conservative definition for reporting 'new findings' that takes account of database maturity is presented along with examples of possible kinds of discoveries (new function, family and superfamily) made by the system. System performance in relation to sequence database coverage, database dynamics and database search methods is analysed, demonstrating the inherent advantages of an integrated automatic approach using multiple databases and search methods applied in an objective and repeatable manner. AVAILABILITY: The GeneQuiz system is publicly available for analysis of protein sequences through a Web server at http://www.sander.ebi.ac. uk/gqsrv/submit


Subject(s)
Computer Systems , Databases, Factual , Proteins/analysis , Sequence Analysis/methods , Software , Amino Acid Sequence , Automation , Humans , Molecular Sequence Data
4.
Nucleic Acids Res ; 25(1): 226-30, 1997 Jan 01.
Article in English | MEDLINE | ID: mdl-9016541

ABSTRACT

HSSP is a derived database merging structural (3-D) and sequence (1-D) information. For each protein of known 3-D structure from the Protein Data Bank (PDB), the database has a multiple sequence alignment of all available homologues and a sequence profile characteristic of the family. The list of homologues is the result of a database search in SwissProt using a position-weighted dynamic programming method for sequence profile alignment (MaxHom). The database is updated frequently. The listed homologues are very likely to have the same 3-D structure as the PDB protein to which they have been aligned. As a result, the database is not only a database of aligned sequence families, but also a database of implied secondary and tertiary structures covering 29% of all SwissProt-stored sequences.


Subject(s)
Databases, Factual , Proteins/genetics , Sequence Alignment , Sequence Homology, Amino Acid , Amino Acid Sequence , Animals , Humans , Molecular Sequence Data , Protein Structure, Secondary , Protein Structure, Tertiary , Proteins/chemistry
SELECTION OF CITATIONS
SEARCH DETAIL
...