Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 1 de 1
Filter
Add more filters










Database
Language
Publication year range
1.
BMC Bioinformatics ; 8 Suppl 4: S6, 2007 May 22.
Article in English | MEDLINE | ID: mdl-17570149

ABSTRACT

BACKGROUND: Existing methods for whole-genome comparisons require prior knowledge of related species and provide little automation in the function prediction process. Bacteriophage genomes are an example that cannot be easily analyzed by these methods. This work addresses these shortcomings and aims to provide an automated prediction system of gene function. RESULTS: We have developed a novel system called SynFPS to perform gene function prediction over completed genomes. The prediction system is initialized by clustering a large collection of weakly related genomes into groups based on their resemblance in gene distribution. From each individual group, data are then extracted and used to train a Support Vector Machine that makes gene function predictions. Experiments were conducted with 9 different gene functions over 296 bacteriophage genomes. Cross validation results gave an average prediction accuracy of ~80%, which is comparable to other genomic-context based prediction methods. Functional predictions are also made on 3 uncharacterized genes and 12 genes that cannot be identified by sequence alignment. The software is publicly available at http://www.synteny.net/. CONCLUSION: The proposed system employs genomic context to predict gene function and detect gene correspondence in whole-genome comparisons. Although our experimental focus is on bacteriophages, the method may be extended to other microbial genomes as they share a number of similar characteristics with phage genomes such as gene order conservation.


Subject(s)
Artificial Intelligence , Bacteriophages/genetics , Chromosome Mapping/methods , Cluster Analysis , Genome, Viral/genetics , Multigene Family/genetics , Sequence Analysis, DNA/methods , Algorithms , Base Sequence , Discriminant Analysis , Molecular Sequence Data , Pattern Recognition, Automated/methods , Sequence Alignment/methods , Sequence Homology, Nucleic Acid
SELECTION OF CITATIONS
SEARCH DETAIL
...