Comparative genomics using data mining tools.
J Biosci
;
2002 Feb; 27(1 Suppl 1): 15-25
Artículo
en Inglés
| IMSEAR
| ID: sea-110630
ABSTRACT
We have analysed the genomes of representatives of three kingdoms of life, namely, archaea, eubacteria and eukaryota using data mining tools based on compositional analyses of the protein sequences. The representatives chosen in this analysis were Methanococcus jannaschii, Haemophilus influenzae and Saccharomyces cerevisiae. We have identified the common and different features between the three genomes in the protein evolution patterns. M. jannaschii has been seen to have a greater number of proteins with more charged amino acids whereas S. cerevisiae has been observed to have a greater number of hydrophilic proteins. Despite the differences in intrinsic compositional characteristics between the proteins from the different genomes we have also identified certain common characteristics. We have carried out exploratory Principal Component Analysis of the multivariate data on the proteins of each organism in an effort to classify the proteins into clusters. Interestingly, we found that most of the proteins in each organism cluster closely together, but there are a few 'outliers'. We focus on the outliers for the functional investigations, which may aid in revealing any unique features of the biology of the respective organisms
Texto completo:
Disponible
Índice:
IMSEAR (Asia Sudoriental)
Asunto principal:
Saccharomyces cerevisiae
/
Proteínas Bacterianas
/
Humanos
/
Haemophilus influenzae
/
Methanococcus
/
Genoma Fúngico
/
Genoma Bacteriano
/
Análisis de Secuencia de ADN
/
Biología Computacional
/
Proteínas Arqueales
Tipo de estudio:
Estudio pronóstico
Idioma:
Inglés
Revista:
J Biosci
Año:
2002
Tipo del documento:
Artículo
Similares
MEDLINE
...
LILACS
LIS