A Machine Learning Bioinformatics Method to Predict Biological Activity from Biosynthetic Gene Clusters.
J Chem Inf Model
; 61(6): 2560-2571, 2021 06 28.
Article
em En
| MEDLINE
| ID: mdl-34042443
Research in natural products, the genetically encoded small molecules produced by organisms in an idiosyncratic fashion, deals with molecular structure, biosynthesis, and biological activity. Bioinformatics analyses of microbial genomes can successfully reveal the genetic instructions, biosynthetic gene clusters, that produce many natural products. Genes to molecule predictions made on biosynthetic gene clusters have revealed many important new structures. There is no comparable method for genes to biological activity predictions. To address this missing pathway, we developed a machine learning bioinformatics method for predicting a natural product's antibiotic activity directly from the sequence of its biosynthetic gene cluster. We trained commonly used machine learning classifiers to predict antibacterial or antifungal activity based on features of known natural product biosynthetic gene clusters. We have identified classifiers that can attain accuracies as high as 80% and that have enabled the identification of biosynthetic enzymes and their corresponding molecular features that are associated with antibiotic activity.
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Produtos Biológicos
/
Biologia Computacional
Tipo de estudo:
Prognostic_studies
/
Risk_factors_studies
Idioma:
En
Revista:
J Chem Inf Model
Assunto da revista:
INFORMATICA MEDICA
/
QUIMICA
Ano de publicação:
2021
Tipo de documento:
Article
País de afiliação:
Estados Unidos
País de publicação:
Estados Unidos