RESUMO
Feed-forward neural networks (FFNNs) were used to predict the skeletal type of molecules belonging to six classes of terpenoids. A database that contains the 13C NMR spectra of about 5000 compounds was used to train the FFNNs. An efficient representation of the spectra was designed and the constitution of the best FFNN input vector format resorted from an heuristic approach. The latter was derived from general considerations on terpenoid structures.
RESUMO
This paper describes the application of artificial neural nets as an alternative and efficient method for the classification of botanical taxa based on chemical data (chemosystematics). A total of 28,000 botanical occurrences of chemical compounds isolated from the Asteraceae family were chosen from the literature, and grouped by chemical class for each species. Four tests were carried out to differentiate and classify different botanical taxa. The qualifying capacity of the artificial neural nets was dichotomically tested at different hierarchical levels of the family, such as subfamilies and groups of Heliantheae subtribes. Furthermore, two specific subtribes of the Heliantheae and two genera of one of these subtribes were also tested. In general, the artificial neural net gave rise to good results, with multiple-correlation values R>0.90. Hence, it was possible to differentiate the dichotomic character of the botanical taxa studied.
Assuntos
Asteraceae/química , Asteraceae/classificação , Redes Neurais de Computação , Asteraceae/metabolismo , FilogeniaRESUMO
The training and the application of a neural network system for the prediction of occurrences of secondary metabolites belonging to diverse chemical classes in the Asteraceae is described. From a database containing about 604 genera and 28,000 occurrences of secondary metabolites in the plant family, information was collected encompassing nine chemical classes and their respective occurrences for training of a multi-layer net using the back-propagation algorithm. The net supplied as output the presence or absence of the chemical classes as well as the number of compounds isolated from each taxon. The results provided by the net from the presence or absence of a chemical class showed a 89% hit rate; by excluding triterpenes from the analysis, only 5% of the genera studied exhibited errors greater than 10%.
Assuntos
Asteraceae/química , Redes Neurais de Computação , Técnicas de Química Analítica/métodosRESUMO
The aim of this paper is to present a procedure that utilizes 13C NMR for identification of substituent groups which are bonded to carbon skeletons of natural products. For so much was developed a new version of the program MACRONO, that presents a database with 161 substituent types found in the most varied terpenoids. This new version was widely tested in the identification of the substituents of 60 compounds that, after removal of the signals that did not belong to the carbon skeleton, served to test the prediction of skeletons by using other programs of the expert system SISTEMAT.