RESUMO
Ubiquitin-proteasome system (UPS) is a highly regulated mechanism of intracellular protein degradation and turnover. The UPS is involved in different biological activities, such as the regulation of gene transcription and cell cycle. Several researchers have applied cheminformatics and artificial intelligence methods to study the inhibition of proteasomes, including the prediction of UPP inhibitors. Following this idea, we applied a new tool for obtaining molecular descriptors (MDs) for modeling proteasome Inhibition in terms of EC50 (µmol/L), in which a set of new MDs called atomic weighted vectors (AWV) and several prediction algorithms were used in cheminformatics studies. In the manuscript, a set of descriptors based on AWV are presented as datasets for training different machine learning techniques, such as linear regression, multiple linear regression (MLR), random forest (RF), K-nearest neighbors (IBK), multi-layer perceptron, best-first search, and genetic algorithm. The results suggest that these atomic descriptors allow adequate modeling of proteasome inhibitors despite artificial intelligence techniques, as a variant to build efficient models for the prediction of inhibitory activity.
RESUMO
In this report are used two data sets involving the main antidiabetic enzyme targets α-amylase and α-glucosidase. The prediction of α-amylase and α-glucosidase inhibitory activity as antidiabetic is carried out using LDA and classification trees (CT). A large data set of 640 compounds for α-amylase and 1546 compounds in the case of α-glucosidase are selected to develop the tree model. In the case of CT-J48 have the better classification model performances for both targets with values above 80%-90% for the training and prediction sets, correspondingly. The best model shows an accuracy higher than 95% for training set; the model was also validated using 10-fold cross-validation procedure and through a test set achieving accuracy values of 85.32% and 86.80%, correspondingly. Additionally, the obtained model is compared with other approaches previously published in the international literature showing better results. Finally, we can say that the present results provided a double-target approach for increasing the estimation of antidiabetic chemicals identification aimed by double-way workflow in virtual screening pipelines.
Assuntos
Inibidores Enzimáticos/química , Modelos Estatísticos , alfa-Amilases/antagonistas & inibidores , alfa-Glucosidases/química , Bases de Dados de Compostos Químicos , Diabetes Mellitus/tratamento farmacológico , Análise Discriminante , Inibidores Enzimáticos/metabolismo , Inibidores Enzimáticos/uso terapêutico , Inibidores de Glicosídeo Hidrolases/química , Inibidores de Glicosídeo Hidrolases/metabolismo , Inibidores de Glicosídeo Hidrolases/uso terapêutico , Humanos , Hipoglicemiantes/química , Hipoglicemiantes/metabolismo , Hipoglicemiantes/uso terapêutico , Análise de Componente Principal , Relação Quantitativa Estrutura-Atividade , alfa-Amilases/metabolismo , alfa-Glucosidases/metabolismoRESUMO
This study presents the impact of carbon nanotubes (CNTs) on mitochondrial oxygen mass flux (Jm) under three experimental conditions. New experimental results and a new methodology are reported for the first time and they are based on CNT Raman spectra star graph transform (spectral moments) and perturbation theory. The experimental measures of Jm showed that no tested CNT family can inhibit the oxygen consumption profiles of mitochondria. The best model for the prediction of Jm for other CNTs was provided by random forest using eight features, obtaining test R-squared (R²) of 0.863 and test root-mean-square error (RMSE) of 0.0461. The results demonstrate the capability of encoding CNT information into spectral moments of the Raman star graphs (SG) transform with a potential applicability as predictive tools in nanotechnology and material risk assessments.
RESUMO
The present work is devoted to the development and application of a multi-agent Quantitative Structure-Activity Relationship (QSAR) classification system for tyrosinase inhibitor identification, in which the individual QSAR outputs are the inputs of a fusion approach based on the voting mechanism. The individual models are based on TOMOCOMD-CARDD (TOpological Molecular COMputational Design-Computer Aided Rational Drug Design) atom-based bilinear descriptors and Linear Discriminant Analysis (LDA) on a novel enlarged, balanced database of 1,429 compounds within 701 greatly dissimilar molecules presenting anti-tyrosinase activity. A total of 21 adequate models are obtained taking into account the requirements of the Organization for Economic Cooperation and Development (OECD) principles for QSAR validation and present global accuracies (Q) above 84.50 and 79.27% in the training and test sets, respectively. The resulted fusion system is used for the in silico identification of synthesized coumarin derivatives as novel tyrosinase inhibitors. The 7-hydroxycoumarin (compound C07) shows potent activity for the inhibition of monophenolase activity of mushroom tyrosinase giving a value of inhibition percentage close to 100% in vitro assays, by means of spectrophotometric analysis. The current report could help to shed some clues in the identification of new chemicals that inhibit tyrosinase enzyme, for entering in the pipeline of drug discovery development.
Assuntos
Cumarínicos/química , Bases de Dados Factuais , Descoberta de Drogas , Inibidores Enzimáticos/química , Monofenol Mono-Oxigenase/antagonistas & inibidores , Relação Quantitativa Estrutura-Atividade , Algoritmos , Simulação por Computador , Desenho Assistido por Computador , Desenho de Fármacos , Ligantes , Modelos Teóricos , Reprodutibilidade dos Testes , Projetos de PesquisaRESUMO
In this review an overview of the application of computational approaches is given. Specifically, the uses of Quantitative Structure-Activity Relationship (QSAR) methods for in silico identification of new families of compounds as novel tyrosinase inhibitors are revised. Assembling, validation of models through prediction series, and virtual screening of external data sets are also shown, to prove the accuracy of the QSAR models obtained with the TOMOCOMD-CARDD (TOpological MOlecular COMputational Design-Computer-Aided Rational Drug Design) software and Linear Discriminant Analysis (LDA) as statistical technique. Together with this, a database is collected for these QSAR studies, and could be considered a useful tool in future QSAR modeling of tyrosinase activity and for scientists that work in the field of this enzyme and its inhibitors. Finally, a translation to real world applications is shown by the use of QSAR models in the identification and posterior in-vitro evaluation of different families of compounds. Several different classes of compounds from various sources (natural and synthetic) were identified. Between them, we can find tetraketones, cycloartanes, ethylsteroids, lignans, dicoumarins and vanilloid derivatives. Finally, some considerations are discussed in order to improve the identification of novel drug-like compounds based on the use of QSAR-Ligand-Based Virtual Screening (LBVS).
Assuntos
Desenho Assistido por Computador , Descoberta de Drogas , Inibidores Enzimáticos , Monofenol Mono-Oxigenase/antagonistas & inibidores , Biologia Computacional , Simulação por Computador , Bases de Dados Factuais , Desenho de Fármacos , Inibidores Enzimáticos/química , Inibidores Enzimáticos/farmacologia , Ligantes , Modelos Biológicos , Estrutura Molecular , Monofenol Mono-Oxigenase/química , Relação Quantitativa Estrutura-Atividade , SoftwareRESUMO
Two-dimensional atom- and bond-based TOMOCOMD-CARDD descriptors and linear discriminant analysis (LDA) are used in this report to perform a quantitative structure-activity relationship (QSAR) study of tyrosinase-inhibitory activity. A database of inhibitors of the enzyme is collected for this study, within 246 highly dissimilar molecules presenting antityrosinase activity. In total, 7 discriminant functions are obtained by using the whole set of atom- and bond-based 2D indices. All the LDA-based QSAR models show accuracies above 90% in the training set and values of the Matthews correlation coefficient (C) varying from 0.85 to 0.90. The external validation set shows globally good classifications between 89% and 91% and C values ranging from 0.75 to 0.81. Finally, QSAR models are used in the selection/identification of the 20 new dicoumarins subset to search for tyrosinase inhibitory activity. Theoretical and experimental results show good correspondence between one another. It is important to remark that most compounds in this series exhibit a more potent inhibitory activity against the mushroom tyrosinase enzyme than the reference compound, Kojic acid (IC(50) = 16.67 muM), resulting in a novel nucleus base (lead) with antityrosinase activity, and this could serve as a starting point for the drug discovery of novel tyrosinase inhibitor lead compounds. ( Journal of Biomolecular Screening 2008:1014-1024).
Assuntos
Descoberta de Drogas/métodos , Peptídeos/análise , Relação Quantitativa Estrutura-Atividade , Análise por Conglomerados , Biologia Computacional , Dicumarol/química , Análise Discriminante , Ligantes , Reprodutibilidade dos Testes , Bibliotecas de Moléculas Pequenas/análiseRESUMO
QSAR (quantitative structure-activity relationship) studies of tyrosinase inhibitors employing Dragon descriptors and linear discriminant analysis (LDA) are presented here. A data set of 653 compounds, 245 with tyrosinase inhibitory activity and 408 having other clinical uses were used. The active data set was processed by k-means cluster analysis in order to design training and prediction series. Seven LDA-based QSAR models were obtained. The discriminant functions applied showed a globally good classification of 99.79% for the best model Class=-96.067+1.988 x 10(2)X0Av +9 1.907 BIC3 + 6.853 CIC1 in the training set. External validation processes to assess the robustness and predictive power of the obtained model were carried out. This external prediction set had an accuracy of 99.44%. After that, the developed models were used in ligand-based virtual screening of tyrosinase inhibitors from the literature and never considered in either training or predicting series. In this case, all screened chemicals were correctly classified by the LDA-based QSAR models. As a final point, these fitted models were used in the screening of new bipiperidine series as new tyrosinase inhibitors. These methods are an adequate alternative to the process of selection/identification of new bioactive compounds. The biosilico assays and in vitro results of inhibitory activity on mushroom tyrosinase showed good correspondence. It is important to stand out that compound BP4 (IC(50)=1.72 microM) showed higher activity in the inhibition against the enzyme than reference compound kojic acid (IC(50)=16.67 microM) and l-mimosine (IC(50)=3.68 microM). These results support the role of biosilico algorithm for the identification of new tyrosinase inhibitor compounds.
Assuntos
Biologia Computacional , Simulação por Computador , Peptídeos/análise , Peptídeos/farmacologia , Software , Bases de Dados Factuais , Análise Discriminante , Desenho de Fármacos , Ligantes , Peptídeos/química , Peptídeos/classificação , Piperidinas/química , Piperidinas/farmacologia , Relação Quantitativa Estrutura-Atividade , Reprodutibilidade dos TestesRESUMO
A new set of bond-level molecular descriptors (bond-based linear indices) are used here in QSAR (quantitative structure-activity relationship) studies of tyrosinase inhibitors, for finding functions that discriminate between the tyrosinase inhibitor compounds and inactive ones. A database of 246 compounds was collected for this study; all organic chemicals were reported as tyrosinase inhibitors; they had great structural diversity. This dataset can be considered as a helpful tool, not only for theoretical chemists but also for other researchers in this area. The set used as inactive has 412 drugs with other clinical uses. Twelve LDA-based QSAR models were obtained, the first six using the non-stochastic total and local bond-based linear indices as well as the last six ones, the stochastic molecular descriptors. The best two discriminant models computed using the non-stochastic and stochastic molecular descriptors (Eqs. , respectively) had globally good classifications of 98.95% and 89.75% in the training set, with high Matthews correlation coefficients (C) of 0.98 and 0.78. The external prediction sets had accuracies of 98.89% and 89.44%, and (C) values of 0.98 and 0.78, for models 7 and 13, respectively. A virtual screening of compounds reported in the literature with such activity was carried out, to prove the ability of present models to search for tyrosinase inhibitors, not included in the training or test set. At the end, the fitted discriminant functions were used in the selection/identification of new ethylsteroids isolated from herbal plants, looking for tyrosinase inhibitory activity. A good behavior is shown between the theoretical and experimental results on mushroom tyrosinase enzyme. It might be highlighted that all the compounds showed values under 10microM and that ES2 (IC(50)=1.25microM) showed higher activity in the inhibition against the enzyme than reference compounds kojic acid (IC(50)=16.67microM) and l-mimosine (IC(50)=3.68microM). In addition, a comparison with other established methods was carried to prove the adequate discriminatory performance of the molecular descriptors used here. The present algorithm provided useful clues that can be used to speed up in the identification of new tyrosinase inhibitor compounds.
Assuntos
Simulação por Computador , Monofenol Mono-Oxigenase/antagonistas & inibidores , Peptídeos/farmacologia , Relação Quantitativa Estrutura-Atividade , Agaricales/enzimologia , Algoritmos , Análise Discriminante , Modelos Biológicos , Modelos Químicos , Modelos Moleculares , Estrutura Molecular , Peptídeos/química , Peptídeos/classificaçãoRESUMO
In the present report, the use of the atom-based linear indices for finding functions that discriminate between the tyrosinase inhibitor compounds and inactive ones is presented. In this sense, discriminant models were applied and globally good classifications of 93.51% and 92.46% were observed for non-stochastic and stochastic linear indices best models, respectively, in the training set. The external prediction sets had accuracies of 91.67% and 89.44%. In addition, these fitted models were used in the screening of new cycloartane compounds isolated from herbal plants. A good behavior is shown between the theoretical and experimental results. These results provide a tool that can be used in the identification of new tyrosinase inhibitor compounds.