Pesquisa | Portal Regional da BVS (teste)

1.

MultiCASE Platform for In Silico Toxicology.

Chakravarti, Suman K; Saiakhov, Roustem D.

Methods Mol Biol ; 2425: 497-518, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-35188644

RESUMO

Predictive and computational toxicology, a highly scientific and research-based field, is rapidly progressing with wider acceptance by regulatory agencies around the world. Almost every aspect of the field has seen fundamental changes during the last decade due to the availability of more data, usage, and acceptance of a variety of predictive tools and an increase in the overall awareness. Also, the influence from the recent explosive developments in the field of artificial intelligence has been significant. However, the need for sophisticated, easy to use and well-maintained software platforms for in silico toxicological assessments remains very high. The MultiCASE suite of software is one such platform that consists of an integrated collection of software programs, tools, and databases. While providing easy-to-use and highly useful tools that are relevant at present, it has always remained at the forefront of research and development by inventing new technologies and discovering new insights in the area of QSAR, artificial intelligence, and machine learning. This chapter gives the background, an overview of the software and databases involved, and a brief description of the usage methodology with the aid of examples.

Assuntos

Relação Quantitativa Estrutura-Atividade , Toxicologia , Inteligência Artificial , Simulação por Computador , Bases de Dados Factuais , Software , Toxicologia/métodos

2.

Reason Vectors: Abstract Representation of Chemistry-Biology Interaction Outcomes, for Reasoning and Prediction.

Chakravarti, Suman K.

J Chem Inf Model ; 60(10): 4614-4628, 2020 10 26.

Artigo em Inglês | MEDLINE | ID: mdl-32960051

RESUMO

Many traditional quantitative structure-activity relationship (QSAR) models are based on correlation with high-dimensional, highly variable molecular features in their raw form, limiting their generalizing capabilities despite the use of large training sets. They also lack elements of causality and reasoning. With these issues in mind, we developed a method for learning higher-level abstract representations of the effects of the interactions between molecular features and biology. We named the representations as the reason vectors. They are composed of a series of computed activity of substructures obtained from stepwise reconstruction of the molecule. This representation is very different from fingerprints, which are composed of molecular features directly. These vectors capture reasons of bioactivity of chemicals (or absence thereof) in an abstract form, uncover causality in interactions between chemical features, and generalize beyond specific chemical classes or bioactivity. Reason vectors contain only a few key attributes and are much smaller than molecular fingerprints. They allow vague and conceptual similarity searches, less susceptible to failure on novel combinations of query molecule features and more likely to identify reasons of activity in chemical classes that are absent in training data. Reason vectors can be compared with each other and their activity can be computed by matching with vectors from molecules with known bioactivity. A single molecule produces as many reason vectors as heavy atoms in it, and a simple count of these vectors in a series of activity ranges is all what is needed to predict its bioactivity. Thus, the prediction method is devoid of gradient optimization or statistical fitting.

Assuntos

Biologia , Relação Quantitativa Estrutura-Atividade

3.

Descriptor Free QSAR Modeling Using Deep Learning With Long Short-Term Memory Neural Networks.

Chakravarti, Suman K; Alla, Sai Radha Mani.

Front Artif Intell ; 2: 17, 2019.

Artigo em Inglês | MEDLINE | ID: mdl-33733106

RESUMO

Current practice of building QSAR models usually involves computing a set of descriptors for the training set compounds, applying a descriptor selection algorithm and finally using a statistical fitting method to build the model. In this study, we explored the prospects of building good quality interpretable QSARs for big and diverse datasets, without using any pre-calculated descriptors. We have used different forms of Long Short-Term Memory (LSTM) neural networks to achieve this, trained directly using either traditional SMILES codes or a new linear molecular notation developed as part of this work. Three endpoints were modeled: Ames mutagenicity, inhibition of P. falciparum Dd2 and inhibition of Hepatitis C Virus, with training sets ranging from 7,866 to 31,919 compounds. To boost the interpretability of the prediction results, attention-based machine learning mechanism, jointly with a bidirectional LSTM was used to detect structural alerts for the mutagenicity data set. Traditional fragment descriptor-based models were used for comparison. As per the results of the external and cross-validation experiments, overall prediction accuracies of the LSTM models were close to the fragment-based models. However, LSTM models were superior in predicting test chemicals that are dissimilar to the training set compounds, a coveted quality of QSAR models in real world applications. In summary, it is possible to build QSAR models using LSTMs without using pre-computed traditional descriptors, and models are far from being "black box." We wish that this study will be helpful in bringing large, descriptor-less QSARs to mainstream use.

4.

Computing similarity between structural environments of mutagenicity alerts.

Chakravarti, Suman K; Saiakhov, Roustem D.

Mutagenesis ; 34(1): 55-65, 2019 03 06.

Artigo em Inglês | MEDLINE | ID: mdl-30346583

RESUMO

This article describes a method to generate molecular fingerprints from structural environments of mutagenicity alerts and calculate similarity between them. This approach was used to improve classification accuracy of alerts and for searching structurally similar analogues of an alerting chemical. It builds fingerprints using molecular fragments from the vicinity of the alerts and automatically accounts for the activating and deactivating/mitigating features of alerts needed for accurate predictions. This study also demonstrates the usefulness of transfer learning in which a distributed representation of chemical fragments was first trained on millions of unlabelled chemicals and then used for generating fingerprints and similarity search on smaller data sets labelled with Ames test outcomes. The distributed fingerprints gave better prediction performance and increased coverage compared to traditional binary fingerprints. The methodology was applied to four common mutagenic functionalities-primary aromatic amine, aromatic nitro, epoxide and alkyl chloride. Effects of various hyperparameters on prediction accuracy and test coverage for the k-nearest neighbours prediction method are also described, e.g. similarity thresholds, number of neighbours and size of the alert environment.

Assuntos

Aminas/química , Compostos de Epóxi/química , Mutagênicos/química , Nitrocompostos/química , Aminas/toxicidade , Compostos de Epóxi/toxicidade , Mutagênese/efeitos dos fármacos , Testes de Mutagenicidade/métodos , Mutagênicos/toxicidade , Nitrocompostos/toxicidade

5.

Distributed Representation of Chemical Fragments.

Chakravarti, Suman K.

ACS Omega ; 3(3): 2825-2836, 2018 Mar 31.

Artigo em Inglês | MEDLINE | ID: mdl-30023852

RESUMO

This article describes an unsupervised machine learning method for computing distributed vector representation of molecular fragments. These vectors encode fragment features in a continuous high-dimensional space and enable similarity computation between individual fragments, even for small fragments with only two heavy atoms. The method is based on a word embedding algorithm borrowed from natural language processing field, and approximately 6 million unlabeled PubChem chemicals were used for training. The resulting dense fragment vectors are in contrast to the traditional sparse "one-hot" fragment representation and capture rich relational structure in the fragment space. The vectors of small linear fragments were averaged to yield distributed vectors of bigger fragments and molecules, which were used for different tasks, e.g., clustering, ligand recall, and quantitative structure-activity relationship modeling. The distributed vectors were found to be better at clustering ring systems and recall of kinase ligands as compared to standard binary fingerprints. This work demonstrates unsupervised learning of fragment chemistry from large sets of unlabeled chemical structures and subsequent application to supervised training on relatively small data sets of labeled chemicals.

6.

Critical evaluation of human oral bioavailability for pharmaceutical drugs by using various cheminformatics approaches.

Kim, Marlene T; Sedykh, Alexander; Chakravarti, Suman K; Saiakhov, Roustem D; Zhu, Hao.

Pharm Res ; 31(4): 1002-14, 2014 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-24306326

RESUMO

PURPOSE: Oral bioavailability (%F) is a key factor that determines the fate of a new drug in clinical trials. Traditionally, %F is measured using costly and time-consuming experimental tests. Developing computational models to evaluate the %F of new drugs before they are synthesized would be beneficial in the drug discovery process. METHODS: We employed Combinatorial Quantitative Structure-Activity Relationship approach to develop several computational %F models. We compiled a %F dataset of 995 drugs from public sources. After generating chemical descriptors for each compound, we used random forest, support vector machine, k nearest neighbor, and CASE Ultra to develop the relevant QSAR models. The resulting models were validated using five-fold cross-validation. RESULTS: The external predictivity of %F values was poor (R(2) = 0.28, n = 995, MAE = 24), but was improved (R(2) = 0.40, n = 362, MAE = 21) by filtering unreliable predictions that had a high probability of interacting with MDR1 and MRP2 transporters. Furthermore, classifying the compounds according to the %F values (%F < 50% as "low", %F ≥ 50% as 'high") and developing category QSAR models resulted in an external accuracy of 76%. CONCLUSIONS: In this study, we developed predictive %F QSAR models that could be used to evaluate new drug compounds, and integrating drug-transporter interactions data greatly benefits the resulting models.

Assuntos

Química Farmacêutica/normas , Preparações Farmacêuticas/administração & dosagem , Preparações Farmacêuticas/química , Relação Quantitativa Estrutura-Atividade , Administração Oral , Disponibilidade Biológica , Química Farmacêutica/métodos , Bases de Dados Factuais , Humanos

7.

Optimizing predictive performance of CASE Ultra expert system models using the applicability domains of individual toxicity alerts.

Chakravarti, Suman K; Saiakhov, Roustem D; Klopman, Gilles.

J Chem Inf Model ; 52(10): 2609-18, 2012 Oct 22.

Artigo em Inglês | MEDLINE | ID: mdl-22947043

RESUMO

Fragment based expert system models of toxicological end points are primarily comprised of a set of substructures that are statistically related to the toxic property in question. These special substructures are often referred to as toxicity alerts, toxicophores, or biophores. They are the main building blocks/classifying units of the model, and it is important to define the chemical structural space within which the alerts are expected to produce reliable predictions. Furthermore, defining an appropriate applicability domain is required as part of the OECD guidelines for the validation of quantitative structure-activity relationships (QSARs). In this respect, this paper describes a method to construct applicability domains for individual toxicity alerts that are part of the CASE Ultra expert system models. Defining applicability domain for individual alerts was necessary because each CASE Ultra model is comprised of multiple alerts, and different alerts of a model usually represent different toxicity mechanisms and cover different structural space; the use of an applicability domain for the overall model is often not adequate. The domain for each alert was constructed using a set of fragments that were found to be statistically related to the end point in question as opposed to using overall structural similarity or physicochemical properties. Use of the applicability domains in reducing false positive predictions is demonstrated. It is now possible to obtain ROC (receiver operating characteristic) profiles of CASE Ultra models by applying domain adherence cutoffs on the alerts identified in test chemicals. This helps in optimizing the performance of a model based on their true positive-false positive prediction trade-offs and reduce drastic effects on the predictive performance caused by the active/inactive ratio of the model's training set. None of the major currently available commercial expert systems for toxicity prediction offer the possibility to explore a model's full range of sensitivity-specificity spectrum, and therefore, the methodology developed in this study can be of benefit in improving the predictive ability of the alert based expert systems.

Assuntos

Produtos Biológicos/química , Produtos Biológicos/toxicidade , Mutagênicos/química , Mutagênicos/toxicidade , Relação Quantitativa Estrutura-Atividade , Animais , Aspergillus/efeitos dos fármacos , Aspergillus/genética , Simulação por Computador , Bases de Dados de Compostos Químicos , Drosophila melanogaster/efeitos dos fármacos , Drosophila melanogaster/genética , Modelos Moleculares , Estrutura Molecular , Mutação , Neurospora crassa/efeitos dos fármacos , Neurospora crassa/genética , Curva ROC , Saccharomyces cerevisiae/efeitos dos fármacos , Saccharomyces cerevisiae/genética , Salmonella typhimurium/efeitos dos fármacos , Salmonella typhimurium/genética

8.

A structural analysis of the differential cytotoxicity of chemicals in the NCI-60 cancer cell lines.

Chakravarti, Suman K; Klopman, Gilles.

Bioorg Med Chem ; 16(7): 4052-63, 2008 Apr 01.

Artigo em Inglês | MEDLINE | ID: mdl-18243714

RESUMO

The primary functions of cancer chemotherapeutic agents are not only to inhibit the growth or kill the cancer cells, but to do so without eliciting unreasonable cytotoxic effects on the healthy cells and to withstand the ability of the cancer cells to develop resistance against it. This has unfortunately been proven so far to be a very difficult objective. In this perspective, the ability of small molecules (anti-tumor agents) to 'see' different cell types differently can be a key attribute. Thus the term 'differential cytotoxicity' is normally used to describe the drug's specificity. In the present paper, we have quantified differential cytotoxicity from a study of the chemicals tested in the National Cancer Institute's Developmental Therapeutics Program. The MULTICASE (Multiple Computer Automated Structure Evaluation) methodology was used to discover statistically significant structural fragments (biophores) related to the differential cytotoxicity of the compounds. We found that even small structural features often become important in this regard which is evident from the biophores that were found in structurally diverse chemicals. By utilizing the difference between the raw and normalized differential cytotoxicity indices, we found that the alpha,beta-unsaturated carbonyl group (O=C-C=CH(2)) is the major biophore associated with compounds with essentially parallel concentration profiles in the cell lines in question. These compounds have high non-normalized differential cytotoxicity but considerably low normalized differential cytotoxocity. The models developed were cross validated for their predictive ability.

Assuntos

Antineoplásicos/química , Antineoplásicos/toxicidade , Linhagem Celular Tumoral , Sobrevivência Celular/efeitos dos fármacos , Ensaios de Seleção de Medicamentos Antitumorais , Humanos , Estrutura Molecular , National Cancer Institute (U.S.) , Estados Unidos

9.

ESP: a method to predict toxicity and pharmacological properties of chemicals using multiple MCASE databases.

Klopman, Gilles; Chakravarti, Suman K; Zhu, Hao; Ivanov, Julian M; Saiakhov, Roustem D.

J Chem Inf Comput Sci ; 44(2): 704-15, 2004.

Artigo em Inglês | MEDLINE | ID: mdl-15032553

RESUMO

We describe here the development of a computer program which uses a new method called Expert System Prediction (ESP), to predict toxic end points and pharmacological properties of chemicals based on multiple modules created by the MCASE artificial intelligence system. The modules are generally based on different biological models measuring related end points. The purpose is to improve the decision making process regarding the overall activity or inactivity of the chemicals and also to enable rapid in silico screening. ESP evaluates the significance of the biophores from a different viewpoint and uses this information for predicting the activity of new chemicals. We have used a unique encoding system to represent relevant features of a chemical in the form of a pattern vector and a genetic artificial neural network (GA-ANN) to gain knowledge of the relationship between these patterns and the overall pharmacological property. The effectiveness of ESP is illustrated in the prediction of general carcinogenicity of a diverse set of chemicals using MCASE male/female rats and mice carcinogenicity modules.

Assuntos

Sistemas Inteligentes , Farmacologia , Testes de Toxicidade , Algoritmos , Animais , Carcinógenos/química , Carcinógenos/toxicidade , Bases de Dados Genéticas , Feminino , Hidrazinas/química , Hidrazinas/toxicidade , Masculino , Camundongos , Nitrosaminas/química , Nitrosaminas/toxicidade , Compostos Nitrosos/química , Compostos Nitrosos/toxicidade , Valor Preditivo dos Testes , Relação Quantitativa Estrutura-Atividade , Ratos , Roedores , Software

10.

Structure-activity relationship study of a diverse set of estrogen receptor ligands (I) using MultiCASE expert system.

Klopman, Gilles; Chakravarti, Suman K.

Chemosphere ; 51(6): 445-59, 2003 May.

Artigo em Inglês | MEDLINE | ID: mdl-12615096

RESUMO

The MultiCASE expert system was used to construct a quantitative structure-activity relationship model to screen chemicals with estrogen receptor (ER) binding potential. Structures and ER binding data of 313 chemicals were used as inputs to train the expert system. The training data set covers inactive, weak as well as very powerful ER binders and represents a variety of chemical compounds. Substructural features associated with ER binding activity (biophores) and features that prevent receptor binding (biophobes) were identified. Although a single phenolic hydroxyl group was found to be the most important biophore responsible for the estrogenic activity of most of the chemicals, MultiCASE also identified other biophores and structural features that modulate the activity of the chemicals. Furthermore, the findings supported our previous hypothesis that a 6 A distant descriptor may describe a ligand-binding site on an ER. Quantitative structure-activity relationship models for the chemicals associated with each biophore were constructed as part of the expert system and can be used to predict the activity of new chemicals. The model was cross validated via 10 x 10%-off tests, giving an average concordance of 84.04%.

Assuntos

Sistema Endócrino/efeitos dos fármacos , Receptores de Estrogênio/efeitos dos fármacos , Receptores de Estrogênio/metabolismo , Animais , Avaliação Pré-Clínica de Medicamentos , Humanos , Ligantes , Fenóis/farmacologia , Relação Estrutura-Atividade , Xenobióticos/farmacologia

11.

Screening of high production volume chemicals for estrogen receptor binding activity (II) by the MultiCASE expert system.

Klopman, Gilles; Chakravarti, Suman K.

Chemosphere ; 51(6): 461-8, 2003 May.

Artigo em Inglês | MEDLINE | ID: mdl-12615097

RESUMO

A structurally and functionally diverse and cross-validated quantitative structure-activity knowledge database generated by the MultiCASE expert system was used to screen 2526 high production volume chemicals (HPVCs) for their estrogen receptor binding activity. 73 HPVCs were found to contain structural features or biophores that have been documented as having the ability to bind to the estrogen receptor. Potential chemicals were ranked according to their quantitatively predicted ER binding potential and the details of the biophores found in them are discussed.

Assuntos

Inteligência Artificial , Sistema Endócrino/efeitos dos fármacos , Receptores de Estrogênio/efeitos dos fármacos , Receptores de Estrogênio/metabolismo , Xenobióticos/efeitos adversos , Animais , Sítios de Ligação , Bases de Dados Factuais , Previsões , Humanos , Relação Estrutura-Atividade

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA