Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 97
Filter
2.
J Comput Aided Mol Des ; 38(1): 22, 2024 May 16.
Article in English | MEDLINE | ID: mdl-38753096

ABSTRACT

Although the size of virtual libraries of synthesizable compounds is growing rapidly, we are still enumerating only tiny fractions of the drug-like chemical universe. Our capability to mine these newly generated libraries also lags their growth. That is why fragment-based approaches that utilize on-demand virtual combinatorial libraries are gaining popularity in drug discovery. These à la carte libraries utilize synthetic blocks found to be effective binders in parts of target protein pockets and a variety of reliable chemistries to connect them. There is, however, no data on the potential impact of the chemistries used for making on-demand libraries on the hit rates during virtual screening. There are also no rules to guide in the selection of these synthetic methods for production of custom libraries. We have used the SAVI (Synthetically Accessible Virtual Inventory) library, constructed using 53 reliable reaction types (transforms), to evaluate the impact of these chemistries on docking hit rates for 40 well-characterized protein pockets. The data shows that the virtual hit rates differ significantly for different chemistries with cross coupling reactions such as Sonogashira, Suzuki-Miyaura, Hiyama and Liebeskind-Srogl coupling producing the highest hit rates. Virtual hit rates appear to depend not only on the property of the formed chemical bond but also on the diversity of available building blocks and the scope of the reaction. The data identifies reactions that deserve wider use through increasing the number of corresponding building blocks and suggests the reactions that are more effective for pockets with certain physical and hydrogen bond-forming properties.


Subject(s)
Molecular Docking Simulation , Protein Binding , Proteins , Small Molecule Libraries , Small Molecule Libraries/chemistry , Small Molecule Libraries/pharmacology , Proteins/chemistry , Proteins/metabolism , Binding Sites , Drug Discovery/methods , Ligands , Drug Design , Humans
3.
Front Oncol ; 13: 1144153, 2023.
Article in English | MEDLINE | ID: mdl-37182134

ABSTRACT

STAT3 N-terminal domain is a promising molecular target for cancer treatment and modulation of immune responses. However, STAT3 is localized in the cytoplasm, mitochondria, and nuclei, and thus, is inaccessible to therapeutic antibodies. Its N-terminal domain lacks deep pockets on the surface and represents a typical "non-druggable" protein. In order to successfully identify potent and selective inhibitors of the domain, we have used virtual screening of billion structure-sized virtual libraries of make-on-demand screening samples. The results suggest that the expansion of accessible chemical space by cutting-edge ultra-large virtual compound databases can lead to successful development of small molecule drugs for hard-to-target intracellular proteins.

4.
J Chem Inf Model ; 62(9): 2009-2010, 2022 05 09.
Article in English | MEDLINE | ID: mdl-35527682

Subject(s)
Informatics
5.
J Chem Inf Model ; 62(9): 2021-2034, 2022 05 09.
Article in English | MEDLINE | ID: mdl-35421301

ABSTRACT

Designing new medicines more cheaply and quickly is tightly linked to the quest of exploring chemical space more widely and efficiently. Chemical space is monumentally large, but recent advances in computer software and hardware have enabled researchers to navigate virtual chemical spaces containing billions of chemical structures. This review specifically concerns collections of many millions or even billions of enumerated chemical structures as well as even larger chemical spaces that are not fully enumerated. We present examples of chemical libraries and spaces and the means used to construct them, and we discuss new technologies for searching huge libraries and for searching combinatorially in chemical space. We also cover space navigation techniques and consider new approaches to de novo drug design and the impact of the "autonomous laboratory" on synthesis of designed compounds. Finally, we summarize some other challenges and opportunities for the future.


Subject(s)
Drug Discovery , Small Molecule Libraries , Drug Design , Drug Discovery/methods , Small Molecule Libraries/chemistry , Small Molecule Libraries/pharmacology
6.
J Am Chem Soc ; 144(11): 4925-4941, 2022 03 23.
Article in English | MEDLINE | ID: mdl-35282679

ABSTRACT

Germline antibodies, the initial set of antibodies produced by the immune system, are critical for host defense, and information about their binding properties can be useful for designing vaccines, understanding the origins of autoantibodies, and developing monoclonal antibodies. Numerous studies have found that germline antibodies are polyreactive with malleable, flexible binding pockets. While insightful, it remains unclear how broadly this model applies, as there are many families of antibodies that have not yet been studied. In addition, the methods used to obtain germline antibodies typically rely on assumptions and do not work well for many antibodies. Herein, we present a distinct approach for isolating germline antibodies that involves immunizing activation-induced cytidine deaminase (AID) knockout mice. This strategy amplifies antigen-specific B cells, but somatic hypermutation does not occur because AID is absent. Using synthetic haptens, glycoproteins, and whole cells, we obtained germline antibodies to an assortment of clinically important tumor-associated carbohydrate antigens, including Lewis Y, the Tn antigen, sialyl Lewis C, and Lewis X (CD15/SSEA-1). Through glycan microarray profiling and cell binding, we demonstrate that all but one of these germline antibodies had high selectivity for their glycan targets. Using molecular dynamics simulations, we provide insights into the structural basis of glycan recognition. The results have important implications for designing carbohydrate-based vaccines, developing anti-glycan monoclonal antibodies, and understanding antibody evolution within the immune system.


Subject(s)
Antibodies, Monoclonal , Antigens, Tumor-Associated, Carbohydrate , Animals , Antibodies, Monoclonal/chemistry , Biomarkers, Tumor , Carbohydrates , Germ Cells , Mice , Mice, Knockout , Polysaccharides/chemistry
7.
J Chem Inf Model ; 61(2): 653-663, 2021 02 22.
Article in English | MEDLINE | ID: mdl-33533614

ABSTRACT

Computational methods to predict molecular properties regarding safety and toxicology represent alternative approaches to expedite drug development, screen environmental chemicals, and thus significantly reduce associated time and costs. There is a strong need and interest in the development of computational methods that yield reliable predictions of toxicity, and many approaches, including the recently introduced deep neural networks, have been leveraged towards this goal. Herein, we report on the collection, curation, and integration of data from the public data sets that were the source of the ChemIDplus database for systemic acute toxicity. These efforts generated the largest publicly available such data set comprising > 80,000 compounds measured against a total of 59 acute systemic toxicity end points. This data was used for developing multiple single- and multitask models utilizing random forest, deep neural networks, convolutional, and graph convolutional neural network approaches. For the first time, we also reported the consensus models based on different multitask approaches. To the best of our knowledge, prediction models for 36 of the 59 end points have never been published before. Furthermore, our results demonstrated a significantly better performance of the consensus model obtained from three multitask learning approaches that particularly predicted the 29 smaller tasks (less than 300 compounds) better than other models developed in the study. The curated data set and the developed models have been made publicly available at https://github.com/ncats/ld50-multitask, https://predictor.ncats.io/, and https://cactus.nci.nih.gov/download/acute-toxicity-db (data set only) to support regulatory and research applications.


Subject(s)
Deep Learning , Consensus , Databases, Factual , Neural Networks, Computer
8.
J Cheminform ; 12(1): 72, 2020 Dec 03.
Article in English | MEDLINE | ID: mdl-33292568

ABSTRACT

In the past two decades a lot of different formats for molecules and reactions have been created. These formats were mostly developed for the purposes of identifiers, representation, classification, analysis and data exchange. A lot of efforts have been made on molecule formats but only few for reactions where the endeavors have been made mostly by companies leading to proprietary formats. Here, we present ReactionCode: a new open-source format that allows one to encode and decode a reaction into multi-layer machine readable code, which aggregates reactants and products into a condensed graph of reaction (CGR). This format is flexible and can be used in a context of reaction similarity searching and classification. It is also designed for database organization, machine learning applications and as a new transform reaction language.

9.
Molecules ; 25(23)2020 Dec 02.
Article in English | MEDLINE | ID: mdl-33276504

ABSTRACT

Due to its antiangiogenic and anti-immunomodulatory activity, thalidomide continues to be of clinical interest despite its teratogenic actions, and efforts to synthesize safer, clinically active thalidomide analogs are continually underway. In this study, a cohort of 27 chemically diverse thalidomide analogs was evaluated for antiangiogenic activity in an ex vivo rat aorta ring assay. The protein cereblon has been identified as the target for thalidomide, and in silico pharmacophore analysis and molecular docking with a crystal structure of human cereblon were used to investigate the cereblon binding abilities of the thalidomide analogs. The results suggest that not all antiangiogenic thalidomide analogs can bind cereblon, and multiple targets and mechanisms of action may be involved.


Subject(s)
Adaptor Proteins, Signal Transducing/metabolism , Angiogenesis Inhibitors/pharmacology , Aorta/drug effects , Molecular Docking Simulation , Neovascularization, Physiologic/drug effects , Thalidomide/analogs & derivatives , Thalidomide/pharmacology , Ubiquitin-Protein Ligases/metabolism , Angiogenesis Inhibitors/chemistry , Animals , Computer Simulation , Humans , Male , Rats , Rats, Sprague-Dawley
10.
Sci Data ; 7(1): 384, 2020 11 11.
Article in English | MEDLINE | ID: mdl-33177514

ABSTRACT

We have made available a database of over 1 billion compounds predicted to be easily synthesizable, called Synthetically Accessible Virtual Inventory (SAVI). They have been created by a set of transforms based on an adaptation and extension of the CHMTRN/PATRAN programming languages describing chemical synthesis expert knowledge, which originally stem from the LHASA project. The chemoinformatics toolkit CACTVS was used to apply a total of 53 transforms to about 150,000 readily available building blocks (enamine.net). Only single-step, two-reactant syntheses were calculated for this database even though the technology can execute multi-step reactions. The possibility to incorporate scoring systems in CHMTRN allowed us to subdivide the database of 1.75 billion compounds in sets according to their predicted synthesizability, with the most-synthesizable class comprising 1.09 billion synthetic products. Properties calculated for all SAVI products show that the database should be well-suited for drug discovery. It is being made publicly available for free download from https://doi.org/10.35115/37n9-5738.

11.
J Chem Inf Model ; 60(7): 3336-3341, 2020 07 27.
Article in English | MEDLINE | ID: mdl-32539385

ABSTRACT

We have adopted and extended the CHMTRN language and used it for the knowledge base of a computer program to generate a large database of synthetically accessible, drug-like chemical structures, the Synthetically Accessible Virtual Inventory (SAVI) Database. CHMTRN is a powerful language originally developed in the LHASA (Logic and Heuristics Applied to Synthetic Analysis) project at Harvard University and used together with the chemical pattern description language, PATRAN, to describe chemical retro-reactions. The languages have proven to be useful beyond the design of retrosynthetic routes and have the potential for much wider use in chemistry; this paper describes CHMTRN and PATRAN as now reimplemented for the forward-synthetic SAVI project but able to describe both forward and retro-reactions.


Subject(s)
Combinatorial Chemistry Techniques , Software , Databases, Factual , Humans
12.
J Chem Inf Model ; 60(3): 1253-1275, 2020 03 23.
Article in English | MEDLINE | ID: mdl-32043883

ABSTRACT

We have collected 86 different transforms of tautomeric interconversions. Out of those, 54 are for prototropic (non-ring-chain) tautomerism, 21 for ring-chain tautomerism, and 11 for valence tautomerism. The majority of these rules have been extracted from experimental literature. Twenty rules, covering the most well-known types of tautomerism such as keto-enol tautomerism, were taken from the default handling of tautomerism by the chemoinformatics toolkit CACTVS. The rules were analyzed against nine differerent databases totaling over 400 million (non-unique) structures as to their occurrence rates, mutual overlap in coverage, and recapitulation of the rules' enumerated tautomer sets by InChI V.1.05, both in InChI's Standard and a Nonstandard version with the increased tautomer-handling options 15T and KET turned on. These results and the background of this study are discussed in the context of the IUPAC InChI Project tasked with the redesign of handling of tautomerism for an InChI version 2. Applying the rules presented in this paper would approximately triple the number of compounds in typical small-molecule databases that would be affected by tautomeric interconversion by InChI V2. A web tool has been created to test these rules at https://cactus.nci.nih.gov/tautomerizer.


Subject(s)
Cheminformatics , Databases, Factual
13.
J Chem Inf Model ; 60(3): 1090-1100, 2020 03 23.
Article in English | MEDLINE | ID: mdl-32027495

ABSTRACT

We report a database of tautomeric structures that contains 2819 tautomeric tuples extracted from 171 publications. Each tautomeric entry has been annotated with experimental conditions reported in the respective publication, plus bibliographic details, structural identifiers (e.g., NCI/CADD identifiers FICTS, FICuS, uuuuu, and Standard InChI), and chemical information (e.g., SMILES, molecular weight). The majority of tautomeric tuples found were pairs; the remaining 10% were triples, quadruples, or quintuples, amounting to a total number of structures of 5977. The types of tautomerism were mainly prototropic tautomerism (79%), followed by ring-chain (13%) and valence tautomerism (8%). The experimental conditions reported in the publications included about 50 pure solvents and 9 solvent mixtures with 26 unique spectroscopic or nonspectroscopic methods. 1H and 13C NMR were the most frequently used methods. A total of 77 different tautomeric transform rules (SMIRKS) are covered by at least one example tuple in the database. This database is freely available as a spreadsheet at https://cactus.nci.nih.gov/download/tautomer/.


Subject(s)
Isomerism , Databases, Factual , Magnetic Resonance Spectroscopy
14.
Bioinformatics ; 36(3): 978-979, 2020 02 01.
Article in English | MEDLINE | ID: mdl-31418763

ABSTRACT

MOTIVATION: Identification of new molecules promising for treatment of HIV-infection and HIV-associated disorders remains an important task in order to provide safer and more effective therapies. Utilization of prior knowledge by application of computer-aided drug discovery approaches reduces time and financial expenses and increases the chances of positive results in anti-HIV R&D. To provide the scientific community with a tool that allows estimating of potential agents for treatment of HIV-infection and its comorbidities, we have created a freely-available web-resource for prediction of relevant biological activities based on the structural formulae of drug-like molecules. RESULTS: Over 50 000 experimental records for anti-retroviral agents from ChEMBL database were extracted for creating the training sets. After careful examination, about seven thousand molecules inhibiting five HIV-1 proteins were used to develop regression and classification models with the GUSAR software. The average values of R2 = 0.95 and Q2 = 0.72 in validation procedure demonstrated the reasonable accuracy and predictivity of the obtained (Q)SAR models. Prediction of 81 biological activities associated with the treatment of HIV-associated comorbidities with 92% mean accuracy was realized using the PASS program. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at http://www.way2drug.com/hiv/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
HIV Infections , HIV , Prednisolone , Software , Viral Proteins , Computer Simulation , HIV/genetics , HIV Infections/drug therapy , Prednisolone/analogs & derivatives , Proteins , Structure-Activity Relationship
15.
Molecules ; 25(1)2019 Dec 25.
Article in English | MEDLINE | ID: mdl-31881687

ABSTRACT

Despite the achievements of antiretroviral therapy, discovery of new anti-HIV medicines remains an essential task because the existing drugs do not provide a complete cure for the infected patients, exhibit severe adverse effects, and lead to the appearance of resistant strains. To predict the interaction of drug-like compounds with multiple targets for HIV treatment, ligand-based drug design approach is widely applied. In this study, we evaluated the possibilities and limitations of (Q)SAR analysis aimed at the discovery of novel antiretroviral agents inhibiting the vital HIV enzymes. Local (Q)SAR models are based on the analysis of structure-activity relationships for molecules from the same chemical class, which significantly restrict their applicability domain. In contrast, global (Q)SAR models exploit data from heterogeneous sets of drug-like compounds, which allows their application to databases containing diverse structures. We compared the information for HIV-1 integrase, protease and reverse transcriptase inhibitors available in the EBI ChEMBL, NIAID HIV/OI/TB Therapeutics, and Clarivate Analytics Integrity databases as the sources for (Q)SAR training sets. Using the PASS and GUSAR software, we developed and validated a variety of (Q)SAR models, which can be further used for virtual screening of new antiretrovirals in the SAVI library. The developed models are implemented in the freely available web resource AntiHIV-Pred.


Subject(s)
Anti-HIV Agents/pharmacology , HIV-1/metabolism , Quantitative Structure-Activity Relationship , Viral Proteins/antagonists & inhibitors , Anti-HIV Agents/chemistry , Databases as Topic , HIV-1/drug effects , Humans , Inhibitory Concentration 50 , Regression Analysis , Reproducibility of Results , Viral Proteins/metabolism
16.
Cell Death Dis ; 10(10): 689, 2019 09 18.
Article in English | MEDLINE | ID: mdl-31534138

ABSTRACT

The C-terminal binding protein (CtBP) is an NADH-dependent dimeric family of nuclear proteins that scaffold interactions between transcriptional regulators and chromatin-modifying complexes. Its association with poor survival in several cancers implicates CtBP as a promising target for pharmacological intervention. We employed computer-assisted drug design to search for CtBP inhibitors, using quantitative structure-activity relationship (QSAR) modeling and docking. Functional screening of these drugs identified 4 compounds with low toxicity and high water solubility. Micro molar concentrations of these CtBP inhibitors produces significant de-repression of epigenetically silenced pro-epithelial genes, preferentially in the triple-negative breast cancer cell line MDA-MB-231. This epigenetic reprogramming occurs through eviction of CtBP from gene promoters; disrupted recruitment of chromatin-modifying protein complexes containing LSD1, and HDAC1; and re-wiring of activating histone marks at targeted genes. In functional assays, CtBP inhibition disrupts CtBP dimerization, decreases cell migration, abolishes cellular invasion, and improves DNA repair. Combinatorial use of CtBP inhibitors with the LSD1 inhibitor pargyline has synergistic influence. Finally, integrated correlation of gene expression in breast cancer patients with nuclear levels of CtBP1 and LSD1, reveals new potential therapeutic vulnerabilities. These findings implicate a broad role for this class of compounds in strategies for epigenetically targeted therapeutic intervention.


Subject(s)
Alcohol Oxidoreductases/genetics , Breast Neoplasms/genetics , DNA-Binding Proteins/genetics , Epigenesis, Genetic/genetics , Female , Humans
17.
J Chem Inf Model ; 59(9): 3635-3644, 2019 09 23.
Article in English | MEDLINE | ID: mdl-31453694

ABSTRACT

A lot of high quality data on the biological activity of chemical compounds are required throughout the whole drug discovery process: from development of computational models of the structure-activity relationship to experimental testing of lead compounds and their validation in clinics. Currently, a large amount of such data is available from databases, scientific publications, and patents. Biological data are characterized by incompleteness, uncertainty, and low reproducibility. Despite the existence of free and commercially available databases of biological activities of compounds, they usually lack unambiguous information about peculiarities of biological assays. On the other hand, scientific papers are the primary source of new data disclosed to the scientific community for the first time. In this study, we have developed and validated a data-mining approach for extraction of text fragments containing description of bioassays. We have used this approach to evaluate compounds and their biological activity reported in scientific publications. We have found that categorization of papers into relevant and irrelevant may be performed based on the machine-learning analysis of the abstracts. Text fragments extracted from the full texts of publications allow their further partitioning into several classes according to the peculiarities of bioassays. We demonstrate the applicability of our approach to the comparison of the endpoint values of biological activity and cytotoxicity of reference compounds.


Subject(s)
Data Mining/methods , Drug Discovery/methods , Databases, Factual , HIV Infections/drug therapy , HIV Reverse Transcriptase/antagonists & inhibitors , HIV-1/drug effects , HIV-1/enzymology , Humans , PubMed , Reverse Transcriptase Inhibitors/pharmacology
18.
Curr Opin Struct Biol ; 58: 53-58, 2019 10.
Article in English | MEDLINE | ID: mdl-31233975

ABSTRACT

Subatomic resolution macromolecular crystallography has been revealing the most fascinating details of macromolecular structures for many years. This most extreme form of macromolecular crystallography is going through rapid changes. A new generation of superbrilliant X-ray sources and detectors is facilitating the rapid acquisition of high-quality datasets. Equally important, a new breed of methods and highly integrated advanced computational tools for structure refinement and analysis is poised to change the way we use subatomic resolution data and reposition high-resolution macromolecular crystallography in medicinal chemistry studies. Subatomic resolution macromolecular crystallography may soon be a routine source of detailed molecular information besides precise geometries, including binding energies and other chemical descriptors, opening new possibilities of application.


Subject(s)
Crystallography, X-Ray/methods , Macromolecular Substances/chemistry , Solvents/chemistry
19.
Eur J Pharm Sci ; 111: 337-348, 2018 Jan 01.
Article in English | MEDLINE | ID: mdl-29037996

ABSTRACT

Novel piperidinyl-based sulfamide derivatives were designed and synthesized through various synthetic routes. Anticancer activities of these sulfamides were evaluated by phenotypic screening on National Cancer Institute's 60 human tumor cell lines (NCI-60). Preliminary screening at 10µM concentration showed that piperidinyl sulfamide aminoester 26 (NSC 749204) was sensitive to most of the cell lines in the panel. Further dose-response studies showed that 26 was highly selective for inhibition of colon cancer cell lines with minimum GI50=1.88µM for COLO-205 and maximum GI50=11.1µM for SW-620 cells. These newly synthesized sulfamides were also screening for their Tdp1 inhibition activity. Compound 18 (NSC 750706) showed significant inhibition of Tdp1 with IC50=23.7µM. Molecular-docking studies showed that 18 bind to Tdp1 in its binding pocket similar to a known Tdp1 inhibitor.


Subject(s)
Antineoplastic Agents/chemical synthesis , Antineoplastic Agents/pharmacology , Phosphodiesterase Inhibitors/chemical synthesis , Phosphodiesterase Inhibitors/pharmacology , Phosphoric Diester Hydrolases/metabolism , Antineoplastic Agents/chemistry , Binding Sites , Cell Line, Tumor , Humans , Models, Molecular , Molecular Structure , Phosphodiesterase Inhibitors/chemistry , Phosphoric Diester Hydrolases/chemistry , Protein Conformation , Structure-Activity Relationship
20.
Medchemcomm ; 9(12): 2000-2007, 2018 Dec 01.
Article in English | MEDLINE | ID: mdl-30647878

ABSTRACT

Non-B DNA structures represent intriguing and challenging targets for small molecules. For example, the promoter of the HRAS oncogene contains multiple G-quadruplex and i-motif structures, atypical globular folds that serve as molecular switches for gene expression. Of the two, i-motif structures are far less studied. Here, we report the first example of small organic compounds that directly interact with the hras-1Y i-motif. We use a small molecule microarray screen to identify drug-like small molecules that bind to the hras-1Y i-motif but not to several other DNA or RNA secondary structures. Two different lead compounds, 1 and 2, were discovered to have 7.4 ± 5.3 µM and 5.9 ± 3.7 µM binding affinity by surface plasmon resonance and similar affinity by fluorescence titration. A structure-activity relationship (SAR) was developed and two improved analogues of 2 demonstrated submicromolar binding affinities. Both compounds display pH-dependent binding, indicating that they interact with the DNA only when the i-motif is properly folded. Chemical shift perturbation shows that 1 alters the structure of the i-motif, while 2 has no effect on the i-motif conformation, indicating different modes of interaction.

SELECTION OF CITATIONS
SEARCH DETAIL
...