Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
Add more filters










Database
Language
Publication year range
1.
J Mol Graph Model ; 129: 108734, 2024 06.
Article in English | MEDLINE | ID: mdl-38442440

ABSTRACT

Application of Artificial intelligence (AI) in drug discovery has led to several success stories in recent times. While traditional methods mostly relied upon screening large chemical libraries for early-stage drug-design, de novo design can help identify novel target-specific molecules by sampling from a much larger chemical space. Although this has increased the possibility of finding diverse and novel molecules from previously unexplored chemical space, this has also posed a great challenge for medicinal chemists to synthesize at least some of the de novo designed novel molecules for experimental validation. To address this challenge, in this work, we propose a novel forward synthesis-based generative AI method, which is used to explore the synthesizable chemical space. The method uses a structure-based drug design framework, where the target protein structure and a target-specific seed fragment from co-crystal structures can be the initial inputs. A random fragment from a purchasable fragment library can also be the input if a target-specific fragment is unavailable. Then a template-based forward synthesis route prediction and molecule generation is performed in parallel using the Monte Carlo Tree Search (MCTS) method where, the subsequent fragments for molecule growth can again be obtained from a purchasable fragment library. The rewards for each iteration of MCTS are computed using a drug-target affinity (DTA) model based on the docking pose of the generated reaction intermediates at the binding site of the target protein of interest. With the help of the proposed method, it is now possible to overcome one of the major obstacles posed to the AI-based drug design approaches through the ability of the method to design novel target-specific synthesizable molecules.


Subject(s)
Artificial Intelligence , Drug Discovery , Drug Discovery/methods , Drug Design , Proteins/chemistry , Small Molecule Libraries/chemistry
2.
J Chem Inf Model ; 63(16): 5066-5076, 2023 08 28.
Article in English | MEDLINE | ID: mdl-37585609

ABSTRACT

Generative artificial intelligence algorithms have shown to be successful in exploring large chemical spaces and designing novel and diverse molecules. There has been considerable interest in developing predictive models using artificial intelligence for drug-like properties, which can potentially reduce the late-stage attrition of drug candidates or predict the properties of novel AI-designed molecules. Concurrently, it is important to understand the contribution of functional groups toward these properties and modify them to obtain property-optimized lead compounds. As a result, there is an increasing interest in the development of explainable property prediction models. However, current explainable approaches are mostly atom-based, where, often, only a fraction of a fragment is shown to be significant. To address the above challenges, we have developed a novel domain-aware molecular fragmentation approach termed post-processing of BRICS (pBRICS), which can fragment small molecules into their functional groups. Multitask models were developed to predict various properties, including the absorption, distribution, metabolism, excretion, and toxicity (ADMET) properties. The fragment importance was explained using the gradient-weighted class activation mapping (Grad-CAM) approach. The method was validated on data sets of experimentally available matched molecular pairs (MMPs). The explanations from the model can be useful for medicinal chemists to identify the fragments responsible for poor drug-like properties and optimize the molecule. The explainability approach was also used to identify the reason behind false positive and false negative MMP predictions. Based on evidence from the existing literature and our analysis, some of these mispredictions were justified. We propose that the quantity, quality, and diversity of the training data will improve the accuracy of property prediction algorithms for novel molecules.


Subject(s)
Algorithms , Artificial Intelligence
3.
ACS Infect Dis ; 9(3): 459-469, 2023 03 10.
Article in English | MEDLINE | ID: mdl-36790094

ABSTRACT

Emergence of novel zoonotic infections among the human population has increased the burden on global healthcare systems to curb their spread. To meet the evolutionary agility of pathogens, it is essential to revamp the existing diagnostic methods for early detection and characterization of the pathogens at the molecular level. Padlock probes (PLPs), which can leverage the power of isothermal nucleic acid amplification techniques (NAAT) such as rolling circle amplification (RCA), are known for their high sensitivity and specificity in detecting a diverse pathogen panel of interest. However, due to the complexity involved in deciding the target regions for PLP design and the need for optimization of multiple experimental parameters, the applicability of RCA has been limited in point-of-care testing for pathogen detection. To address this gap, we have developed a novel and integrated PLP design pipeline named AutoPLP, which can automate the probe design process for a diverse pathogen panel of interest. The pipeline is composed of three modules which can perform sequence data curation, multiple sequence alignment, conservation analysis, filtration based on experimental parameters (Tm, GC content, and secondary structure formation), and in silico probe validation via potential cross-hybridization check with host genome. The modules can also take into account the backbone and restriction site information, appropriate combinations of which are incorporated along with the probe arms to design a complete probe sequence. The potential applications of AutoPLP are showcased through the design of PLPs for the detection of rabies virus and drug-resistant strains of Mycobacterium tuberculosis.


Subject(s)
Mycobacterium tuberculosis , Humans , Base Sequence , Mycobacterium tuberculosis/genetics
4.
J Mol Graph Model ; 118: 108361, 2023 01.
Article in English | MEDLINE | ID: mdl-36257148

ABSTRACT

Mycobacterium tuberculosis (Mtb) is a pathogen of major concern due to its ability to withstand both first- and second-line antibiotics, leading to drug resistance. Thus, there is a critical need for identification of novel anti-tuberculosis agents targeting Mtb-specific proteins. The ceaseless search for novel antimicrobial agents to combat drug-resistant bacteria can be accelerated by the development of advanced deep learning methods, to explore both existing and uncharted regions of the chemical space. The adaptation of deep learning methods to under-explored pathogens such as Mtb is a challenging aspect, as most of the existing methods rely on the availability of sufficient target-specific ligand data to design novel small molecules with optimized bioactivity. In this work, we report the design of novel anti-tuberculosis agents targeting the Mtb chorismate mutase protein using a structure-based drug design algorithm. The structure-based deep learning method relies on the knowledge of the target protein's binding site structure alone for conditional generation of novel small molecules. The method eliminates the need for curation of a high-quality target-specific small molecule dataset, which remains a challenge even for many druggable targets, including Mtb chorismate mutase. Novel molecules are proposed, that show high complementarity to the target binding site. The graph attention model could identify the probable key binding site residues, which influenced the conditional molecule generator to design new molecules with pharmacophoric features similar to the known inhibitors.


Subject(s)
Deep Learning , Mycobacterium tuberculosis , Antitubercular Agents/chemistry , Mycobacterium tuberculosis/metabolism , Chorismate Mutase/metabolism , Drug Design
5.
Future Med Chem ; 14(20): 1441-1453, 2022 Oct.
Article in English | MEDLINE | ID: mdl-36169035

ABSTRACT

Aim: In the early stages of drug discovery, various experimental and computational methods are used to measure the specificity of small molecules against a target protein. The selectivity of small molecules remains a challenge leading to off-target side effects. Methods: We have developed a multitask deep learning model for predicting the selectivity on closely related homologs of the target protein. The model has been tested on the Janus-activated kinase and dopamine receptor families of proteins. Results & conclusion: The feature-based representation (extended connectivity fingerprint 4) with Extreme Gradient Boosting performed better when compared with deep neural network models in most of the evaluation metrics. Both the Extreme Gradient Boosting and deep neural network models outperformed the graph-based models. Furthermore, to decipher the model decision on selectivity, the important fragments associated with each homologous protein were identified.


Subject(s)
Machine Learning , Neural Networks, Computer , Proteins , Drug Discovery/methods , Receptors, Dopamine
6.
J Chem Inf Model ; 62(11): 2685-2695, 2022 06 13.
Article in English | MEDLINE | ID: mdl-35581002

ABSTRACT

The aim of drug design and development is to produce a drug that can inhibit the target protein and possess a balanced physicochemical and toxicity profile. Traditionally, this is a multistep process where different parameters such as activity and physicochemical and pharmacokinetic properties are optimized sequentially, which often leads to high attrition rate during later stages of drug design and development. We have developed a deep learning-based de novo drug design method that can design novel small molecules by optimizing target specificity as well as multiple parameters (including late-stage parameters) in a single step. All possible combinations of parameters were optimized to understand the effect of each parameter over the other parameters. An explainable predictive model was used to identify the molecular fragments responsible for the property being optimized. The proposed method was applied against the human 5-hydroxy tryptamine receptor 1B (5-HT1B), a protein from the central nervous system (CNS). Various physicochemical properties specific to CNS drugs were considered along with the target specificity and blood-brain barrier permeability (BBBP), which act as an additional challenge for CNS drug delivery. The contribution of each parameter toward molecule design was identified by analyzing the properties of generated small molecules from optimization of all possible parameter combinations. The final optimized generative model was able to design similar inhibitors compared to known inhibitors of 5-HT1B. In addition, the functional groups of the generated small molecules that guide the BBBP predictive model were identified through feature attribution techniques.


Subject(s)
Central Nervous System , Drug Design , Blood-Brain Barrier/metabolism , Central Nervous System/metabolism , Central Nervous System Agents/chemistry , Central Nervous System Agents/pharmacokinetics , Humans , Pharmaceutical Preparations/metabolism
7.
J Chem Inf Model ; 62(21): 5100-5109, 2022 Nov 14.
Article in English | MEDLINE | ID: mdl-34792338

ABSTRACT

In recent years, deep learning-based methods have emerged as promising tools for de novo drug design. Most of these methods are ligand-based, where an initial target-specific ligand data set is necessary to design potent molecules with optimized properties. Although there have been attempts to develop alternative ways to design target-specific ligand data sets, availability of such data sets remains a challenge while designing molecules against novel target proteins. In this work, we propose a deep learning-based method, where the knowledge of the active site structure of the target protein is sufficient to design new molecules. First, a graph attention model was used to learn the structure and features of the amino acids in the active site of proteins that are experimentally known to form protein-ligand complexes. Next, the learned active site features were used along with a pretrained generative model for conditional generation of new molecules. A bioactivity prediction model was then used in a reinforcement learning framework to optimize the conditional generative model. We validated our method against two well-studied proteins, Janus kinase 2 (JAK2) and dopamine receptor D2 (DRD2), where we produce molecules similar to the known inhibitors. The graph attention model could identify the probable key active site residues, which influenced the conditional molecule generator to design new molecules with pharmacophoric features similar to the known inhibitors.


Subject(s)
Deep Learning , Ligands , Models, Molecular , Drug Design , Proteins
8.
J Chem Inf Model ; 61(2): 621-630, 2021 02 22.
Article in English | MEDLINE | ID: mdl-33491455

ABSTRACT

In the world plagued by the emergence of new diseases, it is essential that we accelerate the drug design process to develop new therapeutics against them. In recent years, deep learning-based methods have shown some success in ligand-based drug design. Yet, these methods face the problem of data scarcity while designing drugs against a novel target. In this work, the potential of deep learning and molecular modeling approaches was leveraged to develop a drug design pipeline, which can be useful for cases where there is limited or no availability of target-specific ligand datasets. Inhibitors of the homologues of the target protein were screened at the active site of the target protein to create an initial target-specific dataset. Transfer learning was used to learn the features of the target-specific dataset. A deep predictive model was utilized to predict the docking scores of newly designed molecules. Both these models were combined using reinforcement learning to design new chemical entities with an optimized docking score. The pipeline was validated by designing inhibitors against the human JAK2 protein, where none of the existing JAK2 inhibitors were used for training. The ability of the method to reproduce existing molecules from the validation dataset and design molecules with better binding energy demonstrates the potential of the proposed approach.


Subject(s)
Deep Learning , Drug Design , Catalytic Domain , Humans , Ligands , Proteins
9.
Mol Omics ; 15(6): 431-441, 2019 12 02.
Article in English | MEDLINE | ID: mdl-31631203

ABSTRACT

Malaria continues to be a major concern in developing countries despite continuous efforts to find a cure for the disease. Understanding the pathogenesis mechanism is necessary to identify more effective drug targets against malaria. Many years of experimental research have generated a large amount of data for the malarial parasite, Plasmodium falciparum. These data are useful to understand the importance of certain parasite proteins, but it often remains unclear how these proteins come together, interact with other proteins and carry out their function. Identification of all proteins involved in pathogenesis is an important step towards understanding the molecular mechanism of pathogenesis. In this study, dynamic stage-specific protein-protein interaction networks were created based on gene expression data during the parasite's intra-erythrocytic stages and static protein-protein interaction data. Using previously known proteins of a biological event as seed proteins, the random walk with restart (RWR) method was used on the dynamic protein-protein interaction networks to identify novel proteins related to that event. Two screening procedures namely, permutation test and GO enrichment test were performed to increase the reliability of the RWR predictions. The proposed method was first validated on Plasmodium falciparum proteins related to invasion, where it could reproduce the existing knowledge from a small set of seed proteins. It was then used to identify novel Maurer's clefts resident proteins, where it could identify 152 parasite proteins. We show that the current approach can annotate conserved proteins with unknown function. The predicted proteins can help build a mechanistic model for disease pathogenesis, which will be useful in identifying new drug targets.


Subject(s)
Erythrocytes/metabolism , Erythrocytes/parasitology , Host-Parasite Interactions , Malaria, Falciparum/metabolism , Malaria, Falciparum/parasitology , Plasmodium falciparum/physiology , Algorithms , Computational Biology/methods , High-Throughput Screening Assays , Humans , Protein Binding , Protein Interaction Mapping , Protein Interaction Maps , Protozoan Proteins , Reproducibility of Results
SELECTION OF CITATIONS
SEARCH DETAIL
...